---
id: pattern discovery
aliases:
  - What is Pattern Discovery?
tags: []
---

## What is Pattern Discovery?

- ==What are patterns?==
  - ==Patterns==: A set of items, subsequences, or substructures that occur
    frequently together (or are strongly correlated) in a data set
  - Patterns represent ==intrinsic== and ==important properties== of data sets
- ==Pattern discovery==: Uncovering patterns from massive data sets
- Motivating examples:
  - What products are often purchased together?
  - What do customers typically purchase after buying an iPad?
  - What code segments likely contain copy-and-paste bugs?
  - What word sequences likely form phrases in this corpus?

![[CleanShot 2023-10-26 at 01.53.56@2x.png]]
![[CleanShot 2023-10-26 at 01.54.32@2x.png]]
![[CleanShot 2023-10-26 at 01.54.44@2x.png]]
![[CleanShot 2023-10-26 at 01.55.00@2x.png]]
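
To make "items that occur frequently together" concrete, here is a minimal sketch that counts how often each pair of items appears in the same transaction and keeps the pairs that meet a minimum count. The function name `frequent_pairs`, the `min_support` parameter, and the toy shopping baskets are all assumptions made for illustration. This is a brute-force count, not the Apriori or FP-Growth method.

```python
from collections import Counter
from itertools import combinations


def frequent_pairs(transactions, min_support):
    """Return item pairs that co-occur in at least `min_support` transactions."""
    counts = Counter()
    for items in transactions:
        # Deduplicate and sort so each pair is counted once per transaction
        # and always appears in the same (alphabetical) order.
        for pair in combinations(sorted(set(items)), 2):
            counts[pair] += 1
    return {pair: n for pair, n in counts.items() if n >= min_support}


# Hypothetical toy data: each inner list is one shopping basket.
transactions = [
    ["bread", "milk"],
    ["bread", "diapers", "beer", "eggs"],
    ["milk", "diapers", "beer", "cola"],
    ["bread", "milk", "diapers", "beer"],
    ["bread", "milk", "diapers", "cola"],
]

result = frequent_pairs(transactions, min_support=3)
for pair, n in sorted(result.items()):
    print(pair, n)
```

With a threshold of 3 out of 5 baskets, four pairs survive, for example `("beer", "diapers")`. Enumerating every pair in every transaction becomes expensive on large data sets, which is the cost that efficient pattern mining methods aim to reduce.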

## Efficient Pattern Mining Methods

- The [[Apriori]] Algorithm
- [[FP-Growth]]: A Frequent Pattern-Growth Approach