summaryrefslogtreecommitdiff
path: root/SI/Resource/Fundamentals of Data Mining/Content/pattern discovery.md
diff options
context:
space:
mode:
authorTheSiahxyz <164138827+TheSiahxyz@users.noreply.github.com>2024-04-29 22:06:12 -0400
committerTheSiahxyz <164138827+TheSiahxyz@users.noreply.github.com>2024-04-29 22:06:12 -0400
commit4d53fa14ee0cd615444aca6f6ba176e0ccc1b5be (patch)
tree4d9f0527d9e6db4f92736ead0aa9bb3f840a0f89 /SI/Resource/Fundamentals of Data Mining/Content/pattern discovery.md
init
Diffstat (limited to 'SI/Resource/Fundamentals of Data Mining/Content/pattern discovery.md')
-rw-r--r--SI/Resource/Fundamentals of Data Mining/Content/pattern discovery.md27
1 files changed, 27 insertions, 0 deletions
diff --git a/SI/Resource/Fundamentals of Data Mining/Content/pattern discovery.md b/SI/Resource/Fundamentals of Data Mining/Content/pattern discovery.md
new file mode 100644
index 0000000..f88fe80
--- /dev/null
+++ b/SI/Resource/Fundamentals of Data Mining/Content/pattern discovery.md
@@ -0,0 +1,27 @@
+---
+id: pattern discovery
+aliases:
+ - What is Pattern Discovery?
+tags: []
+---
+
+## What is Pattern Discovery?
+
+- ==What are patterns?==
+ - ==Patterns==: A set of items, subsequences, or substructures that occur
+ frequently together (or strongly correlated) in a data set
+ - Patterns represent ==intrinsic== and ==important properties== of datasets
+- ==Pattern discovery==: Uncovering patterns from massive data sets
+- Motivation examples:
+ - What products were often purchased together?
+ - What are the subsequent purchases after buying an iPad?
+ - What code segments likely contain copy-and-paste bugs?
+ - What word sequences likely form phrases in this corpus? ![[CleanShot
+2023-10-26 at 01.53.56@2x.png]] ![[CleanShot 2023-10-26 at 01.54.32@2x.png]]
+ ![[CleanShot 2023-10-26 at 01.54.44@2x.png]] ![[CleanShot 2023-10-26 at
+01.55.00@2x.png]]
+
+## Efficient Pattern Mining Methods
+
+- The [[Apriori]] Algorithm
+- [[FP-Growth]]: A Frequent Pattern-Growth Approach