summaryrefslogtreecommitdiff
path: root/SI/Resource/Fundamentals of Data Mining/Content/Compare and Contrast.md
blob: d72dd09a7fe4b56ba729cf32900625d6e98cdcdc (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
---
id: Compare and Contrast
aliases:
  - clustering algorithms
tags:
  - Compare-and-Contrast
---

## [[clustering algorithms]]

- [[K-Means]] vs [[K-Medoids]]
  - In _K-means_ algorithm, they choose means as the centroids but in the
    _K-medoids_, data points are chosen to be the medoids[^1].
- [[K-Means]] vs [[K-Medians]]

| K-Means                                                    | K-Medians                                     |
| ---------------------------------------------------------- | --------------------------------------------- |
| The center is not necessarily one of the input data points | Centers will be chosen from data points       |
| Not flexible                                               | More flexible                                 |
| Not immune to noise and outliers                           | More robust to noise and outliers             |
| Minimize the sum of squared Euclidian distance             | Minimize a sum of pairwise of dissimilarities |

[^1]:
    Medoids are **representative objects of a data set or a cluster within a
    data set whose sum of dissimilarities to all the objects in the cluster is
    minimal**. Medoids are similar in concept to means or centroids, but medoids are
    always restricted to be members of the data set.