Review K-means results

Cluster Analysis in R

Dmitriy (Dima) Gorenshteyn

Lead Data Scientist, Memorial Sloan Kettering Cancer Center

Three clustering results

oes_clusters

Comparing the two clustering methods

                           Hierarchical Clustering        k-means
Distance Used:             virtually any                  Euclidean only
Results Stable:            Yes                            No
Evaluating # of Clusters:  dendrogram, silhouette, elbow  silhouette, elbow
Computation Complexity:    Relatively Higher              Relatively Lower
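
The first two rows of the comparison can be illustrated with a small sketch in base R. The toy data below is made up for illustration and is not the course dataset. The sketch runs hierarchical clustering on a non-Euclidean (Manhattan) distance, checks that repeating it gives the same assignment, and then runs k-means from two different random starts.

```r
# Toy data (an assumption for illustration): 60 points around three centers
set.seed(42)
toy <- rbind(
  matrix(rnorm(40, mean = 0, sd = 0.5), ncol = 2),
  matrix(rnorm(40, mean = 3, sd = 0.5), ncol = 2),
  matrix(rnorm(40, mean = 6, sd = 0.5), ncol = 2)
)

# Hierarchical clustering works from a distance matrix, so any distance
# that dist() supports can be used; here Manhattan instead of Euclidean
d_manhattan <- dist(toy, method = "manhattan")
hc_a <- cutree(hclust(d_manhattan, method = "complete"), k = 3)
hc_b <- cutree(hclust(d_manhattan, method = "complete"), k = 3)
identical(hc_a, hc_b)  # same input, same assignment on every run

# k-means starts from randomly chosen centers, so two runs with
# different seeds can label (and sometimes group) the points differently
set.seed(1)
km_a <- kmeans(toy, centers = 3)
set.seed(2)
km_b <- kmeans(toy, centers = 3)
table(km_a$cluster, km_b$cluster)  # cross-tabulate the two runs
```

Passing `nstart` to `kmeans()` (for example `nstart = 20`) repeats the random initialization and keeps the best run, which usually reduces this run-to-run variation.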

What have you learned?

  • Chapter 1:
    • What is distance
    • Why is scale important
  • Chapter 2:
    • How linkage works
    • How the dendrogram is formed
    • How to analyze your clusters
  • Chapter 3:
    • How k-means works
    • How to estimate k
    • How to analyze how well an observation fits in a cluster
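
The three chapters can be strung together in one short sketch using only base R. The data frame `players` and its columns are hypothetical, chosen so that the two features sit on very different scales; this is one possible workflow, not the course's own solution code.

```r
# Hypothetical data: two groups, features on very different scales
set.seed(123)
players <- data.frame(
  height_cm = c(rnorm(15, 170, 3), rnorm(15, 190, 3)),
  salary    = c(rnorm(15, 40000, 2000), rnorm(15, 90000, 2000))
)

# Chapter 1: scale first so that salary does not dominate the distance
players_scaled <- scale(players)
d <- dist(players_scaled)  # Euclidean distance by default

# Chapter 2: linkage builds the dendrogram, cutree() extracts clusters
hc <- hclust(d, method = "complete")
hc_clusters <- cutree(hc, k = 2)
# plot(hc) would draw the dendrogram

# Chapter 3: k-means, with total within-cluster sum of squares
# computed for k = 1..6 as input to an elbow plot
tot_withinss <- sapply(1:6, function(k) {
  kmeans(players_scaled, centers = k, nstart = 10)$tot.withinss
})
km <- kmeans(players_scaled, centers = 2, nstart = 10)

# Compare the assignments from the two methods
table(hc_clusters, km$cluster)
```

Plotting `tot_withinss` against `1:6` gives the elbow plot. Silhouette widths, which measure how well each observation fits its cluster, are available through `silhouette()` in the `cluster` package.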

A lot more to learn

  • k-medoids
  • DBSCAN
  • OPTICS
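
As a starting point for the first item, here is a minimal sketch of k-medoids using `pam()` from the `cluster` package, which ships with standard R installations as a recommended package. The toy data is an assumption for illustration. Unlike k-means, each cluster center is an actual observation, and `pam()` accepts non-Euclidean metrics.

```r
library(cluster)

# Toy data (an assumption for illustration): two groups of 20 points
set.seed(7)
toy <- rbind(
  matrix(rnorm(40, mean = 0, sd = 0.5), ncol = 2),
  matrix(rnorm(40, mean = 4, sd = 0.5), ncol = 2)
)

# Partitioning Around Medoids with k = 2 and Manhattan distance
pam_fit <- pam(toy, k = 2, metric = "manhattan")

pam_fit$id.med             # row indices of the observations chosen as medoids
table(pam_fit$clustering)  # cluster sizes
pam_fit$silinfo$avg.width  # average silhouette width of the solution
```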

Congratulations!