Making sense of the clusters

Cluster Analysis in R

Dmitriy Gorenshteyn

Lead Data Scientist, Memorial Sloan Kettering Cancer Center

Wholesale dataset

  • 45 observations
  • 3 features:
    • Milk Spending
    • Grocery Spending
    • Frozen Food Spending
Cluster Analysis in R

Wholesale dataset

print(customers_spend)
    Milk Grocery Frozen
1  11103   12469    902
2   2013    6550    909
3   1897    5234    417
4   1304    3643   3045
5   3199    6986   1455
...  ...     ...    ...
Cluster Analysis in R

Exploring more than 2 dimensions

  • Plot 2 dimensions at a time
  • Visualize using PCA
  • Summary statistics by feature
Cluster Analysis in R

Segment the customers

Cluster Analysis in R

Preparing Video For Download...