Clustering

Data Science for Business

Ramnath Vaidyanathan

VP of Product Research, DataCamp

What is clustering?

clustering.jpg

  • Divide data into categories
  • Use cases
    • Customer segmentation
    • Image segmentation
    • Anomaly detection
Data Science for Business

Supervised Machine Learning

 

supervised-learning.jpg

Unsupervised Machine Learning

 

unsupervised-learning.jpg

Data Science for Business

Case study: customer segmentation

customer-segmentation-1.jpg

Data Science for Business

Case study: customer segmentation

Define features

  • Number of flights in the past year
  • Percent international
  • Advanced planning
  • Percent business class

airplane.jpg

Data Science for Business

Case study: customer segmentation

  • Define number of clusters

cluster-data.png

Data Science for Business

Case study: customer segmentation

two-cluster.png

three-cluster.png

Data Science for Business

Clustering review

Definition

  • Divide unlabeled dataset into different categories

Steps

  • Select features
  • Select number of clusters
  • Use clusters to solve business problems
Data Science for Business

Let's practice!

Data Science for Business

Preparing Video For Download...