Getting insights from the data

Data Literacy Case Study: Remote Working Analysis

Maarten Van den Broeck

Senior Content Developer at DataCamp

Cluster analysis

  • Common descriptive and exploratory technique
  • Goal: find naturally occurring groups in the data

  • Possible applications:

    • Customer segmentation
    • Making a classification
    • Identifying subgroups
  • Two main steps:

    • Finding the optimal number of groups
    • Investigating the characteristics of each group

Example cluster analysis

Finding the optimal number of groups

  • Calculation of the optimal solution
  • Domain knowledge: input from experts or the business
  • Not always an exact solution!

2 vs. 3 clusters
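
One way to make the "calculation" route concrete is to fit a clustering for several candidate group counts and compare how tight the groups are. The sketch below is only an illustration, not necessarily the method used in this course: it assumes k-means as the clustering technique, within-cluster sum of squares as the measure, and a made-up toy dataset. All function names are hypothetical.

```python
# Hypothetical toy data: three visually separated groups of 2-D points.
points = [
    (1, 1), (1, 2), (2, 1), (2, 2),
    (8, 8), (8, 9), (9, 8), (9, 9),
    (1, 8), (1, 9), (2, 8), (2, 9),
]


def sq_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def initial_centers(data, k):
    """Deterministic farthest-first start: each new center is the point
    farthest from the centers chosen so far."""
    centers = [data[0]]
    while len(centers) < k:
        centers.append(max(data, key=lambda p: min(sq_dist(p, c) for c in centers)))
    return centers


def kmeans(data, k, max_iter=100):
    """Plain k-means: assign points to the nearest center, then move each
    center to the mean of its points, until the assignment stops changing."""
    centers = initial_centers(data, k)
    labels = None
    for _ in range(max_iter):
        new_labels = [min(range(k), key=lambda j: sq_dist(p, centers[j])) for p in data]
        if new_labels == labels:
            break
        labels = new_labels
        for j in range(k):
            members = [p for p, lab in zip(data, labels) if lab == j]
            if members:  # keep the old center if a cluster ends up empty
                centers[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return labels, centers


def within_cluster_ss(data, labels, centers):
    """Sum of squared distances from each point to its cluster center."""
    return sum(sq_dist(p, centers[lab]) for p, lab in zip(data, labels))


# Compare candidate numbers of groups; lower means tighter clusters.
wcss = {}
for k in range(1, 6):
    labels, centers = kmeans(points, k)
    wcss[k] = within_cluster_ss(points, labels, centers)
    print(k, round(wcss[k], 2))
```

The value always shrinks as groups are added, so the number to look for is the point where the improvement levels off. On real data that point is often ambiguous, which is where domain knowledge comes in.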

Characteristics of the groups

  • How are they different?
  • Characteristics shared by a subset of groups
  • Variables of interest

Plot of counts for each cluster for satisfaction rate
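
Once every record carries a cluster label, the groups can be profiled by summarizing the variables of interest per cluster. The following is a minimal sketch using only the Python standard library. The rows, the remote-days variable, and the satisfaction levels are invented for illustration and do not come from the course dataset.

```python
from collections import Counter, defaultdict
from statistics import mean

# Hypothetical survey rows: (cluster label, remote days per week, satisfaction).
rows = [
    (0, 5, "high"), (0, 4, "high"), (0, 5, "medium"),
    (1, 1, "low"), (1, 0, "medium"), (1, 1, "low"),
    (2, 3, "high"), (2, 2, "medium"), (2, 3, "medium"),
]

remote_days = defaultdict(list)           # numeric variable, per cluster
satisfaction_counts = defaultdict(Counter)  # categorical variable, per cluster

for cluster, days, satisfaction in rows:
    remote_days[cluster].append(days)
    satisfaction_counts[cluster][satisfaction] += 1

# One line per cluster: average remote days and satisfaction counts.
for cluster in sorted(remote_days):
    print(
        cluster,
        round(mean(remote_days[cluster]), 2),
        dict(satisfaction_counts[cluster]),
    )
```

The per-cluster counts are the same numbers a bar chart of satisfaction by cluster would display, so a table like this is a quick check before plotting.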

Let's practice!
