Exploring the data

Data Literacy Case Study: Remote Working Analysis

Maarten Van den Broeck

Senior Content Developer at DataCamp

Exploratory data analysis

  • Why?
    • Assessing the main characteristics of the data
    • Finding relationships, patterns, or groups
    • Suggesting questions or hypotheses for future analysis
  • How?
    • Describing the data numerically
    • Visualizing the data

Person looking at data with magnifier

Example: age characteristics

  • What are the main age characteristics of the respondents?
    • Mean age: about 44 (43.77); median age: 43
    • Youngest: 21
    • Oldest: 66

Descriptive statistics of age

Statistic   Age
Mean        43.77
Std. dev.   11.84
Min         21.00
Median      43.00
Max         66.00
N valid     1512
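
A summary like the one above could be computed along these lines. This is a minimal sketch using only Python's standard library; the `ages` list and the `describe` helper are hypothetical and made up for illustration, not the survey data, and the sample standard deviation is an assumption because the slide does not say which variant was used.

```python
import statistics

# Hypothetical sample of respondent ages; None marks a missing answer.
# These values are made up for illustration and are not the survey data.
ages = [21, 30, None, 43, 43, 50, 66]


def describe(values):
    """Return basic descriptive statistics, ignoring missing values."""
    valid = [v for v in values if v is not None]
    return {
        "mean": statistics.mean(valid),
        "std_dev": statistics.stdev(valid),  # sample standard deviation (assumption)
        "min": min(valid),
        "median": statistics.median(valid),
        "max": max(valid),
        "n_valid": len(valid),  # number of non-missing answers
    }


summary = describe(ages)
for name, value in summary.items():
    print(f"{name:8s} {value:.2f}")
```

Counting only the non-missing answers is what an "N valid" row reports, which is why the helper filters out `None` before computing anything else.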

Are all age groups represented?

Histogram of age
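
One way to check whether every age group is represented is to bin the ages and look for empty bins. The sketch below is an illustration under stated assumptions: the ages are made up, the 10-year bin width is arbitrary, and `age_histogram` is a hypothetical helper rather than the method used for the slide.

```python
from collections import Counter

# Hypothetical ages, made up for illustration (not the survey data).
ages = [21, 25, 34, 38, 41, 43, 47, 52, 58, 66]
bin_width = 10  # arbitrary choice for this sketch


def age_histogram(values, width):
    """Count values per bin, keyed by the lower edge of each bin."""
    counts = Counter((v // width) * width for v in values)
    first, last = min(counts), max(counts)
    # Include empty bins so gaps in representation stay visible.
    return {start: counts.get(start, 0)
            for start in range(first, last + width, width)}


histogram = age_histogram(ages, bin_width)
empty_bins = [start for start, count in histogram.items() if count == 0]

# Text rendering of the histogram: one '#' per respondent.
for start, count in histogram.items():
    print(f"{start}-{start + bin_width - 1}: {'#' * count}")
```

An empty `empty_bins` list means every bin between the youngest and oldest respondent has at least one person in it; it says nothing about whether the bins are evenly filled.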

Are all age groups represented?

Histogram with right side highlighted

Example: remote frequency vs. preference

Count plot of remote frequency vs remote preference

Example: remote frequency vs. preference

Count plot with highlighted diagonal
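
The counts behind a plot like this can be sketched as a cross-tabulation of two categorical answers. In the example below, the category labels and responses are hypothetical and made up for illustration. Reading the diagonal as "current frequency matches preferred frequency" is an assumption based on the plot's axes.

```python
from collections import Counter

# Hypothetical category labels and responses, made up for illustration.
# Each pair is (current remote frequency, preferred remote frequency).
levels = ["never", "sometimes", "always"]
responses = [
    ("never", "sometimes"),
    ("never", "never"),
    ("sometimes", "sometimes"),
    ("sometimes", "always"),
    ("always", "always"),
    ("always", "always"),
]

# Count how often each (current, preferred) combination occurs.
pair_counts = Counter(responses)

# Diagonal cells: respondents whose current frequency equals their preference.
matching = sum(pair_counts[(level, level)] for level in levels)
share_matching = matching / len(responses)

# Print the table with current frequency as rows, preference as columns.
print(f"{'':10s}" + "".join(f"{p:>10s}" for p in levels))
for current in levels:
    row = "".join(f"{pair_counts[(current, p)]:>10d}" for p in levels)
    print(f"{current:10s}{row}")
print(f"Share on the diagonal: {share_matching:.2f}")
```

Cells off the diagonal are the ones worth a closer look, since they describe respondents whose current arrangement differs from what they would prefer.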

Example: remote frequency vs. preference

Count plot with highlighted group

Let's practice!
