Exploring the data

Data Literacy Case Study: Remote Working Analysis

Maarten Van den Broeck

Senior Content Developer at DataCamp

Exploratory data analysis

  • Why?
    • Assessing the main characteristics of the data
    • Finding relationships, patterns, or groups
    • Suggesting questions or hypotheses for future analysis
  • How?
    • Describing the data numerically
    • Visualizing the data

Person looking at data with magnifier

Example: age characteristics

  • What are the main age characteristics of the respondents?
    • Mean age: 43.77 (median: 43)
    • Youngest: 21
    • Oldest: 66

Descriptive statistics of age

Statistic   Age
Mean        43.77
Std. dev.   11.84
Min         21.00
Median      43.00
Max         66.00
N valid     1512
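
A minimal sketch of how such a summary could be computed with Python's standard library. The `ages` list is made up for illustration and is not the survey data, and `describe` is a hypothetical helper name. The slide does not say whether its standard deviation is the sample or the population version; the sketch assumes the sample version.

```python
import statistics

# Hypothetical ages for illustration only; not the survey data.
ages = [21, 30, 35, 43, 43, 50, 58, 66]


def describe(values):
    """Return the same kinds of summary statistics as the slide's table."""
    return {
        "mean": statistics.mean(values),
        "std_dev": statistics.stdev(values),  # sample standard deviation (assumption)
        "min": min(values),
        "median": statistics.median(values),
        "max": max(values),
        "n_valid": len(values),
    }


summary = describe(ages)
for name, value in summary.items():
    print(f"{name:8s} {value:.2f}")
```

Comparing the mean with the median is a quick check for skew: when the two are close, as in the table above, the distribution is unlikely to be strongly lopsided.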

Are all age groups represented?

Histogram of age
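
A histogram groups values into bins and counts how many fall into each one. The sketch below shows that idea as a text histogram, assuming 10-year bins and a made-up `ages` list (not the survey data); `bin_counts` and `BIN_WIDTH` are hypothetical names.

```python
from collections import Counter

BIN_WIDTH = 10  # assumed bin width in years

# Hypothetical ages for illustration only; not the survey data.
ages = [21, 24, 30, 35, 38, 43, 43, 47, 50, 58, 61, 66]


def bin_counts(values, width=BIN_WIDTH):
    """Count values per bin; each bin is keyed by its lower edge (20 covers 20-29)."""
    return Counter((v // width) * width for v in values)


counts = bin_counts(ages)
for lower in sorted(counts):
    upper = lower + BIN_WIDTH - 1
    print(f"{lower}-{upper}: {'#' * counts[lower]}")
```

A bin that is empty or much shorter than its neighbors would point to an age group that is under-represented among the respondents.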

Are all age groups represented?

Histogram with right side highlighted

Example: remote frequency vs. preference

Count plot of remote frequency vs remote preference
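
The counts behind a plot like this can be sketched as a tally of (current frequency, preferred frequency) pairs. The category labels and responses below are invented for illustration; the slides do not list the actual survey categories.

```python
from collections import Counter

# Hypothetical (current frequency, preferred frequency) pairs; not the survey data.
responses = [
    ("never", "never"),
    ("never", "often"),
    ("rarely", "rarely"),
    ("rarely", "often"),
    ("often", "often"),
    ("often", "often"),
]

# Tally how many respondents gave each combination of answers.
pair_counts = Counter(responses)

for (current, preferred), n in sorted(pair_counts.items()):
    print(f"current={current:7s} preferred={preferred:7s} count={n}")
```

Each distinct pair corresponds to one bar (or cell) in the count plot, and its count is the bar's height.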

Example: remote frequency vs. preference

Count plot with highlighted diagonal
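
Assuming the highlighted diagonal marks respondents whose current remote frequency equals their preferred frequency (the slide text does not state this), the share of such respondents could be computed as below. The data and the `match_share` helper are hypothetical.

```python
# Hypothetical (current frequency, preferred frequency) pairs; not the survey data.
responses = [
    ("never", "never"),
    ("rarely", "rarely"),
    ("rarely", "often"),
    ("often", "often"),
]


def match_share(pairs):
    """Fraction of respondents whose current frequency equals their preference."""
    matches = sum(1 for current, preferred in pairs if current == preferred)
    return matches / len(pairs)


share = match_share(responses)
print(f"Share on the diagonal: {share:.0%}")
```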

Example: remote frequency vs. preference

Count plot with highlighted group

Let's practice!
