Common data mistakes

Introduzione ai dati

Maarten Van den Broeck

Senior Content Developer at DataCamp

Common mistakes about data

An error while working with data

Introduzione ai dati

Common mistakes about data

  • Not having a clear goal or question

Icons representing an error while working with data due to a poorly defined problem

Introduzione ai dati

Common mistakes about data

  • Not having a clear goal or question
  • Insufficient or wrong data

Icons representing an error while working with data due to a poorly defined problem and wrong data

Introduzione ai dati

Common mistakes about data

  • Not having a clear goal or question
  • Insufficient or wrong data
  • Lack of appropriate analysis

Icons representing an error while working with data due to a poorly defined problem, and wrong data and statistics

Introduzione ai dati

Common mistakes about data

  • Not having a clear goal or question
  • Insufficient or wrong data
  • Lack of appropriate analysis
  • No clear communication of results

$$

Carefully plan the data analysis process

Icons representing an error while working with data due to a poorly defined problem, and wrong data, statistics, and communication

Introduzione ai dati

Not clearly defining the problem

"Did you buy anything in the last month?"

$$

"Where did you make your last purchase?"

"Which payment method did you use?"

May lead to inappropriate data collection, analysis, and conclusions

defining a data question

Introduzione ai dati

Insufficient or wrong data

wrong data

$$

$$

Data bias: the data sample doesn't represent all the data

  • Collecting the wrong data doesn't allow you to answer the research question
  • Data still needs cleaning before analysis
Introduzione ai dati

Lack of appropriate analysis

$$

  • Jumping to conclusions too quickly
  • Lack of context: a missing reason explaining the results
  • Other examples include
    • Incorrect aggregations and calculations
    • Confusing correlation with causation

poor data analysis

Introduzione ai dati

No clear communication of results

data communication

$$

  • Most valuable part of data life cycle
  • Could lead to misunderstandings or incorrect conclusions
  • Examples:
    • Too technical
    • Cherry-picking data points
    • Unclear visualizations
Introduzione ai dati

Let's practice!

Introduzione ai dati

Preparing Video For Download...