Data as a resource

Introduzione ai dati

Joe Franklin

Senior Curriculum Manager at DataCamp

Overwhelming data

Data is often too large in its raw form

  • Even "simple" analytics require large amounts of data
  • More complex analysis can leverage millions or billions of records

 

Being able to summarize a dataset into smaller pieces is required to make informed decisions

Man overwhelmed by data

Introduzione ai dati

Summing things up

Aggregation.png

Aggregations translate raw data into summaries that are easier to understand

Common aggregations:

  • Simple average (mean)
  • Totals also known as sums
  • Minimums and maximums
  • Modes

$$

Aggregations allow you to focus on a specific attribute of a dataset

Introduzione ai dati

See the big picture

 

Aggregations appear in many ways throughout organizations

  • Metrics
  • Benchmarks
  • Key Performance Indicators (KPIs)

 

Understanding how these aggregations are created is extremely helpful for many investigations

BigPicture.png

Introduzione ai dati

Curiosity

Curiousity.png

Take a moment to ask why?

Introduzione ai dati

Data flow

Data Flow.png

Data flow within organizations is often highly complex

  • Data from many different source systems
  • Processed through other systems
  • Displayed and manipulated in other systems

The field of data management is responsible for trying to unify all these flows

Introduzione ai dati

Data domains

Data Governance: ensure data is consistent, trustworthy and isn't misused

 

Data Quality: ensure data is accurate, valid, complete and consistent

 

Data Privacy and Security: ensure proper data access, use and protection

Domains.png

Introduzione ai dati

Let's practice!

Introduzione ai dati

Preparing Video For Download...