Data as a resource

Introduction to Data

Joe Franklin

Senior Curriculum Manager at DataCamp

Overwhelming data

Data is often too large in its raw form

  • Even "simple" analytics require large amounts of data
  • More complex analysis can leverage millions or billions of records

 

Being able to summarize a dataset into smaller pieces is required to make informed decisions

Man overwhelmed by data

Introduction to Data

Summing things up

Aggregation.png

Aggregations translate raw data into summaries that are easier to understand

Common aggregations:

  • Simple average (mean)
  • Totals also known as sums
  • Minimums and maximums
  • Modes

$$

Aggregations allow you to focus on a specific attribute of a dataset

Introduction to Data

See the big picture

 

Aggregations appear in many ways throughout organizations

  • Metrics
  • Benchmarks
  • Key Performance Indicators (KPIs)

 

Understanding how these aggregations are created is extremely helpful for many investigations

BigPicture.png

Introduction to Data

Curiosity

Curiousity.png

Take a moment to ask why?

Introduction to Data

Data flow

Data Flow.png

Data flow within organizations is often highly complex

  • Data from many different source systems
  • Processed through other systems
  • Displayed and manipulated in other systems

The field of data management is responsible for trying to unify all these flows

Introduction to Data

Data domains

Data Governance: ensure data is consistent, trustworthy and isn't misused

 

Data Quality: ensure data is accurate, valid, complete and consistent

 

Data Privacy and Security: ensure proper data access, use and protection

Domains.png

Introduction to Data

Let's practice!

Introduction to Data

Preparing Video For Download...