Cloud providers

Introduction to Data Engineering

Vincent Vankrunkelsven

Data Engineer @ DataCamp

Data processing in the cloud

Clusters of machines required

Problem: self-hosting a data center

  • Cover electrical and maintenance costs
  • Peaks vs. quiet moments: hard to optimize
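
The peaks-versus-quiet-moments problem can be illustrated with a little arithmetic. The sketch below is not from the course: the demand figures and the price are hypothetical, and it assumes the same price per machine-hour for both setups, which real pricing rarely matches. It only shows why capacity sized for the peak sits idle most of the day.

```python
# Hypothetical number of machines needed in each hour of one day:
# quiet at night, busy in the afternoon.
hourly_demand = [2, 2, 2, 2, 2, 3, 5, 8, 10, 12, 14, 16,
                 20, 18, 15, 12, 10, 8, 6, 5, 4, 3, 2, 2]

COST_PER_MACHINE_HOUR = 0.10  # assumed price, identical for both setups


def self_hosted_cost(demand, price):
    """Own enough machines for the peak hour and pay for all of them every hour."""
    return max(demand) * len(demand) * price


def on_demand_cost(demand, price):
    """Pay only for the machines actually needed in each hour."""
    return sum(demand) * price


peak_sized = self_hosted_cost(hourly_demand, COST_PER_MACHINE_HOUR)
pay_per_use = on_demand_cost(hourly_demand, COST_PER_MACHINE_HOUR)

# Fraction of the peak-sized capacity that is actually used over the day.
utilization = sum(hourly_demand) / (max(hourly_demand) * len(hourly_demand))

print(f"peak-sized: {peak_sized:.2f}, pay-per-use: {pay_per_use:.2f}, "
      f"utilization: {utilization:.0%}")
```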

 

Solution: use the cloud

Image representing cloud computing

Image of a data center


Data storage in the cloud

Reliability is required

Problem: self-hosting a data center

  • Disaster will strike
  • Need different geographical locations
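
Why different geographical locations help can be sketched with a simple probability. This is an illustration, not a reliability model from the course: it assumes each site fails independently with the same probability, and the 1% figure is made up.

```python
def probability_all_copies_lost(p_site_failure, n_sites):
    """Chance that every site holding a copy fails, assuming independent failures."""
    return p_site_failure ** n_sites


# Hypothetical: each location has a 1% chance of a disaster in a given year.
for n_sites in (1, 2, 3):
    p = probability_all_copies_lost(0.01, n_sites)
    print(f"{n_sites} location(s): {p:.6%} chance of losing all copies")
```

Independence is the key assumption here. Two racks in the same building share the same fire or flood, which is why the copies need to be far apart.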

 

Solution: use the cloud

Image representing cloud computing

Cartoon image of explosion


The big three: AWS, Azure and Google

 

AWS: 32% market share in 2018

Azure: 17% market share in 2018

Google Cloud: 10% market share in 2018

 

 

  • Storage
  • Computation
  • Databases

Storage

Upload files, e.g. storing product images

Services

  • AWS S3
  • Azure Blob Storage
  • Google Cloud Storage
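
As a rough sketch of the product-image example, here is how an upload to AWS S3 might look with the boto3 SDK. The bucket name, the `products/<id>/<file>` key layout, and both helper names are hypothetical. The upload itself needs boto3 installed and AWS credentials configured, so only the key-building helper runs without them.

```python
from pathlib import Path


def build_object_key(product_id, image_path):
    """Return a storage key such as 'products/42/front.jpg' (hypothetical layout)."""
    return f"products/{product_id}/{Path(image_path).name}"


def upload_product_image(bucket, product_id, image_path):
    """Upload one local image file to an S3 bucket and return the key it was stored under."""
    # Third-party AWS SDK, imported here so build_object_key works without it.
    import boto3

    key = build_object_key(product_id, image_path)
    boto3.client("s3").upload_file(str(image_path), bucket, key)
    return key
```

A call would look like `upload_product_image("my-shop-images", 42, "front.jpg")`, where `my-shop-images` stands in for a bucket you own. Azure Blob Storage and Google Cloud Storage offer comparable uploads through their own SDKs.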

Computation

Perform calculations, e.g. hosting a web server

Services

  • AWS EC2
  • Azure Virtual Machines
  • Google Compute Engine
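
To make the web-server example concrete, below is a minimal sketch of a server that could run on a rented virtual machine. It uses only Python's standard library. The handler name, response text, and default port are made up for illustration, and a real deployment would normally sit behind a production-grade server.

```python
from http.server import BaseHTTPRequestHandler, HTTPServer


class HelloHandler(BaseHTTPRequestHandler):
    """Answer every GET request with a short plain-text message."""

    def do_GET(self):
        body = b"Hello from a cloud VM\n"
        self.send_response(200)
        self.send_header("Content-Type", "text/plain; charset=utf-8")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, format, *args):
        pass  # suppress the default per-request logging


def make_server(port=8000):
    """Create a server listening on all network interfaces of the machine."""
    return HTTPServer(("0.0.0.0", port), HelloHandler)
```

Calling `make_server().serve_forever()` on the machine starts serving requests. The provider's firewall rules must also allow traffic on the chosen port.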

Databases

Hold structured information

Services

  • AWS RDS
  • Azure SQL Database
  • Google Cloud SQL
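
"Structured information" here means rows with fixed, typed columns that can be queried with SQL. The sketch below uses Python's built-in sqlite3 module as a local stand-in, not one of the managed services listed above. The table and its rows are hypothetical.

```python
import sqlite3

# An in-memory database keeps the sketch self-contained; a managed cloud
# database would be reached over the network with its own driver instead.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE products ("
    "id INTEGER PRIMARY KEY, name TEXT NOT NULL, price REAL NOT NULL)"
)
conn.executemany(
    "INSERT INTO products (name, price) VALUES (?, ?)",
    [("mug", 7.5), ("poster", 12.0)],
)
conn.commit()

# Every row has the same columns, so filtering and sorting are simple to express.
cheap = conn.execute(
    "SELECT name FROM products WHERE price < ? ORDER BY name", (10,)
).fetchall()
print(cheap)
```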

Let's practice!
