Cloud computing

Capire il Data Engineering

Hadrien Lacroix

Content Developer

Cloud computing for data processing

Servers on premises

  • Bought
  • Need space
  • Electrical and maintenance cost
  • Enough power for peak moments
  • Processing power unused at quieter times

Servers on the cloud

  • Rented
  • Don't need space
  • Use just the resources we need
  • When we need them
  • The closer to the user the better
Capire il Data Engineering

Cloud computing for data storage

  • Database reliability: data replication
  • Risk with sensitive data
Capire il Data Engineering

AWS

Capire il Data Engineering

Microsoft Azure

Capire il Data Engineering

Google Cloud

Capire il Data Engineering

file storage

Capire il Data Engineering

S3

Capire il Data Engineering

Azure Blob Storage

Capire il Data Engineering

Google Cloud Storage

Capire il Data Engineering

computation

Capire il Data Engineering

EC2

Capire il Data Engineering

Azure Virtual Machines

Capire il Data Engineering

Google Compute Engine

Capire il Data Engineering

databases

Capire il Data Engineering

AWS RDS

Capire il Data Engineering

Azure SQL Database

Capire il Data Engineering

Gogle Cloud SQL

Capire il Data Engineering

data pipeline

Capire il Data Engineering

using AWS S3 to store cover albums

Capire il Data Engineering

using EC2 to convert songs

Capire il Data Engineering

using RDS to store employees

Capire il Data Engineering

Multicloud

Pros

  • Reducing reliance on a single vendor
  • Cost-efficiencies
  • Local laws requiring certain data to be physically present within the country
  • Mitigating against disasters

Cons

  • Cloud providers try to lock in consumers
  • Incompatibility
  • Security and governance
Capire il Data Engineering

Summary

  • Benefits and risks of cloud computing
  • How it is implemented at Spotflix
  • Can cite the main cloud providers and their services
Capire il Data Engineering

Let's practice!

Capire il Data Engineering

Preparing Video For Download...