Data Ingestion

Understanding Modern Data Architecture

Miller Trujillo

Senior Software Engineer

What is data ingestion?

  • Functional requirements
  • Functional requirements can be impacted by analytics

Generic big data architecture: data sources, ingestion, storage, processing, orchestration, governance, serving, and analytics storage and reporting

Batch ingestion

  • Scheduled to ingest data periodically
  • Copy data into our platform for analytics
  • Reading all data vs reading what is new to us
  • Big datasets require partial reads
  • Smaller datasets could be overwritten
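
The two batch strategies above can be sketched in a few lines. This is a minimal illustration, not a production pattern: the function names, the `day` partition column, and the in-memory lists standing in for the source system and the platform are all assumptions made for the example.

```python
from datetime import date


def ingest_full(source_rows):
    """Small dataset: re-read everything and overwrite the platform copy."""
    return list(source_rows)


def ingest_partition(source_rows, target, day):
    """Large dataset: read only one day's slice and replace that slice."""
    new_rows = [r for r in source_rows if r["day"] == day]
    kept = [r for r in target if r["day"] != day]
    return kept + new_rows


# Hypothetical source table, partitioned by a `day` column.
source = [
    {"id": 1, "day": date(2024, 1, 1)},
    {"id": 2, "day": date(2024, 1, 2)},
    {"id": 3, "day": date(2024, 1, 2)},
]

# First scheduled run: only day 1 exists, so a full copy is cheap.
target = ingest_full(source[:1])
# Next scheduled run: bring in only the new day's partition.
target = ingest_partition(source, target, date(2024, 1, 2))
```

In a real platform the scheduler (an orchestration tool, for instance) would trigger each run; here the two calls simply stand in for two consecutive scheduled executions.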

Batch ingestion: Bring only what changed

  • Resources are finite, so re-reading everything does not scale
  • Ingest only what has changed
  • Use an updated-at timestamp or a change flag
  • Latest state of data
  • Deletions require a flag or a consolidation step
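
One way to picture "ingest only what has changed" is a watermark on an updated-at column combined with a deletion flag. The sketch below is illustrative only: the column names `updated_at` and `is_deleted`, the `ingest_changes` function, and the dict standing in for the platform table are assumptions, not part of the course material.

```python
from datetime import datetime


def ingest_changes(source_rows, target, watermark):
    """Apply rows changed after `watermark` to `target` (a dict keyed by id).

    Returns the new watermark to store for the next run.
    """
    new_watermark = watermark
    # Process changes oldest first so the latest state wins.
    for row in sorted(source_rows, key=lambda r: r["updated_at"]):
        if row["updated_at"] <= watermark:
            continue  # already ingested in a previous run
        if row["is_deleted"]:
            target.pop(row["id"], None)  # deletion signalled by a flag
        else:
            target[row["id"]] = {"id": row["id"], "value": row["value"]}
        new_watermark = max(new_watermark, row["updated_at"])
    return new_watermark


# Hypothetical source rows with an updated-at timestamp and a delete flag.
source = [
    {"id": 1, "value": "a", "updated_at": datetime(2024, 1, 1), "is_deleted": False},
    {"id": 2, "value": "b2", "updated_at": datetime(2024, 1, 3), "is_deleted": False},
    {"id": 3, "value": "c", "updated_at": datetime(2024, 1, 4), "is_deleted": True},
]
# State of the platform copy after the previous run.
target = {
    1: {"id": 1, "value": "a"},
    2: {"id": 2, "value": "b"},
    3: {"id": 3, "value": "c"},
}

new_watermark = ingest_changes(source, target, datetime(2024, 1, 2))
```

If the source deletes rows physically instead of flagging them, no changed row ever appears, which is why the alternative is a periodic consolidation that compares the full set of keys.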

Streaming ingestion

  • Push model
  • Event queues
  • 24/7 compute
  • Landing zone
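
The push model can be mimicked with the standard library alone. In this rough sketch a `queue.Queue` stands in for the event queue, a worker thread for the always-on compute, and a temporary directory for the landing zone; a real setup would use a message broker and object storage, and the event fields shown are made up for illustration.

```python
import json
import queue
import tempfile
import threading
from pathlib import Path

STOP = object()  # sentinel used only to end this demo cleanly


def consumer(events, landing_zone):
    """Always-on worker: write each pushed event as a raw JSON file."""
    count = 0
    while True:
        event = events.get()  # blocks until a producer pushes something
        if event is STOP:
            break
        path = landing_zone / f"event_{count:06d}.json"
        path.write_text(json.dumps(event))
        count += 1


events = queue.Queue()                    # stand-in for the event queue
landing_zone = Path(tempfile.mkdtemp())   # stand-in for the landing zone

worker = threading.Thread(target=consumer, args=(events, landing_zone))
worker.start()

# The source system pushes events as they happen.
for i in range(3):
    events.put({"order_id": i, "status": "created"})

events.put(STOP)
worker.join()

landed = sorted(p.name for p in landing_zone.iterdir())
```

The consumer never polls the source; it only reacts to what is pushed, which is why the compute has to stay up around the clock.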

Streaming ingestion workflow

Let's practice!
