Data Ingestion
Understanding Modern Data Architecture
Miller Trujillo
Senior Software Engineer
What is data ingestion?
Functional requirements
Functional (operational) systems can be impacted by analytics workloads
Batch ingestion
Scheduled to ingest data periodically
Copy the data into our platform for analytics
Reading all data vs reading what is new to us
Big datasets require partial reads
Smaller datasets can be fully overwritten on each run
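The two strategies above can be sketched in a few lines. This is a minimal illustration, not a production loader: the function names overwrite_load and partial_load, the row shape, and the chunk size are all assumptions made for the example.

```python
from typing import Iterable, Iterator

Row = dict  # hypothetical row shape: one dict per record


def overwrite_load(source_rows: Iterable[Row]) -> list[Row]:
    """Small dataset: re-read everything and replace the previous copy."""
    return list(source_rows)


def partial_load(source_rows: Iterable[Row], chunk_size: int) -> Iterator[list[Row]]:
    """Big dataset: read and hand over the data one chunk at a time."""
    chunk: list[Row] = []
    for row in source_rows:
        chunk.append(row)
        if len(chunk) == chunk_size:
            yield chunk
            chunk = []
    if chunk:  # final, possibly shorter chunk
        yield chunk


rows = [{"id": i} for i in range(5)]
full_copy = overwrite_load(rows)
chunk_sizes = [len(c) for c in partial_load(rows, chunk_size=2)]
```

With five rows and a chunk size of two, the partial load yields chunks of 2, 2, and 1 rows, while the overwrite load simply returns all five.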
Batch ingestion: Bring only what changed
Compute and storage resources are finite
Ingest only what has changed
Detect changes with an updated-at timestamp or a change flag
Keep the latest state of the data
Deletions require a delete flag or a consolidation step
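One common way to bring only what changed is a watermark on the updated-at column, with a flag marking deleted rows. The sketch below uses in-memory SQLite databases as stand-ins for the source system and the analytics platform. The customers table, its column names (updated_at, is_deleted), and the function incremental_ingest are assumptions for illustration only.

```python
import sqlite3


def incremental_ingest(source, target, last_watermark):
    """Copy rows changed after last_watermark and return the new watermark."""
    rows = source.execute(
        "SELECT id, name, updated_at, is_deleted FROM customers "
        "WHERE updated_at > ? ORDER BY updated_at",
        (last_watermark,),
    ).fetchall()
    for id_, name, updated_at, is_deleted in rows:
        if is_deleted:
            # The delete flag tells us to remove the row from our copy.
            target.execute("DELETE FROM customers WHERE id = ?", (id_,))
        else:
            # Insert new rows, or replace existing ones with their latest state.
            target.execute(
                "INSERT OR REPLACE INTO customers (id, name, updated_at) "
                "VALUES (?, ?, ?)",
                (id_, name, updated_at),
            )
    target.commit()
    return rows[-1][2] if rows else last_watermark


# Hypothetical source system and analytics copy.
source = sqlite3.connect(":memory:")
source.execute(
    "CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT, "
    "updated_at TEXT, is_deleted INTEGER DEFAULT 0)"
)
source.executemany(
    "INSERT INTO customers VALUES (?, ?, ?, ?)",
    [(1, "Ana", "2024-01-01", 0), (2, "Ben", "2024-01-02", 0)],
)
target = sqlite3.connect(":memory:")
target.execute(
    "CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT, updated_at TEXT)"
)

# First run: everything is new.
watermark = incremental_ingest(source, target, "")

# The source changes: one update and one flagged deletion.
source.execute(
    "UPDATE customers SET name = 'Ana M.', updated_at = '2024-01-03' WHERE id = 1"
)
source.execute(
    "UPDATE customers SET is_deleted = 1, updated_at = '2024-01-04' WHERE id = 2"
)

# Second run: only the two changed rows are read.
watermark = incremental_ingest(source, target, watermark)
final_rows = target.execute(
    "SELECT id, name, updated_at FROM customers ORDER BY id"
).fetchall()
```

The delete flag matters because a row that is physically removed from the source never appears in a "changed since" query. Without a flag, the alternative is a periodic consolidation that compares the full set of source keys against the copy.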
Streaming ingestion
Push model
Event queues
24/7 compute
Landing zone
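The push model can be sketched with an in-process queue: a producer pushes events as they happen, and a consumer that is always running writes them to the landing zone. This is only an illustration under stated assumptions. A real setup would typically use a managed event queue rather than Python's queue.Queue, and the event fields, the events.jsonl file name, and the STOP sentinel are hypothetical.

```python
import json
import queue
import tempfile
import threading
from pathlib import Path

STOP = object()  # hypothetical sentinel used to end the demo cleanly


def consume(events: queue.Queue, landing_zone: Path) -> None:
    """Run continuously, appending each pushed event to the landing zone as-is."""
    out_file = landing_zone / "events.jsonl"
    with out_file.open("a", encoding="utf-8") as f:
        while True:
            event = events.get()  # blocks until the source pushes an event
            if event is STOP:
                break
            f.write(json.dumps(event) + "\n")
            f.flush()


events: queue.Queue = queue.Queue()
landing_zone = Path(tempfile.mkdtemp())  # stand-in for a landing zone location

worker = threading.Thread(target=consume, args=(events, landing_zone))
worker.start()

# The source system pushes events whenever something happens.
for i in range(3):
    events.put({"order_id": i, "status": "created"})

events.put(STOP)
worker.join()

landed = [
    json.loads(line)
    for line in (landing_zone / "events.jsonl").read_text(encoding="utf-8").splitlines()
]
```

The consumer blocks while waiting for the next event, which is why streaming ingestion needs compute available around the clock, unlike a batch job that runs on a schedule and then stops.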
Let's practice!