Examples of the modern data architecture

Understanding Modern Data Architecture

Miller Trujillo

Senior Software Engineer

Lambda architecture

  • Data processing architecture
  • Enables processing of large-scale datasets
  • Enables real-time data processing
  • Combines batch and streaming
  • 3 main layers:
    • Batch layer
    • Speed layer
    • Serving layer

Lambda Architecture with batch layer highlighted

Lambda architecture layers: Batch

Lambda Architecture with batch layer highlighted

  • Master dataset
    • Immutable data
    • Append-only
  • Pre-computed views
  • Bank accounts
    • Master dataset: List of transactions
    • Views: Transactions applied to get the balance
  • Perfect accuracy
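
The bank account example above can be sketched in a few lines of Python. This is only an illustration under assumed names: `master_dataset`, `compute_batch_view`, and the sample transactions are hypothetical and not part of any specific framework.

```python
from collections import defaultdict

# Hypothetical master dataset: an append-only list of immutable transactions.
# Each record is an (account, amount) tuple; existing records are never edited.
master_dataset = [
    ("alice", 100),
    ("bob", 50),
    ("alice", -30),
]


def compute_batch_view(transactions):
    """Recompute every balance from scratch over the full master dataset."""
    balances = defaultdict(int)
    for account, amount in transactions:
        balances[account] += amount
    return dict(balances)


# The pre-computed view: transactions applied to get the balance.
batch_view = compute_batch_view(master_dataset)
```

Because the view is always rebuilt from the complete, immutable history, a bug in the view logic can be fixed and the view recomputed without losing data.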

Lambda architecture layers: Speed

Lambda Architecture with speed layer highlighted

  • Batch jobs at a given schedule
  • Gap between known data and reality
  • Speed layer closes that gap
  • Not perfect accuracy
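
A minimal sketch of how a speed layer might close that gap, assuming events arrive one at a time as (account, amount) pairs. The `SpeedLayer` class and its method names are hypothetical, chosen only for illustration.

```python
from collections import defaultdict


class SpeedLayer:
    """Incremental view of events that arrived after the last batch run."""

    def __init__(self):
        self.realtime_view = defaultdict(int)

    def on_event(self, account, amount):
        # Update the view per event instead of recomputing the full history.
        self.realtime_view[account] += amount

    def reset(self):
        # Once a batch run has absorbed these events, the view is discarded.
        self.realtime_view.clear()


speed = SpeedLayer()
speed.on_event("alice", -20)
speed.on_event("bob", 5)
```

This toy version happens to be exact. Real speed layers often give up some accuracy, for example through approximate algorithms or late-arriving events, which is why the batch layer remains the source of truth.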

Lambda architecture layers: Serving

Lambda Architecture with serving layer highlighted

  • Merge batch and real-time views
  • Unified view of data
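
As a rough sketch, the merge can be as simple as combining the two views at query time. The function `serve_balance` and the sample views below are assumptions for illustration: the batch view is taken to hold balances as of the last batch run, and the real-time view to hold the changes since then.

```python
def serve_balance(account, batch_view, realtime_view):
    """Answer a query by merging the batch view with the real-time view."""
    return batch_view.get(account, 0) + realtime_view.get(account, 0)


# Hypothetical views: balances as of the last batch run, and changes since.
batch_view = {"alice": 70, "bob": 50}
realtime_view = {"alice": -20}

alice_balance = serve_balance("alice", batch_view, realtime_view)
bob_balance = serve_balance("bob", batch_view, realtime_view)
```

Addition works here because balances are additive. Other kinds of views may need a different merge rule.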

Complexity of Lambda architecture

  • Two processing layers (batch and speed) increase complexity
  • Duplicated logic, code, algorithms
  • Multiple stacks: Batch and streaming

Kappa architecture

Kappa Architecture

Lambda vs. Kappa

Aspect               | Lambda Architecture               | Kappa Architecture
Batch Processing     | Present                           | By processing the source of events again
Stream Processing    | Present                           | Present
Complexity           | Higher                            | Lower
Real-time Processing | Delayed by batch layer            | Direct and streamlined
Use Cases            | Complex analysis, historical data | Real-time analytics, real-time monitoring
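
The "Batch Processing" row for Kappa can be illustrated with a small sketch: a single piece of stream logic serves both live events and historical reprocessing, which is done by replaying the event log. The names `process`, `replay`, and `event_log` are hypothetical, and a plain Python list stands in for a durable, replayable log.

```python
def process(balances, event):
    """The single stream-processing code path, used live and during replay."""
    account, amount = event
    balances[account] = balances.get(account, 0) + amount
    return balances


def replay(log):
    """Rebuild a view by running the same logic over the log from the start."""
    balances = {}
    for event in log:
        process(balances, event)
    return balances


# Hypothetical append-only event log.
event_log = [("alice", 100), ("bob", 50), ("alice", -30)]

# Live processing: events are applied as they arrive.
live_view = {}
for event in event_log:
    process(live_view, event)

# Reprocessing: no separate batch code, only a replay of the same log.
rebuilt_view = replay(event_log)
```

Since there is only one code path, the logic is not duplicated across a batch stack and a streaming stack, which is where the lower complexity in the table comes from.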

Let's practice!
