Introduction to Apache Kafka
Mike Metzger
Data Engineering Consultant
About me
25+ year IT career, 18+ years in data
Tools:
Apache Spark
Apache Airflow
Apache Kafka
What is Apache Kafka?
Open-source
Distributed
Event streaming
Designed to handle large quantities of data
Scalable
Event streaming in Kafka
Obtain information from sources
Store it reliably
Distribute data to clients
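The three steps above can be illustrated with a small in-memory sketch. This is a toy model only and does not use the Kafka API: the `EventLog` class, its methods, and the sample order events are all hypothetical names chosen for illustration. It assumes that "stored reliably" can be approximated by an append-only list, and that each client reads from a position of its own choosing.

```python
# Toy in-memory model of event streaming; this is NOT the Kafka API.
# EventLog and the sample events are hypothetical, for illustration only.
from dataclasses import dataclass, field


@dataclass
class EventLog:
    """Append-only list of events, standing in for durable storage."""
    events: list = field(default_factory=list)

    def append(self, event: dict) -> int:
        """Store an event and return its position (offset) in the log."""
        self.events.append(event)
        return len(self.events) - 1

    def read_from(self, offset: int) -> list:
        """Return every event at or after the given offset."""
        return self.events[offset:]


log = EventLog()

# 1. Obtain information from sources (hypothetical order events).
source_events = [
    {"order_id": 1, "status": "placed"},
    {"order_id": 1, "status": "shipped"},
]

# 2. Store it reliably: here, simply append in arrival order.
for event in source_events:
    log.append(event)

# 3. Distribute data to clients: each client reads the log independently.
billing_view = log.read_from(0)
shipping_view = log.read_from(1)
print(billing_view)
print(shipping_view)
```

Reading does not remove events from the log, so several clients can receive the same data without interfering with one another.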
Common uses
Ecommerce
Order tracking
Ride-share / Food delivery
Sensor data
Cybersecurity
Photo by Carlos Muza on Unsplash
Kafka components
User facing components
Topics
Producers (writers)
Consumers (readers)
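As a rough sketch of how these three components relate, the toy model below keeps named topics in memory, lets a producer write to a topic, and lets a consumer read from one. It is not the Kafka client API: `TopicStore`, `Producer.send`, `Consumer.poll`, and the topic names are hypothetical, and the model assumes each consumer simply remembers how far it has read.

```python
# Toy in-memory model of the user-facing components; NOT the Kafka API.
# TopicStore, Producer, Consumer, and the topic names are hypothetical.
from collections import defaultdict


class TopicStore:
    """Holds named topics; each topic is an append-only list of messages."""

    def __init__(self):
        self.topics = defaultdict(list)


class Producer:
    """Writer: appends messages to a named topic."""

    def __init__(self, store):
        self.store = store

    def send(self, topic, message):
        self.store.topics[topic].append(message)


class Consumer:
    """Reader: returns unread messages from one topic and tracks its position."""

    def __init__(self, store, topic):
        self.store = store
        self.topic = topic
        self.offset = 0  # index of the next message this consumer will read

    def poll(self):
        messages = self.store.topics[self.topic][self.offset:]
        self.offset += len(messages)
        return messages


store = TopicStore()
producer = Producer(store)
consumer = Consumer(store, "orders")

producer.send("orders", "order 1 placed")
producer.send("rides", "ride 7 requested")  # different topic, not seen below
first = consumer.poll()

producer.send("orders", "order 1 shipped")
second = consumer.poll()
third = consumer.poll()  # nothing new has been written
print(first, second, third)
```

The consumer only sees messages from the topic it reads, and each call to `poll` returns only what has been written since its previous call.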
Let's practice!