Einführung in das Data Engineering
Vincent Vankrunkelsven
Data Engineer @ DataCamp



SELECT year, AVG(age)
FROM views.athlete_events
GROUP BY year


.map() oder .filter().count() oder .first()
# Load the dataset into athlete_events_spark first
(athlete_events_spark
.groupBy('Year')
.mean('Age')
.show())
SELECT year, AVG(age)
FROM views.athlete_events
GROUP BY year
Einführung in das Data Engineering