What have we learned?
Introduction to PySpark
Benjamin Schmidt
Data Engineer
What you did
- Learned about PySparks clusters
- PySpark critical syntax
- RDDs and DataFrames
- Spark SQL
What you haven't done (yet)
- Cluster management
- Complex job optimization
- PySpark at scale
- Machine learning
What you can do next on DataCamp
- Big Data Fundamentals with PySpark
- Cleaning Data with PySpark
- Machine Learning with PySpark
Keep going and practicing
Introduction to PySpark
Preparing Video For Download...