What have we learned?

Introduction to PySpark

Benjamin Schmidt

Data Engineer

What you did

  • Learned about PySparks clusters
  • PySpark critical syntax
  • RDDs and DataFrames
  • Spark SQL

PYSPARK!

Introduction to PySpark

What you haven't done (yet)

  • Cluster management
  • Complex job optimization
  • PySpark at scale
  • Machine learning

What you can do next on DataCamp

  • Big Data Fundamentals with PySpark
  • Cleaning Data with PySpark
  • Machine Learning with PySpark
Introduction to PySpark

Keep going and practicing

Introduction to PySpark

Preparing Video For Download...