What have we learned?
Introduction to PySpark
Benjamin Schmidt
Data Engineer

What you did
- Learned about PySpark clusters
- Learned critical PySpark syntax
- Worked with RDDs and DataFrames
- Used Spark SQL
 
 
         
        
       
      
  
     
   
 
    
   
        
  
    
What you haven't done (yet)
- Cluster management
- Complex job optimization
- PySpark at scale
- Machine learning
What you can do next on DataCamp
- Big Data Fundamentals with PySpark
- Cleaning Data with PySpark
- Machine Learning with PySpark

Keep going and keep practicing!