Competitions overview

Winning a Kaggle Competition in Python

Yauhen Babakhin

Kaggle Grandmaster

Instructor

 

Yauhen Babakhin

  • Master’s Degree in Applied Data Analysis
  • 5 years of working experience in Data Science
  • Kaggle competitions Grandmaster
  • Gold medals in both classic Machine Learning and Deep Learning competitions

instructor photo

Winning a Kaggle Competition in Python

 

 

kaggle logo

Winning a Kaggle Competition in Python

Kaggle benefits

 

  1. Get practical experience on the real-world data
  2. Develop portfolio projects
  3. Meet a great Data Science community
  4. Try new domain or model type
  5. Keep up-to-date with the best performing methods
Winning a Kaggle Competition in Python

Competition process

 

 

kaggle competition process

Winning a Kaggle Competition in Python

Competition process

 

 

kaggle competition process

Winning a Kaggle Competition in Python

Competition process

 

 

kaggle competition process

Winning a Kaggle Competition in Python

How to participate

 

  1. Go to http://kaggle.com website and select the competition
  2. Download the data
  3. Start building the models!
Winning a Kaggle Competition in Python

New York city taxi fare prediction

New York city taxi fare prediction page

Winning a Kaggle Competition in Python

Train and Test data

import pandas as pd

# Read train data
taxi_train = pd.read_csv('taxi_train.csv')
taxi_train.columns.to_list()
['key',
 'fare_amount',
 'pickup_datetime',
 'pickup_longitude',
 'pickup_latitude',
 'dropoff_longitude',
 'dropoff_latitude',
 'passenger_count']
# Read test data
taxi_test = pd.read_csv('taxi_test.csv')
taxi_test.columns.to_list()
['key',
 'pickup_datetime',
 'pickup_longitude',
 'pickup_latitude',
 'dropoff_longitude',
 'dropoff_latitude',
 'passenger_count']
Winning a Kaggle Competition in Python

Sample submission

# Read sample submission
taxi_sample_sub = pd.read_csv('taxi_sample_submission.csv')
taxi_sample_sub.head()
                              key     fare_amount
0     2015-01-27 13:08:24.0000002     11.35
1     2015-01-27 13:08:24.0000003     11.35
2     2011-10-08 11:53:44.0000002     11.35
3     2012-12-01 21:12:12.0000002     11.35
4     2012-12-01 21:12:12.0000003     11.35
Winning a Kaggle Competition in Python

Let's practice!

Winning a Kaggle Competition in Python

Preparing Video For Download...