Feature engineering

Preprocessing for Machine Learning in Python

James Chapman

Curriculum Manager, DataCamp

What is feature engineering?

Feature engineering: creating new features from existing ones

  • Can improve model performance
  • Gives insight into relationships between features
  • Requires understanding the data first
  • Highly dataset-dependent

Feature engineering scenarios

Id  Text
1   "Feature engineering is fun!"
2   "Feature engineering is a lot of work."
3   "I don't mind feature engineering."
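
Free text usually has to be turned into numbers before a model can use it. The slide does not name a technique, so the following is only a minimal sketch of one common option, a bag-of-words count, written with the standard library. The names `texts`, `tokenize`, `vocabulary`, and `bag_of_words` are hypothetical and chosen for illustration.

```python
import re
from collections import Counter

# The three example sentences from the table, keyed by Id.
texts = {
    1: "Feature engineering is fun!",
    2: "Feature engineering is a lot of work.",
    3: "I don't mind feature engineering.",
}


def tokenize(text):
    """Lowercase the text and split it into word tokens (apostrophes kept)."""
    return re.findall(r"[a-z']+", text.lower())


# Vocabulary: every distinct token across all texts, sorted for a stable column order.
vocabulary = sorted({tok for text in texts.values() for tok in tokenize(text)})

# One list of counts per text, with one position per vocabulary word.
bag_of_words = {
    doc_id: [Counter(tokenize(text))[word] for word in vocabulary]
    for doc_id, text in texts.items()
}

for doc_id, counts in bag_of_words.items():
    print(doc_id, dict(zip(vocabulary, counts)))
```

In practice a library vectorizer would likely replace the hand-written loop; the point here is only that each sentence becomes a fixed-length row of numbers.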

 

user  fav_color
1     blue
2     green
3     orange
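
A categorical column such as `fav_color` can be encoded as numeric indicator columns. As a hedged sketch, assuming pandas is available and that one-hot encoding suits the model being used, the table above could be handled like this (the variable names `users` and `encoded` are illustrative):

```python
import pandas as pd

# The example table of users and their favorite colors.
users = pd.DataFrame(
    {"user": [1, 2, 3], "fav_color": ["blue", "green", "orange"]}
)

# One-hot encode fav_color: one 0/1 column per distinct color.
encoded = pd.get_dummies(users, columns=["fav_color"], dtype=int)

print(encoded)
```

Each color becomes its own column (`fav_color_blue`, `fav_color_green`, `fav_color_orange`), so no artificial ordering between colors is implied.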

Feature engineering scenarios

Id  Date
4   July 30 2011
5   January 29 2011
6   February 05 2011
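
Dates stored as strings often carry more detail than a model needs. One possible approach, sketched here under the assumption that pandas is available and that the month is the useful signal, is to parse the strings and extract a date component. The names `events` and `month` are illustrative.

```python
import pandas as pd

# The example table of dates, stored as plain strings.
events = pd.DataFrame(
    {"Id": [4, 5, 6], "Date": ["July 30 2011", "January 29 2011", "February 05 2011"]}
)

# Parse "Month DD YYYY" strings into datetime values
# (%B matches full English month names under the default locale).
events["Date"] = pd.to_datetime(events["Date"], format="%B %d %Y")

# Engineer a new feature: the month number of each date.
events["month"] = events["Date"].dt.month

print(events)
```

Other components, such as the year or day of week, are available through the same `.dt` accessor if the problem calls for them.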

 

user  test1  test2  test3
1     90.5   89.6   91.4
2     65.5   70.6   67.3
3     78.1   80.7   81.8
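
Several related numeric columns can sometimes be summarized by a single aggregate. As an illustrative sketch, assuming pandas is available and that the average score is a meaningful summary for the task, the three test columns could be combined as follows (the column name `mean_score` is hypothetical):

```python
import pandas as pd

# The example table of test scores per user.
scores = pd.DataFrame(
    {
        "user": [1, 2, 3],
        "test1": [90.5, 65.5, 78.1],
        "test2": [89.6, 70.6, 80.7],
        "test3": [91.4, 67.3, 81.8],
    }
)

# Engineer a new feature: the row-wise mean of the three test columns.
scores["mean_score"] = scores[["test1", "test2", "test3"]].mean(axis=1)

print(scores)
```

Whether the mean, the median, or some other statistic is appropriate depends on the dataset, which is why understanding the data comes first.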

Let's practice!
