Feature engineering

Concetti di MLOps

Folkert Stijnman

ML Engineer

Feature engineering

MLOps phases feature engineering

Concetti di MLOps

Feature engineering

... is the process of selecting, manipulating, and transforming raw data into features.

  • A feature is a variable, such as the column in a table
  • We can use raw data, but also create our own
Concetti di MLOps

Customer data

Example customer data

Concetti di MLOps

Customer data

Example customer data with new feature

Concetti di MLOps

Feature engineering

  • Goal is to enhance model performance
  • Tools and techniques help to process, select, and maintain features:
    • Feature selection
    • Feature store
    • Data version control
Concetti di MLOps

Feature selection

  • Domain-specific knowledge
  • Correlation
  • Feature importances
  • Other methods: univariate selection, Principal Component Analysis (PCA), Recursive Feature Elimination (RFE)

correlation plot

1 https://www.datacamp.com/tutorial/tutorial-datails-on-correlation
Concetti di MLOps

The feature store

Feature store

Only relevant for large teams working on multiple projects that use the same features

Concetti di MLOps

Data version control

  • Tracking dataset changes
  • Maintaining consistency throughout the development lifecycle

graphic depicting data version control and git

1 https://www.datacamp.com/courses/cicd-for-machine-learning
Concetti di MLOps

Let's practice!

Concetti di MLOps

Preparing Video For Download...