Welcome to the course!

Extreme Gradient Boosting with XGBoost

Sergey Fogelson

Head of Data Science, TelevisaUnivision

Before we get to XGBoost...

  • Need to understand the basics of
    • Supervised classification
    • Decision trees
    • Boosting
Extreme Gradient Boosting with XGBoost

Supervised learning

  • Relies on labeled data
  • Have some understanding of past behavior
Extreme Gradient Boosting with XGBoost

Supervised learning example

  • Does a specific image contain a person's face?
  • Training data: vectors of pixel values
  • Labels: 1 or 0
Extreme Gradient Boosting with XGBoost

Supervised learning: Classification

  • Outcome can be binary or multi-class
Extreme Gradient Boosting with XGBoost

Binary classification example

  • Will a person purchase the insurance package given some quote?

Extreme Gradient Boosting with XGBoost

Multi-class classification example

  • Classifying the species of a given bird
Extreme Gradient Boosting with XGBoost

AUC: Metric for binary classification models

  • Area under the ROC curve (AUC)
  • Larger area under the ROC curve = better model

AUC example

Extreme Gradient Boosting with XGBoost

Accuracy score and confusion matrix

  • Confusion matrix confusion matrix example

  • Accuracy $$\frac{tp+tn}{tp+tn+fp+fn}$$

Extreme Gradient Boosting with XGBoost

Review

Extreme Gradient Boosting with XGBoost

Other supervised learning considerations

  • Features can be either numeric or categorical
  • Numeric features should be scaled (Z-scored)
  • Categorical features should be encoded (one-hot)
Extreme Gradient Boosting with XGBoost

Ranking

  • Predicting an ordering on a set of choices

Extreme Gradient Boosting with XGBoost

Recommendation

  • Recommending an item to a user
  • Based on consumption history and profile
  • Example: Netflix
Extreme Gradient Boosting with XGBoost

Let's practice!

Extreme Gradient Boosting with XGBoost

Preparing Video For Download...