R ile Kredi Riski Modellemesi
Lore Dirick
Manager of Data Science Curriculum at Flatiron School



1) Tüm değişkenleri içeren bir modelle başlayın (bizde 7) ve AUC’yi hesaplayın
log_model_full <- glm(loan_status ~ loan_amnt + grade + home_ownership +
annual_inc + age + emp_cat + ir_cat,
family = "binomial", data = training_set)
predictions_model_full <- predict(log_model_full,
newdata = test_set, type ="response")
AUC_model_full <- auc(test_set$loan_status, predictions_model_full)
Eğri altındaki alan: 0.6512
2) Her seferinde bir değişkeni çıkararak 7 yeni model kurun ve test seti ile PD-tahminleri yapın
log_1_remove_amnt <- glm(loan_status ~ grade + home_ownership + annual_inc + age + emp_cat + ir_cat,
family = "binomial",
data = training_set)
log_1_remove_grade <- glm(loan_status ~ loan_amnt + home_ownership + annual_inc + age + emp_cat + ir_cat,
family = "binomial",
data = training_set)
log_1_remove_home <- glm(loan_status ~ loan_amnt + grade + annual_inc + age + emp_cat + ir_cat,
family = "binomial",
data = training_set)
pred_1_remove_amnt <- predict(log_1_remove_amnt, newdata = test_set, type = "response")
pred_1_remove_grade <- predict(log_1_remove_grade, newdata = test_set, type = "response")
pred_1_remove_home <- predict(log_1_remove_home, newdata = test_set, type = "response")
...
3) En iyi AUC’ye ulaştıran modeli koruyun (tam model AUC’si: 0.6512)
auc(test_set$loan_status, pred_1_remove_amnt)
Eğri altındaki alan: 0.6537
auc(test_set$loan_status, pred_1_remove_grade)
Eğri altındaki alan: 0.6438
auc(test_set$loan_status, pred_1_remove_home)
Eğri altındaki alan: 0.6537
4) AUC (anlamlı biçimde) düşene kadar tekrarlayın
R ile Kredi Riski Modellemesi