GAM to learn non-linear transformations

Supervised Learning in R: Regression

Nina Zumel and John Mount

Win-Vector, LLC

Generalized Additive Models (GAMs)

$$ y \sim b_0 + s_1(x_1) + s_2(x_2) + \cdots $$

Learning Non-linear Relationships

gam() in the mgcv package

gam(formula, family, data)

family:

  • gaussian (default): "regular" regression
  • binomial: probabilities
  • poisson/quasipoisson: counts

Best for larger datasets
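As a sketch of the `family` argument (the data and variable names below are made up for illustration), a binomial GAM models a probability rather than a continuous outcome:

```r
# Illustrative only: synthetic data, hypothetical names.
library(mgcv)

set.seed(5)
d <- data.frame(x = runif(200, -3, 3))
# Binary outcome whose probability varies non-linearly with x
d$y <- rbinom(200, 1, plogis(sin(d$x)))

# family = binomial fits on the logistic scale
m <- gam(y ~ s(x), data = d, family = binomial)

# type = "response" returns probabilities in [0, 1]
p <- predict(m, type = "response")
range(p)
```

Swapping `family = binomial` for `poisson` or `quasipoisson` would model counts in the same way.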

The s() function

anx ~ s(hassles)
  • s() designates that the variable's effect should be modeled non-linearly
  • Use s() with continuous variables
    • More than about 10 unique values

Revisit the hassles data

Model                     RMSE (cross-val)   $R^2$ (training)
Linear ($hassles$)        7.69               0.53
Quadratic ($hassles^2$)   6.89               0.63
Cubic ($hassles^3$)       6.70               0.65

GAM of the hassles data

model <- gam(
  anx ~ s(hassles), 
  data = hassleframe, 
  family = gaussian
)

summary(model)
...

R-sq.(adj) =  0.619   Deviance explained = 64.1%
GCV = 49.132  Scale est. = 45.153    n = 40

Examining the Transformations

plot(model)

$y$ values: predict(model, type = "terms")
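A small self-contained sketch of extracting the learned transformation (the original hassles data isn't included here, so this uses synthetic stand-in data with a similar non-linear shape):

```r
library(mgcv)

# Synthetic stand-in for the hassles data (illustrative only)
set.seed(34)
hassleframe <- data.frame(hassles = runif(40, 0, 100))
hassleframe$anx <- 5 + 0.002 * hassleframe$hassles^2 + rnorm(40, sd = 5)

model <- gam(anx ~ s(hassles), data = hassleframe, family = gaussian)

# type = "terms" returns a matrix with one column per model term;
# the "s(hassles)" column holds the learned (centered) transformation,
# the same curve that plot(model) draws
terms_mat <- predict(model, type = "terms")
head(terms_mat[, "s(hassles)"])
```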

Predicting with the Model

predict(model, newdata = hassleframe, type = "response")
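To connect predictions back to the RMSE comparison below, here is a hedged sketch of scoring the model (again on synthetic stand-in data, since the original hassles data isn't included):

```r
library(mgcv)

# Synthetic stand-in for the hassles data (illustrative only)
set.seed(34)
hassleframe <- data.frame(hassles = runif(40, 0, 100))
hassleframe$anx <- 5 + 0.002 * hassleframe$hassles^2 + rnorm(40, sd = 5)

model <- gam(anx ~ s(hassles), data = hassleframe, family = gaussian)

# type = "response" returns predictions on the outcome scale
pred <- predict(model, newdata = hassleframe, type = "response")

# RMSE of the predictions (here on training data; the table below
# reports cross-validated RMSE)
rmse <- sqrt(mean((hassleframe$anx - pred)^2))
rmse
```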

Comparing out-of-sample performance

Knowing the correct transformation is best, but a GAM is useful when the transformation isn't known in advance

Model                     RMSE (cross-val)   $R^2$ (training)
Linear ($hassles$)        7.69               0.53
Quadratic ($hassles^2$)   6.89               0.63
Cubic ($hassles^3$)       6.70               0.65
GAM                       7.06               0.64
  • Small dataset $\rightarrow$ noisier GAM

Let's practice!
