Supervised Learning in R: Regression
Nina Zumel and John Mount
Win-Vector, LLC
Example of an additive relationship:
plant_height ~ bacteria + sun
The simultaneous influence of two variables on the outcome is not additive.
plant_height ~ bacteria + sun + bacteria:sun
The simultaneous influence of two variables on the outcome is not additive.
plant_height ~ bacteria + sun + bacteria:sun
sun
: categorical {"sun", "shade"}Like two separate models: one for sun, one for shade.
yield ~ Stress + SO2 + O3
Metabol ~ Gastric + Sex
Interaction - Colon (:
)
y ~ a:b
Main effects and interaction - Asterisk (*
)
y ~ a*b
# Both mean the same
y ~ a + b + a:b
Expressing the product of two variables - I
y ~ I(a*b)
same as $y \propto a b$
Formula | RMSE (cross validation) |
---|---|
Metabol ~ Gastric + Sex |
1.46 |
Metabol ~ Gastric * Sex |
1.48 |
Metabol ~ Gastric + Gastric:Sex |
1.39 |
Supervised Learning in R: Regression