Modelvalidatie in Python
Kasey Jones
Data Scientist

Voordelen:
Nadelen:
from sklearn.model_selection import RandomizedSearchCV
random_search = RandomizedSearchCV()
Parameterverdeling:
param_dist = {"max_depth": [4, 6, 8, None],
"max_features": range(2, 11),
"min_samples_split": range(2, 11)}
Parameters:
estimator: te gebruiken modelparam_distributions: dictionary met hyperparameters en mogelijke waardenn_iter: aantal iteratiesscoring: te gebruiken scoringsmethodeparam_dist = {"max_depth": [4, 6, 8, None],
"max_features": range(2, 11),
"min_samples_split": range(2, 11)}
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import make_scorer, mean_absolute_error
rfr = RandomForestRegressor(n_estimators=20, random_state=1111)
scorer = make_scorer(mean_absolute_error)
Random search opzetten:
random_search =\
RandomizedSearchCV(estimator=rfr,
param_distributions=param_dist,
n_iter=40,
cv=5)
Random search opzetten:
random_search =\
RandomizedSearchCV(estimator=rfr,
param_distributions=param_dist,
n_iter=40,
cv=5)
Rond de random search af:
random_search.fit(X, y)
Modelvalidatie in Python