Practicing Machine Learning Interview Questions in Python
Lisa Stuart
Data Scientist
"An appropriate dissimilarity measure is far more important in obtaining success with clustering than choice of clustering algorithm." - from Elements of Statistical Learning
Function/method | returns |
---|---|
sklearn.cluster.Kmeans |
K-Means clustering algorithm |
sklearn.cluster.AgglomerativeClustering |
Agglomerative clustering algorithm |
kmeans.inertia_ |
SS distances of observations to closest cluster center |
scipy.cluster.hierarchy as sch |
Hierachical clustering for dendrograms |
sch.dendrogram() |
Dendrogram function |
Practicing Machine Learning Interview Questions in Python