Unsupervised Learning em Python
Benjamin Wilson
Director of Research at lateral.io

PCA(n_components=2)samples = array com medidas do iris (4 features)species = lista com códigos das espéciesfrom sklearn.decomposition import PCApca = PCA(n_components=2)pca.fit(samples)
PCA(n_components=2)
transformed = pca.transform(samples)
print(transformed.shape)
(150, 2)
import matplotlib.pyplot as plt
xs = transformed[:,0]
ys = transformed[:,1]
plt.scatter(xs, ys, c=species)
plt.show()


scipy.sparse.csr_matrix em vez de array do NumPycsr_matrix guarda só entradas não zero (economiza espaço!)
PCA do scikit-learn não suporta csr_matrixTruncatedSVD do scikit-learnfrom sklearn.decomposition import TruncatedSVD
model = TruncatedSVD(n_components=3)
model.fit(documents) # documents is csr_matrix
transformed = model.transform(documents)
Unsupervised Learning em Python