Analisis Sentimen dengan Python
Violeta Misheva
Data Scientist
Saya bahagia, bukan sedih.
Saya sedih, bukan bahagia.
Unigram: token tunggal
Bigram: pasangan token
Trigram: tiga token
n-gram: urutan n token
Cuaca hari ini indah.
Unigram: { The, weather, today, is, wonderful }
Bigram: {The weather, weather today, today is, is wonderful}
Trigram: {The weather today, weather today is, today is wonderful}
from sklearn.feature_extraction.text import CountVectorizer
vect = CountVectorizer(ngram_range=(min_n, max_n))
# Hanya unigram
ngram_range=(1, 1)
# Uni- dan bigram
ngram_range=(1, 2)
CountVectorizer(max_features, max_df, min_df)
Analisis Sentimen dengan Python