Sentimentanalyse in Python
Violeta Misheva
Data Scientist
Ik ben blij, niet verdrietig.
Ik ben verdrietig, niet blij.
Unigrams: losse tokens
Bigrams: paren van tokens
Trigrams: drie tokens
n-grams: reeks van n tokens
Het weer vandaag is geweldig.
Unigrams: { The, weather, today, is, wonderful }
Bigrams: {The weather, weather today, today is, is wonderful}
Trigrams: {The weather today, weather today is, today is wonderful}
from sklearn.feature_extraction.text import CountVectorizer
vect = CountVectorizer(ngram_range=(min_n, max_n))
# Alleen unigrams
ngram_range=(1, 1)
# Uni- en bigrams
ngram_range=(1, 2)
CountVectorizer(max_features, max_df, min_df)
Sentimentanalyse in Python