Refresher on the text mining workflow
Sentiment Analysis in R
So far ...
polarity()
- tidytext, dplyr, tidyr
- Visualizations
6 defined steps
- Define the problem & specific goals
- Identify the text
- Organize the text
- Extract features
- Analyze
- Draw a conclusion/reach an insight
Step 1: Define your problem
Tips:
- Be precise
- Avoid a "scope creep"
- Iterate and try new methods and/or subjectivity lexicons to ensure some consistency
Step 2: ID your text
Tips:
- Find appropriate sources (e.g. searching Wikipedia for stock prices may make less sense than examining a stock forum)
- Follow the terms of service for a site, be mindful of web scraping
- Text sources affect the language used...become familiar with the source's tone and nuances
Let's practice!
Sentiment Analysis in R
Preparing Video For Download...