Additional NLP analysis

Introduction to Natural Language Processing in R

Kasey Jones

Research Data Scientist

BERT and ERNIE

What is it:

  • BERT: Bidirectional Encoder Representations from Transformers
  • a model used in transfer learning for NLP tasks
  • pre-trained on unlabeled data to create a language representation
  • requires only a small amount of labeled data to fine-tune for a specific task

What is it used for:

  • supervised tasks
  • to create features for NLP models

ERNIE: Enhanced Representation through kNowledge IntEgration

Named Entity Recognition

What is it:

  • classifies named entities within text
  • Examples: names, locations, organizations, values

What is it used for:

  • extracting entities from tweets
  • aiding recommendation engines
  • search algorithms
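
To make the idea of classifying entities concrete, here is a minimal sketch in Python using only the standard library. It is a dictionary (gazetteer) lookup, not a trained NER model of the kind used in practice. The entity lists, the labels, and the `find_entities` helper are all hypothetical and exist only for illustration.

```python
import re

# Hypothetical gazetteer: a tiny hand-made lookup table, not a trained model.
GAZETTEER = {
    "PERSON": ["Ada Lovelace", "Alan Turing"],
    "LOCATION": ["London", "Paris"],
    "ORGANIZATION": ["Royal Society"],
}


def find_entities(text):
    """Return (entity_text, label, start_offset) tuples in order of appearance."""
    found = []
    for label, names in GAZETTEER.items():
        for name in names:
            # Word boundaries prevent matches inside longer words.
            pattern = r"\b" + re.escape(name) + r"\b"
            for match in re.finditer(pattern, text):
                found.append((match.group(), label, match.start()))
    return sorted(found, key=lambda item: item[2])


tweet = "Ada Lovelace spoke at the Royal Society in London."
entities = find_entities(tweet)
for entity_text, label, _ in entities:
    print(f"{label:<13}{entity_text}")
```

A real NER system learns to recognize entities it has never seen before; a fixed lookup like this one only finds the names it was given, which is why it should be read as an illustration of the input and output rather than of the method.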

Part-of-speech tagging

What is it:

  • tagging words with their part-of-speech
    • nouns, verbs, adjectives, etc.

How is it used:

  • aids in sentiment analysis
  • creates features for NLP models
  • enhances what a model knows about each word in text
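
As a rough illustration of how tags can become features, the following Python sketch tags words with a small lookup table and then pulls out the adjectives, which often carry sentiment. The lexicon, the tag names, and the `tag` helper are assumptions made up for this example; real taggers use trained statistical models and consider the surrounding context.

```python
import re

# Hypothetical mini-lexicon, invented for this illustration only.
LEXICON = {
    "the": "DET", "a": "DET",
    "movie": "NOUN", "plot": "NOUN",
    "was": "VERB", "felt": "VERB",
    "great": "ADJ", "slow": "ADJ",
    "but": "CONJ",
}


def tag(sentence):
    """Lowercase and tokenize the sentence, then look up each token's tag."""
    tokens = re.findall(r"[a-z']+", sentence.lower())
    # Words missing from the lexicon receive the placeholder tag "UNK".
    return [(token, LEXICON.get(token, "UNK")) for token in tokens]


tagged = tag("The movie was great but the plot felt slow.")

# Adjectives are one simple feature a sentiment model might use.
adjectives = [word for word, pos in tagged if pos == "ADJ"]
print(tagged)
print(adjectives)
```

A lookup table cannot resolve words whose part of speech depends on context (for example "plot" as a noun or a verb), which is the main thing a trained tagger adds.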

Let's recap.
