Multilingual NER with polyglot

Introduction to Natural Language Processing in Python

Katharine Jarmul

Founder, kjamistan

What is polyglot?

  • NLP library which uses word vectors
  • Why polyglot?
    • Vectors for many different languages
    • More than 130!

transliteration with polyglot

Introduction to Natural Language Processing in Python

Spanish NER with polyglot

from polyglot.text import Text
 ?ext = """El presidente de la Generalitat de Cataluña,
                  Carles Puigdemont, ha afirmado hoy a la alcaldesa 
                  de Madrid, Manuela Carmena, que en su etapa de 
                  alcalde de Girona (de julio de 2011 a enero de 2016) 
                  hizo una gran promoción de Madrid."""
ptext = Text(text)

ptext.entities
[I-ORG(['Generalitat', 'de']),
 I-LOC(['Generalitat', 'de', 'Cataluña']),
 I-PER(['Carles', 'Puigdemont']),
 I-LOC(['Madrid']),
 I-PER(['Manuela', 'Carmena']),
 I-LOC(['Girona']),
 I-LOC(['Madrid'])]
Introduction to Natural Language Processing in Python

Let's practice!

Introduction to Natural Language Processing in Python

Preparing Video For Download...