Multilingual NLP
Aug 2, 2020
An exciting research direction that we pursue at GMU NLP is building multi-lingual and polyglot systems. The languages of the world often share similar characteristics, and training systems cross-lingually allows us to leverage these similarities and overcome data scarcity issues.

George Mason NLP
The Natural Language Processing group at George Mason University. We work on multilingual models, on and on building robust NLP systems, especially for low-resource and endangered languages.
Related
- Reducing Confusion in Active Learning for Part-Of-Speech Tagging
- X-FACTR: Multilingual Factual Knowledge Retrieval from Pretrained Language Models
- It's not a Non-Issue: Negation as a Source of Error in Machine Translation
- Comparison of Interactive Knowledge Base Spelling Correction Models for Low-Resource Languages
- Automatic Extraction of Rules Governing Morphological Agreement
Posts
A note on evaluating multilingual benchmarks
A note on evaluating multilingual benchmarks Antonis Anastasopoulos, December 2019. tl;dr: Be careful when reporting averages for multilingual benchmarks, especially if making claims about multilinguality. In addition, averaging by language family can provide additional insights.