It's not a Non-Issue: Negation as a Source of Error in Machine Translation

Md Mosharaf Hossain, Antonios Anastasopoulos, Eduardo Blanco, Alexis Palmer

November 2020

Abstract

As machine translation (MT) systems progress at a rapid pace, questions of their adequacy linger. In this study, we focus on negation, a universal, core property of human language that significantly affects the semantics of an utterance. We investigate whether translating negation is an issue for modern MT systems, using 17 translation directions as a test bed. Through thorough analysis, we find that the presence of negation can indeed significantly impact downstream quality, in some cases resulting in quality reductions of more than 60%. We also provide a linguistically motivated analysis that directly explains the majority of our findings. We release our annotations and the code to replicate our analysis here: https://github.com/mosharafhossain/negation-mt.

Type

Conference paper

Publication

Findings of EMNLP

MT multilingual NLP

Antonios Anastasopoulos

Assistant Professor

I work on multilingual models, machine translation, speech recognition, and NLP for under-served languages.
