Training Neural Machine Translation with Semantic Similarity

Beyond BLEU:Training Neural Machine Translation with Semantic Similarity - 2019

Research Area: Machine Learning

Abstract:

While most neural machine translation (NMT) systems are still trained using maximum likelihood estimation, recent work has demonstrated that optimizing systems to directly improve evaluation metrics such as BLEU can substantially improve final translation accuracy. However, training with BLEU has some limitations: it does not assign partial credit, it has a limited range of output values, and it can penalize semantically correct hypotheses if they differ lexically from the reference. In this paper, we introduce an alternative reward function for optimizing NMT systems that is based on recent work in semantic similarity. We evaluate on four disparate languages translated to English, and find that training with our proposed metric results in better translations as evaluated by BLEU, semantic similarity, and human evaluation, and also that the optimization procedure converges faster. Analysis suggests that this is because the proposed metric is more conducive to optimization, assigning partial credit and providing more diversity in scores than BLEU.

Keywords:

Author(s) Name: John Wieting, Taylor Berg-Kirkpatrick, Kevin Gimpel, Graham Neubig

Journal name: Computer Science

Conferrence name:

Publisher name: arXiv:1909.06694

DOI: 10.48550/arXiv.1909.06694

Volume Information:

Paper Link: https://arxiv.org/abs/1909.06694

Office Address

Social List

Beyond BLEU:Training Neural Machine Translation with Semantic Similarity - 2019

Abstract:

S-Logix (OPC) Private Limited

Office Address

Beyond BLEU:Training Neural Machine Translation with Semantic Similarity - 2019

Abstract:

Related Papers