Rethinking Zero-shot Neural Machine Translation: From a Perspective of Latent Variables

Wang, Weizhi; Zhang, Zhirui; Du, Yichao; Chen, Boxing; Xie, Jun; Luo, Weihua

arXiv:2109.04705 (cs)

[Submitted on 10 Sep 2021]

Abstract: Zero-shot translation, i.e., directly translating between language pairs unseen in training, is a promising capability of multilingual neural machine translation (NMT). However, the maximum likelihood training objective tends to make the model capture spurious correlations between the output language and language-invariant semantics, which leads to poor transfer performance on zero-shot translation. In this paper, we introduce a denoising autoencoder objective based on the pivot language into the traditional training objective to improve translation accuracy in zero-shot directions. A theoretical analysis from the perspective of latent variables shows that our approach implicitly maximizes the probability distributions for zero-shot directions. On two benchmark machine translation datasets, we demonstrate that the proposed method effectively eliminates the spurious correlations and significantly outperforms state-of-the-art methods. Our code is available at this https URL.
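
As a rough illustration of the denoising-autoencoder idea mentioned in the abstract, the following Python sketch builds one training pair whose source is a corrupted pivot-language sentence and whose target is the clean sentence. The noise scheme (token dropping and masking), the `<2en>` target-language tag, the `<mask>` token, and the function names `add_noise` and `make_dae_pair` are all assumptions made for illustration. The abstract does not specify them, and this is not the authors' released code.

```python
import random


def add_noise(tokens, drop_prob=0.1, mask_prob=0.1, mask_token="<mask>", seed=None):
    """Corrupt a token list by randomly dropping or masking tokens.

    Hypothetical noise function: the probabilities and the mask token
    are illustrative defaults, not values taken from the paper.
    """
    rng = random.Random(seed)
    noisy = []
    for tok in tokens:
        r = rng.random()
        if r < drop_prob:
            continue  # drop this token
        if r < drop_prob + mask_prob:
            noisy.append(mask_token)  # replace this token with a mask
        else:
            noisy.append(tok)  # keep this token unchanged
    # Avoid an empty source sequence if every token was dropped.
    return noisy or [mask_token]


def make_dae_pair(pivot_sentence, lang_tag="<2en>", **noise_kwargs):
    """Return (noisy source, clean target) for one pivot-language sentence.

    The language tag prepended to the source is an assumed convention
    for telling a multilingual model which language to generate.
    """
    tokens = pivot_sentence.split()
    source = [lang_tag] + add_noise(tokens, **noise_kwargs)
    return " ".join(source), pivot_sentence


if __name__ == "__main__":
    src, tgt = make_dae_pair("the cat sat on the mat", seed=0)
    print(src)
    print(tgt)
```

Under these assumptions, pairs produced this way would simply be mixed with the ordinary parallel data, so the model is trained with the usual maximum likelihood loss on both translation and reconstruction examples.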

Comments:	EMNLP Findings 2021
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2109.04705 [cs.CL]
	(or arXiv:2109.04705v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2109.04705

Submission history

From: Weizhi Wang
[v1] Fri, 10 Sep 2021 07:18:53 UTC (445 KB)
