Attention Weights in Transformer NMT Fail Aligning Words Between Sequences but Largely Explain Model Predictions

Ferrando, Javier; Costa-jussà, Marta R.

Computer Science > Computation and Language

arXiv:2109.05853 (cs)

[Submitted on 13 Sep 2021]

Abstract: This work proposes an extensive analysis of the Transformer architecture in the Neural Machine Translation (NMT) setting. Focusing on the encoder-decoder attention mechanism, we prove that attention weights systematically make alignment errors by relying mainly on uninformative tokens from the source sequence. However, we observe that NMT models assign attention to these tokens to regulate the contribution in the prediction of the two contexts, the source and the prefix of the target sequence. We provide evidence about the influence of wrong alignments on the model behavior, demonstrating that the encoder-decoder attention mechanism is well suited as an interpretability method for NMT. Finally, based on our analysis, we propose methods that largely reduce the word alignment error rate compared to standard induced alignments from attention weights.
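
The abstract compares against "standard induced alignments from attention weights" and reports word alignment error rate. As a rough illustration of that baseline only, the sketch below assumes the common argmax heuristic (each target position is linked to its highest-attention source position) and the usual alignment error rate definition over sure and possible gold links. The function names, the toy attention matrix, and the gold links are hypothetical; this is not the method the authors propose.

```python
from typing import List, Set, Tuple

# An alignment is a set of (source_index, target_index) links.
Alignment = Set[Tuple[int, int]]


def induce_alignment(attention: List[List[float]]) -> Alignment:
    """Link each target position to its highest-attention source position.

    attention[t][s] is the weight that target step t puts on source token s.
    This argmax rule is an assumed baseline, not the paper's proposal.
    """
    links: Alignment = set()
    for t, row in enumerate(attention):
        best_source = max(range(len(row)), key=row.__getitem__)
        links.add((best_source, t))
    return links


def alignment_error_rate(hyp: Alignment, sure: Alignment, possible: Alignment) -> float:
    """AER = 1 - (|A & S| + |A & P|) / (|A| + |S|), with S treated as a subset of P."""
    possible = possible | sure
    denominator = len(hyp) + len(sure)
    if denominator == 0:
        return 0.0
    return 1.0 - (len(hyp & sure) + len(hyp & possible)) / denominator


# Toy data (hypothetical): 3 target steps over 4 source tokens, where the last
# source token stands in for an uninformative token such as end-of-sentence.
toy_attention = [
    [0.7, 0.1, 0.1, 0.1],
    [0.1, 0.2, 0.1, 0.6],  # most weight lands on the uninformative token
    [0.1, 0.1, 0.6, 0.2],
]
gold_sure = {(0, 0), (1, 1), (2, 2)}
gold_possible: Alignment = set()

toy_links = induce_alignment(toy_attention)
toy_aer = alignment_error_rate(toy_links, gold_sure, gold_possible)
print(sorted(toy_links), round(toy_aer, 3))
```

In this toy case the second target step is linked to the uninformative source token rather than its gold counterpart, which is the kind of alignment error the abstract describes.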

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2109.05853 [cs.CL]
	(or arXiv:2109.05853v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2109.05853

Submission history

From: Javier Ferrando
[v1] Mon, 13 Sep 2021 10:44:02 UTC (2,560 KB)
