Optimizing Rare Word Accuracy in Direct Speech Translation with a Retrieval-and-Demonstration Approach

Li, Siqi; Liu, Danni; Niehues, Jan

Computer Science > Computation and Language

arXiv:2409.09009 (cs)

[Submitted on 13 Sep 2024 (v1), last revised 1 Oct 2024 (this version, v2)]

Title:Optimizing Rare Word Accuracy in Direct Speech Translation with a Retrieval-and-Demonstration Approach

Authors:Siqi Li, Danni Liu, Jan Niehues

View PDF HTML (experimental)

Abstract:Direct speech translation (ST) models often struggle with rare words. Incorrect translation of these words can have severe consequences, impacting translation quality and user trust. While rare word translation is inherently challenging for neural models due to sparse learning signals, real-world scenarios often allow access to translations of past recordings on similar topics. To leverage these valuable resources, we propose a retrieval-and-demonstration approach to enhance rare word translation accuracy in direct ST models. First, we adapt existing ST models to incorporate retrieved examples for rare word translation, which allows the model to benefit from prepended examples, similar to in-context learning. We then develop a cross-modal (speech-to-speech, speech-to-text, text-to-text) retriever to locate suitable examples. We demonstrate that standard ST models can be effectively adapted to leverage examples for rare word translation, improving rare word translation accuracy over the baseline by 17.6% with gold examples and 8.5% with retrieved examples. Moreover, our speech-to-speech retrieval approach outperforms other modalities and exhibits higher robustness to unseen speakers. Our code is publicly available (this https URL).

Comments:	EMNLP 2024
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2409.09009 [cs.CL]
	(or arXiv:2409.09009v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2409.09009

Submission history

From: Danni Liu [view email]
[v1] Fri, 13 Sep 2024 17:38:03 UTC (7,822 KB)
[v2] Tue, 1 Oct 2024 13:06:20 UTC (7,824 KB)

Computer Science > Computation and Language

Title:Optimizing Rare Word Accuracy in Direct Speech Translation with a Retrieval-and-Demonstration Approach

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Optimizing Rare Word Accuracy in Direct Speech Translation with a Retrieval-and-Demonstration Approach

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators