Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion Algorithms

Trabelsi, Firas; Vilar, David; Finkelstein, Mara; Freitag, Markus

arXiv:2406.02832 (cs)

[Submitted on 5 Jun 2024]

Abstract: Minimum Bayes Risk (MBR) decoding is a powerful decoding strategy widely used for text generation tasks, but its quadratic computational complexity limits its practical application. This paper presents a novel approach for approximating MBR decoding using matrix completion techniques, focusing on the task of machine translation. We formulate MBR decoding as a matrix completion problem, where the utility metric scores between candidate hypotheses and pseudo-reference translations form a low-rank matrix. First, we empirically show that the score matrices indeed have a low-rank structure. Then, we exploit this by computing only a random subset of the scores and efficiently recovering the missing entries of the matrix with the Alternating Least Squares (ALS) algorithm, thereby enabling a fast approximation of the MBR decoding process. Our experimental results on machine translation tasks demonstrate that the proposed method requires only 1/16 of the utility metric computations of vanilla MBR decoding while achieving equal translation quality, as measured by COMET22 on the WMT22 dataset (en<>de and en<>ru). We also benchmark our method against other approximation methods and show gains in quality over them.
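
The procedure described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a NumPy array `scores` of utility values (rows are candidate hypotheses, columns are pseudo-references) and a boolean `mask` marking the entries that were actually computed. The helper names `als_complete` and `mbr_select`, as well as the rank, regularization strength, and iteration count, are hypothetical choices made for illustration, not values taken from the paper.

```python
import numpy as np


def als_complete(scores, mask, rank=2, reg=0.1, iters=20, seed=0):
    """Fill unobserved entries of `scores` using regularized ALS.

    scores: (n, m) array; only entries where mask is True are trusted.
    mask:   (n, m) boolean array of observed entries.
    All default hyperparameters are illustrative assumptions.
    """
    rng = np.random.default_rng(seed)
    n, m = scores.shape
    # Low-rank factors: scores is approximated by U @ V.T
    U = rng.normal(scale=0.1, size=(n, rank))
    V = rng.normal(scale=0.1, size=(m, rank))
    ridge = reg * np.eye(rank)

    for _ in range(iters):
        # Fix V, solve a small ridge regression for each row factor.
        for i in range(n):
            cols = mask[i]
            Vo = V[cols]
            U[i] = np.linalg.solve(Vo.T @ Vo + ridge, Vo.T @ scores[i, cols])
        # Fix U, solve a small ridge regression for each column factor.
        for j in range(m):
            rows = mask[:, j]
            Uo = U[rows]
            V[j] = np.linalg.solve(Uo.T @ Uo + ridge, Uo.T @ scores[rows, j])

    # Keep the computed scores, use the low-rank estimate elsewhere.
    return np.where(mask, scores, U @ V.T)


def mbr_select(score_matrix):
    """Return the index of the hypothesis with the highest mean utility."""
    return int(np.argmax(score_matrix.mean(axis=1)))
```

In use, one would compute the utility metric only for a randomly sampled subset of hypothesis/reference pairs, pass the partially filled matrix and its mask to `als_complete`, and then call `mbr_select` on the completed matrix. Each ALS step solves only a `rank` by `rank` linear system, so the completion cost is small compared with evaluating a neural utility metric on every pair.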

Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2406.02832 [cs.CL]
	(or arXiv:2406.02832v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2406.02832

Submission history

From: Firas Trabelsi
[v1] Wed, 5 Jun 2024 00:54:03 UTC (508 KB)
