Transcormer: Transformer for Sentence Scoring with Sliding Language Modeling

Song, Kaitao; Leng, Yichong; Tan, Xu; Zou, Yicheng; Qin, Tao; Li, Dongsheng

Computer Science > Computation and Language

arXiv:2205.12986 (cs)

[Submitted on 25 May 2022 (v1), last revised 19 Oct 2022 (this version, v4)]

Title:Transcormer: Transformer for Sentence Scoring with Sliding Language Modeling

Authors:Kaitao Song, Yichong Leng, Xu Tan, Yicheng Zou, Tao Qin, Dongsheng Li

View PDF

Abstract:Sentence scoring aims at measuring the likelihood score of a sentence and is widely used in many natural language processing scenarios, like reranking, which is to select the best sentence from multiple candidates. Previous works on sentence scoring mainly adopted either causal language modeling (CLM) like GPT or masked language modeling (MLM) like BERT, which have some limitations: 1) CLM only utilizes unidirectional information for the probability estimation of a sentence without considering bidirectional context, which affects the scoring quality; 2) MLM can only estimate the probability of partial tokens at a time and thus requires multiple forward passes to estimate the probability of the whole sentence, which incurs large computation and time cost. In this paper, we propose \textit{Transcormer} -- a Transformer model with a novel \textit{sliding language modeling} (SLM) for sentence scoring. Specifically, our SLM adopts a triple-stream self-attention mechanism to estimate the probability of all tokens in a sentence with bidirectional context and only requires a single forward pass. SLM can avoid the limitations of CLM (only unidirectional context) and MLM (multiple forward passes) and inherit their advantages, and thus achieve high effectiveness and efficiency in scoring. Experimental results on multiple tasks demonstrate that our method achieves better performance than other language modelings.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2205.12986 [cs.CL]
	(or arXiv:2205.12986v4 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2205.12986

Submission history

From: Kaitao Song [view email]
[v1] Wed, 25 May 2022 18:00:09 UTC (1,011 KB)
[v2] Sat, 28 May 2022 09:04:28 UTC (1,010 KB)
[v3] Sun, 5 Jun 2022 14:55:32 UTC (1,010 KB)
[v4] Wed, 19 Oct 2022 03:15:21 UTC (1,015 KB)

Computer Science > Computation and Language

Title:Transcormer: Transformer for Sentence Scoring with Sliding Language Modeling

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Transcormer: Transformer for Sentence Scoring with Sliding Language Modeling

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators