How Truncating Weights Improves Reasoning in Language Models

Chen, Lei; Bruna, Joan; Bietti, Alberto

Computer Science > Machine Learning

arXiv:2406.03068 (cs)

[Submitted on 5 Jun 2024]

Title:How Truncating Weights Improves Reasoning in Language Models

Authors:Lei Chen, Joan Bruna, Alberto Bietti

View PDF HTML (experimental)

Abstract:In addition to the ability to generate fluent text in various languages, large language models have been successful at tasks that involve basic forms of logical "reasoning" over their context. Recent work found that selectively removing certain components from weight matrices in pre-trained models can improve such reasoning capabilities. We investigate this phenomenon further by carefully studying how certain global associations tend to be stored in specific weight components or Transformer blocks, in particular feed-forward layers. Such associations may hurt predictions in reasoning tasks, and removing the corresponding components may then improve performance. We analyze how this arises during training, both empirically and theoretically, on a two-layer Transformer trained on a basic reasoning task with noise, a toy associative memory model, and on the Pythia family of pre-trained models tested on simple reasoning tasks.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
Cite as:	arXiv:2406.03068 [cs.LG]
	(or arXiv:2406.03068v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2406.03068

Submission history

From: Lei Chen [view email]
[v1] Wed, 5 Jun 2024 08:51:08 UTC (720 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2024-06

Change to browse by:

cs
cs.AI
cs.CL
stat
stat.ML

References & Citations

export BibTeX citation

Computer Science > Machine Learning

Title:How Truncating Weights Improves Reasoning in Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:How Truncating Weights Improves Reasoning in Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators