Know What You Don't Need: Single-Shot Meta-Pruning for Attention Heads

Zhang, Zhengyan; Qi, Fanchao; Liu, Zhiyuan; Liu, Qun; Sun, Maosong

arXiv:2011.03770 (cs)

[Submitted on 7 Nov 2020]

Abstract: Deep pre-trained Transformer models have achieved state-of-the-art results over a variety of natural language processing (NLP) tasks. By learning rich language knowledge with millions of parameters, these models are usually overparameterized and significantly increase the computational overhead in applications. It is intuitive to address this issue by model compression. In this work, we propose a method, called Single-Shot Meta-Pruning, to compress deep pre-trained Transformers before fine-tuning. Specifically, we focus on pruning unnecessary attention heads adaptively for different downstream tasks. To measure the informativeness of attention heads, we train our Single-Shot Meta-Pruner (SMP) with a meta-learning paradigm aiming to maintain the distribution of text representations after pruning. Compared with existing compression methods for pre-trained models, our method can reduce the overhead of both fine-tuning and inference. Experimental results show that our pruner can selectively prune 50% of attention heads with little impact on the performance on downstream tasks and even provide better text representations. The source code will be released in the future.
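
The general idea of pruning attention heads by an informativeness score can be illustrated with a minimal sketch. This is not the authors' Single-Shot Meta-Pruner: the abstract does not specify how SMP computes its scores, so the scores below are made-up numbers, and the function names `top_k_head_mask` and `mask_heads` are hypothetical. The sketch only shows how a 50% keep ratio could translate into a per-head binary mask applied to head outputs.

```python
def top_k_head_mask(scores, keep_ratio=0.5):
    """Build a 0/1 mask that keeps the highest-scoring heads across all layers.

    scores: list of rows, one per layer, each holding one (hypothetical)
            informativeness score per attention head.
    keep_ratio: fraction of heads to keep; the head count is rounded down.
    """
    flat = [(s, l, h) for l, row in enumerate(scores) for h, s in enumerate(row)]
    k = int(keep_ratio * len(flat))
    # Rank heads from most to least informative and keep the first k.
    ranked = sorted(flat, key=lambda t: -t[0])
    keep = {(l, h) for _, l, h in ranked[:k]}
    return [[1 if (l, h) in keep else 0 for h in range(len(row))]
            for l, row in enumerate(scores)]


def mask_heads(head_outputs, layer_mask):
    """Zero out the outputs of pruned heads in a single layer.

    head_outputs: nested list indexed as [head][token][feature].
    layer_mask: one 0/1 entry per head in this layer.
    """
    return [[[x * m for x in token] for token in head]
            for head, m in zip(head_outputs, layer_mask)]


# Assumed example: 2 layers with 4 heads each, scores invented for illustration.
scores = [[0.9, 0.1, 0.5, 0.3],
          [0.2, 0.8, 0.4, 0.7]]
mask = top_k_head_mask(scores, keep_ratio=0.5)

# One layer's head outputs: 4 heads, 2 tokens, 3 features, all ones.
outputs = [[[1.0] * 3 for _ in range(2)] for _ in range(4)]
pruned = mask_heads(outputs, mask[0])
```

Multiplying by a mask only simulates pruning. Reducing the fine-tuning and inference overhead, as the abstract describes, would additionally require removing the pruned heads' parameters from the model.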

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2011.03770 [cs.CL]
	(or arXiv:2011.03770v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2011.03770

Submission history

From: Zhengyan Zhang
[v1] Sat, 7 Nov 2020 12:58:37 UTC (3,211 KB)
