A Universal Discriminator for Zero-Shot Generalization

Xu, Haike; Lin, Zongyu; Zhou, Jing; Zheng, Yanan; Yang, Zhilin

Computer Science > Computation and Language

arXiv:2211.08099 (cs)

[Submitted on 15 Nov 2022 (v1), last revised 6 Jun 2023 (this version, v2)]

Title:A Universal Discriminator for Zero-Shot Generalization

Authors:Haike Xu, Zongyu Lin, Jing Zhou, Yanan Zheng, Zhilin Yang

View PDF

Abstract:Generative modeling has been the dominant approach for large-scale pretraining and zero-shot generalization. In this work, we challenge this convention by showing that discriminative approaches perform substantially better than generative ones on a large number of NLP tasks. Technically, we train a single discriminator to predict whether a text sample comes from the true data distribution, similar to GANs. Since many NLP tasks can be formulated as selecting from a few options, we use this discriminator to predict the concatenation of input and which option has the highest probability of coming from the true data distribution. This simple formulation achieves state-of-the-art zero-shot results on the T0 benchmark, outperforming T0 by 16.0\%, 7.8\%, and 11.5\% respectively on different scales. In the finetuning setting, our approach also achieves new state-of-the-art results on a wide range of NLP tasks, with only 1/4 parameters of previous methods. Meanwhile, our approach requires minimal prompting efforts, which largely improves robustness and is essential for real-world applications. Furthermore, we also jointly train a generalized UD in combination with generative tasks, which maintains its advantage on discriminative tasks and simultaneously works on generative tasks.

Comments:	ACL 2023 main conference (Long paper)
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2211.08099 [cs.CL]
	(or arXiv:2211.08099v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2211.08099

Submission history

From: Haike Xu [view email]
[v1] Tue, 15 Nov 2022 12:33:31 UTC (50,489 KB)
[v2] Tue, 6 Jun 2023 03:01:43 UTC (6,034 KB)

Computer Science > Computation and Language

Title:A Universal Discriminator for Zero-Shot Generalization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:A Universal Discriminator for Zero-Shot Generalization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators