Machine Reading, Fast and Slow: When Do Models "Understand" Language?

Choudhury, Sagnik Ray; Rogers, Anna; Augenstein, Isabelle

Computer Science > Computation and Language

arXiv:2209.07430 (cs)

[Submitted on 15 Sep 2022]

Title:Machine Reading, Fast and Slow: When Do Models "Understand" Language?

Authors:Sagnik Ray Choudhury, Anna Rogers, Isabelle Augenstein

View PDF

Abstract:Two of the most fundamental challenges in Natural Language Understanding (NLU) at present are: (a) how to establish whether deep learning-based models score highly on NLU benchmarks for the 'right' reasons; and (b) to understand what those reasons would even be. We investigate the behavior of reading comprehension models with respect to two linguistic 'skills': coreference resolution and comparison. We propose a definition for the reasoning steps expected from a system that would be 'reading slowly', and compare that with the behavior of five models of the BERT family of various sizes, observed through saliency scores and counterfactual explanations. We find that for comparison (but not coreference) the systems based on larger encoders are more likely to rely on the 'right' information, but even they struggle with generalization, suggesting that they still learn specific lexical patterns rather than the general principles of comparison.

Comments:	Accepted COLING 2022
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2209.07430 [cs.CL]
	(or arXiv:2209.07430v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2209.07430

Submission history

From: Sagnik Ray Choudhury [view email]
[v1] Thu, 15 Sep 2022 16:25:44 UTC (896 KB)

Computer Science > Computation and Language

Title:Machine Reading, Fast and Slow: When Do Models "Understand" Language?

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Machine Reading, Fast and Slow: When Do Models "Understand" Language?

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators