Mapping Knowledge Representations to Concepts: A Review and New Perspectives

Holmberg, Lars; Davidsson, Paul; Linde, Per

Computer Science > Artificial Intelligence

arXiv:2301.00189 (cs)

[Submitted on 31 Dec 2022]

Title:Mapping Knowledge Representations to Concepts: A Review and New Perspectives

Authors:Lars Holmberg, Paul Davidsson, Per Linde

View PDF

Abstract:The success of neural networks builds to a large extent on their ability to create internal knowledge representations from real-world high-dimensional data, such as images, sound, or text. Approaches to extract and present these representations, in order to explain the neural network's decisions, is an active and multifaceted research field. To gain a deeper understanding of a central aspect of this field, we have performed a targeted review focusing on research that aims to associate internal representations with human understandable concepts. In doing this, we added a perspective on the existing research by using primarily deductive nomological explanations as a proposed taxonomy. We find this taxonomy and theories of causality, useful for understanding what can be expected, and not expected, from neural network explanations. The analysis additionally uncovers an ambiguity in the reviewed literature related to the goal of model explainability; is it understanding the ML model or, is it actionable explanations useful in the deployment domain?

Comments:	10 pages, four figures, presented at AAAI-22 workshop-18: Explainable Agency in Artificial Intelligence
Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2301.00189 [cs.AI]
	(or arXiv:2301.00189v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2301.00189

Submission history

From: Lars Holmberg [view email]
[v1] Sat, 31 Dec 2022 12:56:12 UTC (281 KB)

Computer Science > Artificial Intelligence

Title:Mapping Knowledge Representations to Concepts: A Review and New Perspectives

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Mapping Knowledge Representations to Concepts: A Review and New Perspectives

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators