Ask Your Neurons: A Neural-based Approach to Answering Questions about Images

Malinowski, Mateusz; Rohrbach, Marcus; Fritz, Mario

Computer Science > Computer Vision and Pattern Recognition

arXiv:1505.01121 (cs)

[Submitted on 5 May 2015 (v1), last revised 1 Oct 2015 (this version, v3)]

Title:Ask Your Neurons: A Neural-based Approach to Answering Questions about Images

Authors:Mateusz Malinowski, Marcus Rohrbach, Mario Fritz

View PDF

Abstract:We address a question answering task on real-world images that is set up as a Visual Turing Test. By combining latest advances in image representation and natural language processing, we propose Neural-Image-QA, an end-to-end formulation to this problem for which all parts are trained jointly. In contrast to previous efforts, we are facing a multi-modal problem where the language output (answer) is conditioned on visual and natural language input (image and question). Our approach Neural-Image-QA doubles the performance of the previous best approach on this problem. We provide additional insights into the problem by analyzing how much information is contained only in the language part for which we provide a new human baseline. To study human consensus, which is related to the ambiguities inherent in this challenging task, we propose two novel metrics and collect additional answers which extends the original DAQUAR dataset to DAQUAR-Consensus.

Comments:	ICCV'15 (Oral)
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:1505.01121 [cs.CV]
	(or arXiv:1505.01121v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1505.01121

Submission history

From: Mateusz Malinowski [view email]
[v1] Tue, 5 May 2015 18:39:29 UTC (4,867 KB)
[v2] Wed, 6 May 2015 08:10:01 UTC (4,867 KB)
[v3] Thu, 1 Oct 2015 12:13:20 UTC (5,336 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2015-05

Change to browse by:

cs
cs.AI
cs.CL

References & Citations

DBLP - CS Bibliography

listing | bibtex

Mateusz Malinowski
Marcus Rohrbach
Mario Fritz

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Ask Your Neurons: A Neural-based Approach to Answering Questions about Images

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Ask Your Neurons: A Neural-based Approach to Answering Questions about Images

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators