Can Large Language Models Unlock Novel Scientific Research Ideas?

Kumar, Sandeep; Ghosal, Tirthankar; Goyal, Vinayak; Ekbal, Asif

arXiv:2409.06185 (cs)

[Submitted on 10 Sep 2024]

Abstract: "An idea is nothing more nor less than a new combination of old elements" (Young, J.W.). The widespread adoption of Large Language Models (LLMs) and the public availability of ChatGPT have marked a significant turning point in the integration of Artificial Intelligence (AI) into people's everyday lives. This study explores the capability of LLMs to generate novel research ideas based on information from research papers. We conduct a thorough examination of four LLMs in five domains (Chemistry, Computer, Economics, Medical, and Physics). We found that the future research ideas generated by Claude-2 and GPT-4 are more aligned with the authors' perspective than those generated by GPT-3.5 and Gemini 1.0. We also found that Claude-2 generates more diverse future research ideas than GPT-4, GPT-3.5, and Gemini 1.0. We further performed a human evaluation of the novelty, relevancy, and feasibility of the generated future research ideas. This investigation offers insights into the evolving role of LLMs in idea generation, highlighting both their capabilities and limitations. Our work contributes to the ongoing efforts in evaluating and utilizing language models for generating future research ideas. We make our datasets and code publicly available.

Comments:	24 pages, 12 figures, 6 tables
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
Cite as:	arXiv:2409.06185 [cs.CL]
	(or arXiv:2409.06185v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2409.06185

Submission history

From: Sandeep Kumar
[v1] Tue, 10 Sep 2024 03:26:42 UTC (1,830 KB)
