FineFake: A Knowledge-Enriched Dataset for Fine-Grained Multi-Domain Fake News Detection

Zhou, Ziyi; Zhang, Xiaoming; Zhang, Litian; Liu, Jiacheng; Wang, Senzhang; Liu, Zheng; Zhang, Xi; Li, Chaozhuo; Yu, Philip S.

arXiv:2404.01336 (cs)

[Submitted on 30 Mar 2024 (v1), last revised 15 Oct 2024 (this version, v3)]

Abstract: Existing benchmarks for fake news detection have significantly advanced models that assess the authenticity of news content. However, these benchmarks typically focus on news about a single semantic topic or from a single platform, and so fail to capture the diversity of multi-domain news in real scenarios. To understand fake news across domains, external knowledge and fine-grained annotations are indispensable: they provide precise evidence and uncover the diverse underlying strategies for fabrication. Existing benchmarks ignore both. To address this gap, we introduce a novel multi-domain knowledge-enhanced benchmark with fine-grained annotations, named FineFake. FineFake encompasses 16,909 data samples spanning six semantic topics and eight platforms. Each news item is enriched with multi-modal content, potential social context, semi-manually verified common knowledge, and fine-grained annotations that go beyond conventional binary labels. Furthermore, we formulate three challenging tasks based on FineFake and propose a knowledge-enhanced domain adaptation network. Extensive experiments are conducted on FineFake under various scenarios, providing accurate and reliable benchmarks for future work. The entire FineFake project is publicly accessible as an open-source repository at this https URL.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
Cite as:	arXiv:2404.01336 [cs.CL]
	(or arXiv:2404.01336v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2404.01336

Submission history

From: Ziyi Zhou
[v1] Sat, 30 Mar 2024 14:39:09 UTC (12,049 KB)
[v2] Sun, 28 Apr 2024 07:26:08 UTC (12,049 KB)
[v3] Tue, 15 Oct 2024 12:40:39 UTC (13,552 KB)
