Panning for gold: Comparative analysis of cross-platform approaches for automated detection of political content in textual data.

Makhortykh M; de León E; Urman A; Gil-Lopez T; Christner C; Sydorova M; Adam S; Maier M

doi:10.1371/journal.pone.0312865

Panning for gold: Comparative analysis of cross-platform approaches for automated detection of political content in textual data.

Affiliations

1. Institute of Communication and Media Studies, University of Bern, Bern, Switzerland.
Authors
Makhortykh M¹
de León E¹
Sydorova M¹
Adam S¹
(4 authors)
2. Social Computing Group, University of Zurich, Zurich, Switzerland.
Authors
Urman A²
(1 author)
3. Department of Communication, University Carlos III of Madrid, Madrid, Spain.
Authors
Gil-Lopez T³
(1 author)
4. Institute for Communication Psychology and Media Education, Rheinland-Pfälzische Technische Universität Kaiserslautern-Landau, Landau, Germany.
Authors
Christner C⁴
Maier M⁴
(2 authors)

ORCIDs linked to this article

Plos one, 18 Nov 2024, 19(11):e0312865
https://doi.org/10.1371/journal.pone.0312865 PMID: 39556542

Abstract

To understand and measure political information consumption in the high-choice media environment, we need new methods to trace individual interactions with online content and novel techniques to analyse and detect politics-related information. In this paper, we report the results of a comparative analysis of the performance of automated content analysis techniques for detecting political content in the German language across different platforms. Using three validation datasets, we compare the performance of three groups of detection techniques relying on dictionaries, classic supervised machine learning, and deep learning. We also examine the impact of different modes of data preprocessing on the low-cost implementations of these techniques using a large set (n = 66) of models. Our results show the limited impact of preprocessing on model performance, with the best results for less noisy data being achieved by deep learning- and classic machine learning-based models, in contrast to the more robust performance of dictionary-based models on noisy data.

Full text links

Read article at publisher's site: https://doi.org/10.1371/journal.pone.0312865

Funding

Funders who supported this work.

Der Schweizerische Nationalfonds (1)

Grant ID: 100001CL_182630/1
1 publication

Deutsche Forschungsgemeinschaft (1)

Grant ID: MA 2244/9-1
6 publications

Search life-sciences literature (45,103,589 articles, preprints and more)

Panning for gold: Comparative analysis of cross-platform approaches for automated detection of political content in textual data.

Affiliations

Authors

Authors

Authors

Authors

ORCIDs linked to this article

Abstract

Full text links

Funding

Der Schweizerische Nationalfonds (1)

Deutsche Forschungsgemeinschaft (1)

Partnerships & funding

Search life-sciences literature (45,103,589 articles, preprints and more)

Panning for gold: Comparative analysis of cross-platform approaches for automated detection of political content in textual data.

Author information

Affiliations

Authors

Authors

Authors

Authors

ORCIDs linked to this article

Abstract

Full text links

Funding

Der Schweizerische Nationalfonds (1)﻿

Deutsche Forschungsgemeinschaft (1)﻿

Partnerships & funding

Der Schweizerische Nationalfonds (1)

Deutsche Forschungsgemeinschaft (1)