Cross-lingual Emotion Detection

Hassan, Sabit; Shaar, Shaden; Darwish, Kareem

Computer Science > Computation and Language

arXiv:2106.06017 (cs)

[Submitted on 10 Jun 2021 (v1), last revised 4 May 2022 (this version, v2)]

Title:Cross-lingual Emotion Detection

Authors:Sabit Hassan, Shaden Shaar, Kareem Darwish

View PDF

Abstract:Emotion detection can provide us with a window into understanding human behavior. Due to the complex dynamics of human emotions, however, constructing annotated datasets to train automated models can be expensive. Thus, we explore the efficacy of cross-lingual approaches that would use data from a source language to build models for emotion detection in a target language. We compare three approaches, namely: i) using inherently multilingual models; ii) translating training data into the target language; and iii) using an automatically tagged parallel corpus. In our study, we consider English as the source language with Arabic and Spanish as target languages. We study the effectiveness of different classification models such as BERT and SVMs trained with different features. Our BERT-based monolingual models that are trained on target language data surpass state-of-the-art (SOTA) by 4% and 5% absolute Jaccard score for Arabic and Spanish respectively. Next, we show that using cross-lingual approaches with English data alone, we can achieve more than 90% and 80% relative effectiveness of the Arabic and Spanish BERT models respectively. Lastly, we use LIME to analyze the challenges of training cross-lingual models for different language pairs

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2106.06017 [cs.CL]
	(or arXiv:2106.06017v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2106.06017

Submission history

From: Sabit Hassan [view email]
[v1] Thu, 10 Jun 2021 19:52:06 UTC (7,827 KB)
[v2] Wed, 4 May 2022 23:51:03 UTC (8,444 KB)

Computer Science > Computation and Language

Title:Cross-lingual Emotion Detection

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Cross-lingual Emotion Detection

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators