Loneliness Episodes: A Japanese Dataset for Loneliness Detection and Analysis

Naoya Fujikawa, Nguyen Toan, Kazuhiro Ito, Shoko Wakamiya, Eiji Aramaki

Abstract

Loneliness, a significant public health concern, is closely connected to both physical and mental well-being. Hence, detection and intervention for individuals experiencing loneliness are crucial. Identifying loneliness in text is straightforward when it is explicitly stated but challenging when it is implicit. Detecting implicit loneliness requires a manually annotated dataset because whereas explicit loneliness can be detected using keywords, implicit loneliness cannot be. However, there are no freely available datasets with clear annotation guidelines for implicit loneliness. In this study, we construct a freely accessible Japanese loneliness dataset with annotation guidelines grounded in the psychological definition of loneliness. This dataset covers loneliness intensity and the contributing factors of loneliness. We train two models to classify whether loneliness is expressed and the intensity of loneliness. The model classifying loneliness versus non-loneliness achieves an F1-score of 0.833, but the model for identifying the intensity of loneliness has a low F1-score of 0.400, which is likely due to label imbalance and a shortage of a certain label in the dataset. We validate performance in another domain, specifically X (formerly Twitter), and observe a decrease. In addition, we propose improvement suggestions for domain adaptation.

Anthology ID:: 2024.wassa-1.23
Volume:: Proceedings of the 14th Workshop on Computational Approaches to Subjectivity, Sentiment, & Social Media Analysis
Month:: August
Year:: 2024
Address:: Bangkok, Thailand
Editors:: Orphée De Clercq, Valentin Barriere, Jeremy Barnes, Roman Klinger, João Sedoc, Shabnam Tafreshi
Venues:: WASSA | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 280–293
Language:
URL:: https://aclanthology.org/2024.wassa-1.23
DOI:: 10.18653/v1/2024.wassa-1.23
Bibkey:
Cite (ACL):: Naoya Fujikawa, Nguyen Toan, Kazuhiro Ito, Shoko Wakamiya, and Eiji Aramaki. 2024. Loneliness Episodes: A Japanese Dataset for Loneliness Detection and Analysis. In Proceedings of the 14th Workshop on Computational Approaches to Subjectivity, Sentiment, & Social Media Analysis, pages 280–293, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):: Loneliness Episodes: A Japanese Dataset for Loneliness Detection and Analysis (Fujikawa et al., WASSA-WS 2024)
Copy Citation:
PDF:: https://aclanthology.org/2024.wassa-1.23.pdf

PDF Cite Search