More Web Proxy on the site http://driver.im/

abstract

Findings of shared task on Sentiment Analysis and Homophobia Detection of YouTube Comments in Code-Mixed Dravidian Languages

Authors:

Subalalitha Chinnaudayar Navaneethakrishnan,

Bharathi Raja Chakravarthi,

Kogilavani Shanmugavadivel,

Malliga Subramanian,

Prasanna Kumar Kumaresan,

Lavanya Sambath Kumar,

Rahul PonnusamyAuthors Info & Claims

FIRE '22: Proceedings of the 14th Annual Meeting of the Forum for Information Retrieval Evaluation

Pages 18 - 21

https://doi.org/10.1145/3574318.3574347

Published: 12 January 2023 Publication History

Abstract

We present an overview of sentiment analysis and homophobia detection of YouTube comments in code-mixed Dravidian languages in this paper. We provide the details of this task and the submitted systems for the tasks. We introduce two studies: task A for detecting sentiment analysis and task B on homophobia detection, which is organized by the FIRE 2022. A total of 95 participants registered for the shared task, 13 teams finally submitted their results for task-A a, and 10 teams submitted their results for task B. The teams explored tasks A and B using traditional machine learning and deep learning models. Most of the benchmark systems have been analyzed by participants capable of handling code-mixed scenarios in Dravidian languages.

References

[1]

R Anita and CN Subalalitha. 2019. An approach to cluster Tamil literatures using discourse connectives. In 2019 IEEE 1st International Conference on Energy, Systems and Information Processing (ICESIP). IEEE, 1–4.

[2]

Bharathi Raja Chakravarthi. 2022. Hope speech detection in YouTube comments. Social Network Analysis and Mining 12, 1 (2022), 1–19.

[3]

Bharathi Raja Chakravarthi. 2022. Multilingual hope speech detection in English and Dravidian languages. International Journal of Data Science and Analytics 14, 4 (2022), 389–406.

[4]

Bharathi Raja Chakravarthi, Adeep Hande, Rahul Ponnusamy, Prasanna Kumar Kumaresan, and Ruba Priyadharshini. 2022. How can we detect Homophobia and Transphobia? Experiments in a multilingual code-mixed setting for social media governance. International Journal of Information Management Data Insights 2, 2(2022), 100119.

[5]

Bharathi Raja Chakravarthi and Vigneshwaran Muralidaran. 2021. Findings of the shared task on hope speech detection for equality, diversity, and inclusion. In Proceedings of the first workshop on language technology for equality, diversity and inclusion. 61–72.

[6]

Bharathi Raja Chakravarthi, Vigneshwaran Muralidaran, Ruba Priyadharshini, Chinnaudayar Navaneethakrishnan Subalalitha, John Philip McCrae, Miguel Ángel García, Salud María Jiménez-Zafra, Rafael Valencia-García, Prasanna Kumaresan, Rahul Ponnusamy, 2022. Overview of the Shared Task on Hope Speech Detection for Equality, Diversity, and Inclusion. In Proceedings of the Second Workshop on Language Technology for Equality, Diversity and Inclusion. 378–388.

[7]

Bharathi Raja Chakravarthi, Ruba Priyadharshini, Thenmozhi Durairaj, John Philip McCrae, Paul Buitelaar, Prasanna Kumaresan, and Rahul Ponnusamy. 2022. Overview of The Shared Task on Homophobia and Transphobia Detection in Social Media Comments. In Proceedings of the Second Workshop on Language Technology for Equality, Diversity and Inclusion. 369–377.

[8]

Supriya Chanda, Anshika Mishra, and Sukomal Pal. 2022. Sentiment Analysis and Homophobia detection of Code-Mixed Dravidian Languages leveraging pre-trained model and word-level language tag. In Working Notes of FIRE 2022 - Forum for Information Retrieval Evaluation (Hybrid). CEUR.

[9]

Asha "Hegde and H.L." Shashirekha. 2022. Leveraging Dynamic Meta Embedding for Sentiment Analysis and Detection of Homophobic/Transphobic Content in Code-mixed Dravidian Languages. In Working Notes of FIRE 2022 - Forum for Information Retrieval Evaluation (Hybrid). CEUR.

[10]

Manoj Balaji J and Chinmaya Hs. 2022. A Study on Sentimental Analysis, Homophobia-Transphobia Detection for Dravidian Languages. In Working Notes of FIRE 2022 - Forum for Information Retrieval Evaluation (Hybrid). CEUR.

[11]

S. K. Lavanya and Chinnaudayar Navaneethakrishnan Subalalitha. 2022. Building Tamil Text Dataset on LGBTQIA and Offensive Language Detection using Multilingual BERT. In 2022 International Conference on Inventive Computation Technologies (ICICT). IEEE, 489–496.

[12]

Deepalakshmi Manikandan, Malliga Subramanian, and Kogilavani Shanmugavadivel. 2022. A System For Detecting Abusive Contents Against LGBT Community Using Deep Learning Based Transformer Models. In Working Notes of FIRE 2022 - Forum for Information Retrieval Evaluation (Hybrid). CEUR.

[13]

Nieves Moyano and Maria del Mar Sanchez-Fuentes. 2020. Homophobic bullying at schools: A systematic review of research, prevalence, school-related predictors and consequences. Aggression and violent behavior 53 (2020), 101441.

[14]

Filip Nilsson, Sana Sabah Al-Azzawi, and György Kovács. 2022. Leveraging Sentiment Data for the Detection of Homophobic/Transphobic Content in a Multi-Task, Multi-Lingual Setting Using Transformers. In Working Notes of FIRE 2022 - Forum for Information Retrieval Evaluation (Hybrid). CEUR.

[15]

Ruba Priyadharshini, Bharathi Raja Chakravarthi, Chinnaudayar Navaneethakrishnan Subalalitha, Thenmozhi Durairaj, Malliga Subramanian, Kogilavani Shanmugavadivel, Siddhanth U Hegde, and Prasanna Kumar Kumaresan. 2022. Findings of the shared task on Abusive Comment Detection in Tamil. In Proceedings of the Second Workshop on Speech and Language Technologies for Dravidian Languages. Association for Computational Linguistics.

[16]

Anbukkarasi Sampath, Thenmozhi Durairaj, Bharathi Raja Chakravarthi, Ruba Priyadharshini, Chinnaudayar Navaneethakrishnan Subalalitha, Kogilavani Shanmugavadivel, Sajeetha Thavareesan, Sathiyaraj Thangasamy, Parameswari Krishnamurthy, Adeep Hande, 2022. Findings of the shared task on Emotion Analysis in Tamil. In Proceedings of the Second Workshop on Speech and Language Technologies for Dravidian Languages. 279–285.

[17]

Sunil Saumya, Vanshita Jha, and Shankar Biradar. 2022. Sentiment and Homophobia Detection on YouTube using Ensemble Machine Learning Techniques. In Working Notes of FIRE 2022 - Forum for Information Retrieval Evaluation (Hybrid). CEUR.

[18]

Kogilavani Shanmugavadivel, Sai Haritha Sampath, Pramod Nandhakumar, Prasath Mahalingam, Malliga Subramanian, Prasanna Kumar Kumaresan, and Ruba Priyadharshini. 2022. An analysis of machine learning models for sentiment analysis of Tamil code-mixed data. Computer Speech & Language(2022), 101407.

[19]

Kogilavani Shanmugavadivel, Malliga Subramanian, Prasanna Kumar Kumaresan, Bharathi Raja Chakravarthi, B Bharathi, Chinnaudayar Navaneethakrishnan Subalalitha, S. K. Lavanya, Thomas Mandl, Rahul Ponnusamy, Vasanth Palanikumar, and Balaji Manoj J. 2022. Overview of the Shared Task on Sentiment Analysis and Homophobia Detection of YouTube Comments in Code-Mixed Dravidian Languages. In Working Notes of FIRE 2022 - Forum for Information Retrieval Evaluation (Hybrid). CEUR.

[20]

Alexandra A. Siegel. 2019. online hate speech v2. https://alexandra-siegel.com/wp-content/uploads/2019/08/Siegel_Online_Hate_Speech_v2.pdf

[21]

CN Subalalitha and E Poovammal. 2018. Automatic bilingual dictionary construction for Tirukural. Applied Artificial Intelligence 32, 6 (2018), 558–567.

[22]

Malliga Subramanian, Rahul Ponnusamy, Sean Benhur, Kogilavani Shanmugavadivel, Adhithiya Ganesan, Deepti Ravi, Gowtham Krishnan Shanmugasundaram, Ruba Priyadharshini, and Bharathi Raja Chakravarthi. 2022. Offensive language detection in Tamil YouTube comments by adapters and cross-domain knowledge transfer. Computer Speech & Language 76 (2022), 101404.

Digital Library

[23]

Sajeetha Thavareesan and Sinnathamby Mahesan. 2019. Sentiment analysis in Tamil texts: A study on machine learning techniques and feature representation. In 2019 14th Conference on Industrial and Information Systems (ICIIS). IEEE, 320–325.

[24]

Sajeetha Thavareesan and Sinnathamby Mahesan. 2020. Sentiment lexicon expansion using Word2vec and fastText for sentiment prediction in Tamil texts. In 2020 Moratuwa Engineering Research Conference (MERCon). IEEE, 272–276.

[25]

Sajeetha Thavareesan and Sinnathamby Mahesan. 2020. Word embedding-based Part of Speech tagging in Tamil texts. In 2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS). IEEE, 478–482.

[26]

Sushil Ugursandi and Anand Kumar M. 2022. Sentiment Analysis and Homophobia detection of YouTube comments. In Working Notes of FIRE 2022 - Forum for Information Retrieval Evaluation (Hybrid). CEUR.

[27]

Josephine Varsha, B Bharathi, and A Meenakshi. 2022. Sentiment Analysis and Homophobia detection of YouTube comments in Code-Mixed Dravidian Languages using machine learning and transformer models. In Working Notes of FIRE 2022 - Forum for Information Retrieval Evaluation (Hybrid). CEUR.

[28]

Samhita Venkatesan, Sarath Donepudi, Pranith P, and Thenmozhi Durairaj. 2022. Homophobia and Transphobia Detection of Youtube Comments in Code-Mixed Dravidian Languages using Deep learning. In Working Notes of FIRE 2022 - Forum for Information Retrieval Evaluation (Hybrid). CEUR.

Cited By

Vegupatti MKumaresan PValli SPonnusamy KPriyadharshini RThavaresan S(2024)Abusive Social Media Comments Detection for Tamil and TeluguSpeech and Language Technologies for Low-Resource Languages10.1007/978-3-031-58495-4_13(174-187)Online publication date: 24-Apr-2024
https://doi.org/10.1007/978-3-031-58495-4_13

Index Terms

Findings of shared task on Sentiment Analysis and Homophobia Detection of YouTube Comments in Code-Mixed Dravidian Languages
1. Computer systems organization
  1. Dependable and fault-tolerant systems and networks
    1. Redundancy
  2. Embedded and cyber-physical systems
    1. Embedded systems
    2. Robotics
2. Networks
  1. Network properties
    1. Network reliability

Recommendations

Overview of the track on Sentiment Analysis for Dravidian Languages in Code-Mixed Text
FIRE '20: Proceedings of the 12th Annual Meeting of the Forum for Information Retrieval Evaluation

Sentiment analysis of Dravidian languages has received attention in recent years. However, most social media text is code-mixed and there is no research available on sentiment analysis of code-mixed Dravidian languages. The Dravidian-CodeMix-FIRE 2020, ...
DravidianCodeMix: sentiment analysis and offensive language identification dataset for Dravidian languages in code-mixed text
Abstract
This paper describes the development of a multilingual, manually annotated dataset for three under-resourced Dravidian languages generated from social media comments. The dataset was annotated for sentiment analysis and offensive language ...
SiSP: Japanese Situation-dependent Sentiment Polarity Dictionary
MMArt-ACM '22: Proceedings of the 2022 International Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia

In order to deal with the variety of meanings and contexts of words, we created a Japanese Situation-dependent Sentiment Polarity Dictionary (SiSP) of sentiment values labeled for 20 different situations. This dictionary was annotated by crowdworkers ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences

FIRE '22: Proceedings of the 14th Annual Meeting of the Forum for Information Retrieval Evaluation

December 2022

101 pages

ISBN:9798400700231

DOI:10.1145/3574318

Copyright © 2022 Owner/Author.

Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 January 2023

Check for updates

Author Tags

Qualifiers

Abstract
Research
Refereed limited

Conference

FIRE '22

FIRE '22: Forum for Information Retrieval Evaluation

December 9 - 13, 2022

Kolkata, India

Acceptance Rates

Overall Acceptance Rate 19 of 64 submissions, 30%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

1
Total Citations
View Citations
87
Total Downloads

Downloads (Last 12 months)22
Downloads (Last 6 weeks)2

Reflects downloads up to 20 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Vegupatti MKumaresan PValli SPonnusamy KPriyadharshini RThavaresan S(2024)Abusive Social Media Comments Detection for Tamil and TeluguSpeech and Language Technologies for Low-Resource Languages10.1007/978-3-031-58495-4_13(174-187)Online publication date: 24-Apr-2024
https://doi.org/10.1007/978-3-031-58495-4_13

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Media

Figures

Other

Tables

View Table of Contents