DOI: 10.1145/3574318.3574347

Findings of shared task on Sentiment Analysis and Homophobia Detection of YouTube Comments in Code-Mixed Dravidian Languages

Published: 12 January 2023

Abstract

This paper presents an overview of the shared task on sentiment analysis and homophobia detection of YouTube comments in code-mixed Dravidian languages. We describe the task and the systems submitted to it. The shared task, organized as part of FIRE 2022, comprised two subtasks: task A on sentiment analysis and task B on homophobia detection. A total of 95 participants registered for the shared task; 13 teams submitted results for task A and 10 teams submitted results for task B. The teams approached both tasks using traditional machine learning and deep learning models. Most of the benchmark systems analyzed by the participants were capable of handling code-mixed scenarios in Dravidian languages.



Published In

FIRE '22: Proceedings of the 14th Annual Meeting of the Forum for Information Retrieval Evaluation
December 2022
101 pages
ISBN: 9798400700231
DOI: 10.1145/3574318

Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery
New York, NY, United States


Author Tags

1. Datasets
2. Evaluations
3. Homophobia
4. Sentiment

Qualifiers

• Abstract
• Research
• Refereed limited

Conference

FIRE '22: Forum for Information Retrieval Evaluation
December 9 - 13, 2022
Kolkata, India

Acceptance Rates

Overall Acceptance Rate 19 of 64 submissions, 30%
