F-DenseCNN: feature-based dense convolutional neural networks and swift text word embeddings for enhanced hate speech prediction

S. Shilpashree^1,2^na1 &
D. V. Ashoka³^na1

138 Accesses
Explore all metrics

Abstract

Hate speech on social media platforms poses a significant threat to individuals and society, necessitating robust automated detection systems. While existing approaches employ supervised machine learning with text mining elements, they often fall short in capturing the nuanced and evolving nature of hate speech, including subtle linguistic cues, implicit biases, and coded language. This study addresses these limitations by introducing two novel techniques: the feature-based dense convolutional neural network and the swift text word embedding technique. Our key contributions include the development of F-DenseCNN, a deep learning architecture designed to extract complex features from textual data, and the introduction of the swift text word embedding technique, offering efficient and context-aware word representations. Extensive experimentation and evaluation demonstrate that our proposed method significantly outperforms conventional approaches, achieving a 96.2% accuracy in hate speech detection. This substantial improvement in detection accuracy has important implications for content moderation systems, potentially enhancing their reliability and effectiveness in combating online hate speech. Our findings underscore the potential of advanced deep learning techniques in addressing the evolving challenges of hate speech detection on social media platforms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price includes VAT (United Kingdom)

Instant access to the full article PDF.

Institutional subscriptions

Hate and Offensive Language Identification from Social Media: A Machine Learning Approach

Hate Speech Detection Using Machine Learning and Deep Learning Techniques

Comparative Performance of Multi-level Pre-trained Embeddings on CNN, LSTM and CNN-LSTM for Hate Speech and Offensive Language Detection

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Data availability

The data will be made available on request.

References

Airlangga G (2024) Comparative analysis of NLP techniques for hate speech classification in online communications. G-Tech J Teknol Terap 8(1):674–683
Article Google Scholar
Altın LSM, Serrano lB, Saggion H (2019) LaSTUS/TALN at SemEval-2019 task 6: identification and categorization of offensive language in social media with attention-based bi-LSTM model. In: Proceedings of the 13th international workshop on semantic evaluation, pp 672–677
Azumah SW, Elsayed N, ElSayed Z, Ozer M (2023) Cyberbullying in text content detection: an analytical review. arXiv preprint arXiv:2303.10502
Badjatiya P, Gupta S, Gupta M, Varma V (2017) Deep learning for hate speech detection in tweets. In: Proceedings of the 26th international conference on world wide web companion, pp 759–760
Contributors W (2021) Application programming interface. https://en.wikipedia.org/wiki/API. Online Accessed 24 June 2021
Davidson T, Warmsley D, Macy M, Weber I (2017) Automated hate speech detection and the problem of offensive language. In: Proceedings of the international AAAI conference on web and social media, vol 11
Djuric N, Zhou J, Morris R, Grbovic M, Radosavljevic V, Bhamidipati N (2015) Hate speech detection with comment embeddings. In: Proceedings of the 24th international conference on world wide web—WWW companion, pp 29–30
Dorris W, Hu R, Vishwamitra N, Luo F, Costello M (2020) Towards automatic detection and explanation of hate speech and offensive language. In: Proceedings of the 6th international workshop on security and privacy analytics, pp 23–29
d’Sa AG, Illina I, Fohr D, Klakow D, Ruiter D (2020) Label propagation-based semi-supervised learning for hate speech classification. In: Insights from negative results workshop, EMNLP 2020
Faris H, Aljarah I, Habib M, Castillo PA (2020) Hate speech detection using word embedding and deep learning in the Arabic language context. In: Proceedings of the 9th international conference on pattern recognition applications and methods (ICPRAM), pp 453–460
Gambäck B, Sikdar UK (2017) Using convolutional neural networks to classify hate-speech. In: Proceedings of the first workshop on abusive language online, pp 85–90
García-Díaz JA, Jiménez-Zafra SM, García-Cumbreras MA, Valencia-García R (2023) Evaluating feature combination strategies for hate-speech detection in Spanish using linguistic features and transformers. Complex Intell Syst 9(3):2893–2914
Article Google Scholar
Ghosal S, Jain A, Tayal DK, Menon VG, Kumar A (2023) Inculcating context for emoji powered Bengali hate speech detection using extended fuzzy SVM and text embedding models. ACM Trans Asian Low-Resour Lang Inf Process. https://doi.org/10.1145/3589001
Article Google Scholar
Gokhale O, Kane A, Patankar S, Chavan T, Joshi R (2022) Spread love not hate: undermining the importance of hateful pre-training for hate speech detection. arXiv preprint arXiv:2210.04267
Jahan MS, Oussalah M (2023) A systematic review of hate speech automatic detection using natural language processing. Neurocomputing 546:126232
Article Google Scholar
MacAvaney S, Yao H, Yang E, Russell K, Goharian N, Frieder O (2019) Hate speech detection: challenges and solutions. PLoS ONE 14(8):e0221152
Article Google Scholar
Malmasi S, Zampieri M (2017) Detecting hate speech in social media. arXiv:1712.06427
Malmasi S, Zampieri M (2018) Challenges in discriminating profanity from hate speech. J Exp Theor Artif Intell 30(2):187–202
Article Google Scholar
Markov I, Daelemans W (2021) Improving cross-domain hate speech detection by reducing the false positive rate. In: Proceedings of the fourth workshop on NLP for internet freedom: censorship, disinformation, and propaganda, pp 17–22
Martins R, Gomes M, Almeida JJ, Novais P, Henriques P (2018) Hate speech classification in social media using emotional analysis. In: Proceedings of the 7th Brazilian conference on intelligent systems (BRACIS), pp 61–66
Modha S, Mandl T, Shahi GK, Madhu H, Satapara S, Ranasinghe T, Zampieri M (2021) Overview of the HASOC track at fire 2021: hate speech and offensive content identification in English and Indo-Aryan languages. In: FIRE (working notes), pp 1–6
Mollas I, Chrysopoulou Z, Karlos S, Tsoumakas G (2020) Ethos: an online hate speech detection dataset. arXiv preprint. arXiv:2006.08328
Mossie Z, Wang J-H (2020) Vulnerable community identification using hate speech detection on social media. Inf Process Manag 57:102087
Article Google Scholar
Mozafari M, Farahbakhsh R, Crespi N (2019) A bert-based transfer learning approach for hate speech detection in online social media. In: Proceedings of the international conference on complex networks and their applications. Springer, Cham, pp 928–940
Mozafari M, Farahbakhsh R, Crespi N (2020) A bert-based transfer learning approach for hate speech detection in online social media. In: Complex networks and their applications VIII, pp 928–940
Nagar S, Barbhuiya FA, Dey K (2023) Towards more robust hate speech detection: using social context and user data. Soc Netw Anal Min 13(1):47
Article Google Scholar
Nobata C, Tetreault J, Thomas A, Mehdad Y, Chang Y (2015) Abusive language detection in online user content. In: Proceedings of the 25th international conference on world wide web, pp 145–153
Ousidhoum N, Lin Z, Zhang H, Song Y, Yeung D-Y (2019) Multilingual and multi-aspect hate speech analysis. In: Proceedings of the 2019 conference on empirical methods in natural language processing and 9th international joint conference on natural language processing (EMNLP-IJCNLP), pp 4675–4684
Park JH, Fung P (2017) One-step and two-step classification for abusive language detection on twitter. arXiv:1706.01206
Quadri SMK (2024) Hate speech detection on social media using machine learning and deep learning: a review. Grenze Int J Eng Technol GIJET 10(1):1–27
Google Scholar
Rai N, Meena P, Agrawal C (2020) Improving the hate speech analysis through dimensionality reduction approach. In: Proceedings of the 6th international conference on advanced computing and communication systems (ICACCS), pp 321–325
Rajput G, Punn NS, Sonbhadra SK, Agarwal S (2021) Hate speech detection using static bert embeddings. In: International conference on big data analytics. Springer, pp 67–77
Rasel RI, Sultana N, Akhter S, Meesad P (2018) Detection of cyber aggressive comments on social media networks: a machine learning and text mining approach. In: Proceedings of the 2nd international conference on natural language processing and information retrieval, pp 37–41
Rathpisey H, Adji TB (2019) Handling imbalance issue in hate speech classification using sampling-based methods. In: Proceedings of the 5th international conference on computer science and information technology (ICSITech), pp 193–198
Ribeiro M, Calais P, Santos Y, Almeida V, Meira W Jr (2018) Characterizing and detecting hateful users on twitter. Proceedings of the international AAAI conference on web social media 12:1–10
Article Google Scholar
Rizos G, Hemker K, Schuller B (2019) Augment to prevent: short-text data augmentation in deep learning for hate-speech classification. In: Proceedings of the 28th ACM international conference on information and knowledge management, pp 991–1000
Saleh H, Alhothali A, Moria K (2023) Detection of hate speech using bert and hate speech word embedding with deep model. Appl Artif Intell 37(1):2166719
Article Google Scholar
Samoshyn A (2020) Hate speech and offensive language dataset. https://www.kaggle.com/datasets/mrmorj/hate-speech-and-offensive-language-dataset
Schmidt A, Wiegand M (2017) A survey on hate speech detection using natural language processing. In: Proceedings of the fifth international workshop on natural language processing for social media, pp 1–10
Soliman AB, Eissa K, El-Beltagy SR (2017) AraVec: a set of Arabic word embedding models for use in Arabic NLP. Procedia Comput Sci 117:256–265
Article Google Scholar
Sultan D, Toktarova A, Zhumadillayeva A, Aldeshov S, Mussiraliyeva S, Beissenova G, Tursynbayev A, Baenova G, Imanbayeva A (2023) Cyberbullying-related hate speech detection using shallow-to-deep learning. Comput Mater Contin 75(1):2115–2131
Google Scholar
Tesfaye SG, Kakeba K (2020) Automated Amharic hate speech posts and comments detection model using recurrent neural network
Waseem Z, Hovy D (2016) Hateful symbols or hateful people? Predictive features for hate speech detection on twitter. In: Proceedings of the NAACL student research workshop, pp 88–93
Yadav D, Sain MK (2023) Comparative analysis and assessment on different hate speech detection learning techniques. J Algebraic Stat 14(1):29–48
Google Scholar
Yaosheng Z, Tiegang Z, Tingjun Y, Li H (2024) Domain-enhanced prompt learning for Chinese implicit hate speech detection. IEEE Access 12:13773–13782
Article Google Scholar
Yuan L, Wang T, Ferraro G, Suominen H, Rizoiu M-A (2023) Transfer learning for hate speech detection in social media. J Comput Soc Sci 6(2):1081–1101
Article Google Scholar
Zhang Z, Luo L (2019) Hate speech detection: a solved problem? The challenging case of long tail on twitter. Semant Web 10:925–945
Article Google Scholar

Download references

Funding

The funding information is not available.

Author information

S. Shilpashree and D. V. Ashoka have contributed equally to this work.

Authors and Affiliations

School of Advanced Studies, Computer Science Department, S-Vyasa University, Global City Campus, Bengaluru, Karnataka, 560059, India
S. Shilpashree
Department of Computer Science and Engineering, JSS Academy of Technical Education, Bengaluru, Visvesvaraya Technological University, Belagavi, Dr. Vishnuvardhana Road, Bengaluru, Karnataka, 560060, India
S. Shilpashree
Department of Information Science and Engineering, JSS Academy of Technical Education, Bengaluru, Visvesvaraya Technological University, Belagavi, Dr. Vishnuvardhana Road, Bengaluru, Karnataka, 560060, India
D. V. Ashoka

Authors

S. Shilpashree
View author publications
You can also search for this author in PubMed Google Scholar
D. V. Ashoka
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to S. Shilpashree.

Ethics declarations

Conflict of interest

The authors declare that there is no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Shilpashree, S., Ashoka, D.V. F-DenseCNN: feature-based dense convolutional neural networks and swift text word embeddings for enhanced hate speech prediction. Soc. Netw. Anal. Min. 14, 192 (2024). https://doi.org/10.1007/s13278-024-01345-3

Download citation

Received: 16 May 2024
Revised: 22 August 2024
Accepted: 28 August 2024
Published: 24 September 2024
DOI: https://doi.org/10.1007/s13278-024-01345-3

F-DenseCNN: feature-based dense convolutional neural networks and swift text word embeddings for enhanced hate speech prediction

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Hate and Offensive Language Identification from Social Media: A Machine Learning Approach

Hate Speech Detection Using Machine Learning and Deep Learning Techniques

Comparative Performance of Multi-level Pre-trained Embeddings on CNN, LSTM and CNN-LSTM for Hate Speech and Offensive Language Detection

Data availability

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

F-DenseCNN: feature-based dense convolutional neural networks and swift text word embeddings for enhanced hate speech prediction

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Hate and Offensive Language Identification from Social Media: A Machine Learning Approach

Hate Speech Detection Using Machine Learning and Deep Learning Techniques

Comparative Performance of Multi-level Pre-trained Embeddings on CNN, LSTM and CNN-LSTM for Hate Speech and Offensive Language Detection

Explore related subjects

Data availability

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation