Abstract
Hate speech on social media platforms poses a significant threat to individuals and society, necessitating robust automated detection systems. While existing approaches employ supervised machine learning with text mining elements, they often fall short in capturing the nuanced and evolving nature of hate speech, including subtle linguistic cues, implicit biases, and coded language. This study addresses these limitations by introducing two novel techniques: the feature-based dense convolutional neural network and the swift text word embedding technique. Our key contributions include the development of F-DenseCNN, a deep learning architecture designed to extract complex features from textual data, and the introduction of the swift text word embedding technique, offering efficient and context-aware word representations. Extensive experimentation and evaluation demonstrate that our proposed method significantly outperforms conventional approaches, achieving a 96.2% accuracy in hate speech detection. This substantial improvement in detection accuracy has important implications for content moderation systems, potentially enhancing their reliability and effectiveness in combating online hate speech. Our findings underscore the potential of advanced deep learning techniques in addressing the evolving challenges of hate speech detection on social media platforms.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data availability
The data will be made available on request.
References
Airlangga G (2024) Comparative analysis of NLP techniques for hate speech classification in online communications. G-Tech J Teknol Terap 8(1):674–683
Altın LSM, Serrano lB, Saggion H (2019) LaSTUS/TALN at SemEval-2019 task 6: identification and categorization of offensive language in social media with attention-based bi-LSTM model. In: Proceedings of the 13th international workshop on semantic evaluation, pp 672–677
Azumah SW, Elsayed N, ElSayed Z, Ozer M (2023) Cyberbullying in text content detection: an analytical review. arXiv preprint arXiv:2303.10502
Badjatiya P, Gupta S, Gupta M, Varma V (2017) Deep learning for hate speech detection in tweets. In: Proceedings of the 26th international conference on world wide web companion, pp 759–760
Contributors W (2021) Application programming interface. https://en.wikipedia.org/wiki/API. Online Accessed 24 June 2021
Davidson T, Warmsley D, Macy M, Weber I (2017) Automated hate speech detection and the problem of offensive language. In: Proceedings of the international AAAI conference on web and social media, vol 11
Djuric N, Zhou J, Morris R, Grbovic M, Radosavljevic V, Bhamidipati N (2015) Hate speech detection with comment embeddings. In: Proceedings of the 24th international conference on world wide web—WWW companion, pp 29–30
Dorris W, Hu R, Vishwamitra N, Luo F, Costello M (2020) Towards automatic detection and explanation of hate speech and offensive language. In: Proceedings of the 6th international workshop on security and privacy analytics, pp 23–29
d’Sa AG, Illina I, Fohr D, Klakow D, Ruiter D (2020) Label propagation-based semi-supervised learning for hate speech classification. In: Insights from negative results workshop, EMNLP 2020
Faris H, Aljarah I, Habib M, Castillo PA (2020) Hate speech detection using word embedding and deep learning in the Arabic language context. In: Proceedings of the 9th international conference on pattern recognition applications and methods (ICPRAM), pp 453–460
Gambäck B, Sikdar UK (2017) Using convolutional neural networks to classify hate-speech. In: Proceedings of the first workshop on abusive language online, pp 85–90
García-Díaz JA, Jiménez-Zafra SM, García-Cumbreras MA, Valencia-García R (2023) Evaluating feature combination strategies for hate-speech detection in Spanish using linguistic features and transformers. Complex Intell Syst 9(3):2893–2914
Ghosal S, Jain A, Tayal DK, Menon VG, Kumar A (2023) Inculcating context for emoji powered Bengali hate speech detection using extended fuzzy SVM and text embedding models. ACM Trans Asian Low-Resour Lang Inf Process. https://doi.org/10.1145/3589001
Gokhale O, Kane A, Patankar S, Chavan T, Joshi R (2022) Spread love not hate: undermining the importance of hateful pre-training for hate speech detection. arXiv preprint arXiv:2210.04267
Jahan MS, Oussalah M (2023) A systematic review of hate speech automatic detection using natural language processing. Neurocomputing 546:126232
MacAvaney S, Yao H, Yang E, Russell K, Goharian N, Frieder O (2019) Hate speech detection: challenges and solutions. PLoS ONE 14(8):e0221152
Malmasi S, Zampieri M (2017) Detecting hate speech in social media. arXiv:1712.06427
Malmasi S, Zampieri M (2018) Challenges in discriminating profanity from hate speech. J Exp Theor Artif Intell 30(2):187–202
Markov I, Daelemans W (2021) Improving cross-domain hate speech detection by reducing the false positive rate. In: Proceedings of the fourth workshop on NLP for internet freedom: censorship, disinformation, and propaganda, pp 17–22
Martins R, Gomes M, Almeida JJ, Novais P, Henriques P (2018) Hate speech classification in social media using emotional analysis. In: Proceedings of the 7th Brazilian conference on intelligent systems (BRACIS), pp 61–66
Modha S, Mandl T, Shahi GK, Madhu H, Satapara S, Ranasinghe T, Zampieri M (2021) Overview of the HASOC track at fire 2021: hate speech and offensive content identification in English and Indo-Aryan languages. In: FIRE (working notes), pp 1–6
Mollas I, Chrysopoulou Z, Karlos S, Tsoumakas G (2020) Ethos: an online hate speech detection dataset. arXiv preprint. arXiv:2006.08328
Mossie Z, Wang J-H (2020) Vulnerable community identification using hate speech detection on social media. Inf Process Manag 57:102087
Mozafari M, Farahbakhsh R, Crespi N (2019) A bert-based transfer learning approach for hate speech detection in online social media. In: Proceedings of the international conference on complex networks and their applications. Springer, Cham, pp 928–940
Mozafari M, Farahbakhsh R, Crespi N (2020) A bert-based transfer learning approach for hate speech detection in online social media. In: Complex networks and their applications VIII, pp 928–940
Nagar S, Barbhuiya FA, Dey K (2023) Towards more robust hate speech detection: using social context and user data. Soc Netw Anal Min 13(1):47
Nobata C, Tetreault J, Thomas A, Mehdad Y, Chang Y (2015) Abusive language detection in online user content. In: Proceedings of the 25th international conference on world wide web, pp 145–153
Ousidhoum N, Lin Z, Zhang H, Song Y, Yeung D-Y (2019) Multilingual and multi-aspect hate speech analysis. In: Proceedings of the 2019 conference on empirical methods in natural language processing and 9th international joint conference on natural language processing (EMNLP-IJCNLP), pp 4675–4684
Park JH, Fung P (2017) One-step and two-step classification for abusive language detection on twitter. arXiv:1706.01206
Quadri SMK (2024) Hate speech detection on social media using machine learning and deep learning: a review. Grenze Int J Eng Technol GIJET 10(1):1–27
Rai N, Meena P, Agrawal C (2020) Improving the hate speech analysis through dimensionality reduction approach. In: Proceedings of the 6th international conference on advanced computing and communication systems (ICACCS), pp 321–325
Rajput G, Punn NS, Sonbhadra SK, Agarwal S (2021) Hate speech detection using static bert embeddings. In: International conference on big data analytics. Springer, pp 67–77
Rasel RI, Sultana N, Akhter S, Meesad P (2018) Detection of cyber aggressive comments on social media networks: a machine learning and text mining approach. In: Proceedings of the 2nd international conference on natural language processing and information retrieval, pp 37–41
Rathpisey H, Adji TB (2019) Handling imbalance issue in hate speech classification using sampling-based methods. In: Proceedings of the 5th international conference on computer science and information technology (ICSITech), pp 193–198
Ribeiro M, Calais P, Santos Y, Almeida V, Meira W Jr (2018) Characterizing and detecting hateful users on twitter. Proceedings of the international AAAI conference on web social media 12:1–10
Rizos G, Hemker K, Schuller B (2019) Augment to prevent: short-text data augmentation in deep learning for hate-speech classification. In: Proceedings of the 28th ACM international conference on information and knowledge management, pp 991–1000
Saleh H, Alhothali A, Moria K (2023) Detection of hate speech using bert and hate speech word embedding with deep model. Appl Artif Intell 37(1):2166719
Samoshyn A (2020) Hate speech and offensive language dataset. https://www.kaggle.com/datasets/mrmorj/hate-speech-and-offensive-language-dataset
Schmidt A, Wiegand M (2017) A survey on hate speech detection using natural language processing. In: Proceedings of the fifth international workshop on natural language processing for social media, pp 1–10
Soliman AB, Eissa K, El-Beltagy SR (2017) AraVec: a set of Arabic word embedding models for use in Arabic NLP. Procedia Comput Sci 117:256–265
Sultan D, Toktarova A, Zhumadillayeva A, Aldeshov S, Mussiraliyeva S, Beissenova G, Tursynbayev A, Baenova G, Imanbayeva A (2023) Cyberbullying-related hate speech detection using shallow-to-deep learning. Comput Mater Contin 75(1):2115–2131
Tesfaye SG, Kakeba K (2020) Automated Amharic hate speech posts and comments detection model using recurrent neural network
Waseem Z, Hovy D (2016) Hateful symbols or hateful people? Predictive features for hate speech detection on twitter. In: Proceedings of the NAACL student research workshop, pp 88–93
Yadav D, Sain MK (2023) Comparative analysis and assessment on different hate speech detection learning techniques. J Algebraic Stat 14(1):29–48
Yaosheng Z, Tiegang Z, Tingjun Y, Li H (2024) Domain-enhanced prompt learning for Chinese implicit hate speech detection. IEEE Access 12:13773–13782
Yuan L, Wang T, Ferraro G, Suominen H, Rizoiu M-A (2023) Transfer learning for hate speech detection in social media. J Comput Soc Sci 6(2):1081–1101
Zhang Z, Luo L (2019) Hate speech detection: a solved problem? The challenging case of long tail on twitter. Semant Web 10:925–945
Funding
The funding information is not available.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that there is no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Shilpashree, S., Ashoka, D.V. F-DenseCNN: feature-based dense convolutional neural networks and swift text word embeddings for enhanced hate speech prediction. Soc. Netw. Anal. Min. 14, 192 (2024). https://doi.org/10.1007/s13278-024-01345-3
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13278-024-01345-3