[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ Skip to main content

Advertisement

Log in

F-DenseCNN: feature-based dense convolutional neural networks and swift text word embeddings for enhanced hate speech prediction

  • Original Article
  • Published:
Social Network Analysis and Mining Aims and scope Submit manuscript

Abstract

Hate speech on social media platforms poses a significant threat to individuals and society, necessitating robust automated detection systems. While existing approaches employ supervised machine learning with text mining elements, they often fall short in capturing the nuanced and evolving nature of hate speech, including subtle linguistic cues, implicit biases, and coded language. This study addresses these limitations by introducing two novel techniques: the feature-based dense convolutional neural network and the swift text word embedding technique. Our key contributions include the development of F-DenseCNN, a deep learning architecture designed to extract complex features from textual data, and the introduction of the swift text word embedding technique, offering efficient and context-aware word representations. Extensive experimentation and evaluation demonstrate that our proposed method significantly outperforms conventional approaches, achieving a 96.2% accuracy in hate speech detection. This substantial improvement in detection accuracy has important implications for content moderation systems, potentially enhancing their reliability and effectiveness in combating online hate speech. Our findings underscore the potential of advanced deep learning techniques in addressing the evolving challenges of hate speech detection on social media platforms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
£29.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (United Kingdom)

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Algorithm 1
Fig. 6
Algorithm 2
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

Data availability

The data will be made available on request.

References

  • Airlangga G (2024) Comparative analysis of NLP techniques for hate speech classification in online communications. G-Tech J Teknol Terap 8(1):674–683

    Article  Google Scholar 

  • Altın LSM, Serrano lB, Saggion H (2019) LaSTUS/TALN at SemEval-2019 task 6: identification and categorization of offensive language in social media with attention-based bi-LSTM model. In: Proceedings of the 13th international workshop on semantic evaluation, pp 672–677

  • Azumah SW, Elsayed N, ElSayed Z, Ozer M (2023) Cyberbullying in text content detection: an analytical review. arXiv preprint arXiv:2303.10502

  • Badjatiya P, Gupta S, Gupta M, Varma V (2017) Deep learning for hate speech detection in tweets. In: Proceedings of the 26th international conference on world wide web companion, pp 759–760

  • Contributors W (2021) Application programming interface. https://en.wikipedia.org/wiki/API. Online Accessed 24 June 2021

  • Davidson T, Warmsley D, Macy M, Weber I (2017) Automated hate speech detection and the problem of offensive language. In: Proceedings of the international AAAI conference on web and social media, vol 11

  • Djuric N, Zhou J, Morris R, Grbovic M, Radosavljevic V, Bhamidipati N (2015) Hate speech detection with comment embeddings. In: Proceedings of the 24th international conference on world wide web—WWW companion, pp 29–30

  • Dorris W, Hu R, Vishwamitra N, Luo F, Costello M (2020) Towards automatic detection and explanation of hate speech and offensive language. In: Proceedings of the 6th international workshop on security and privacy analytics, pp 23–29

  • d’Sa AG, Illina I, Fohr D, Klakow D, Ruiter D (2020) Label propagation-based semi-supervised learning for hate speech classification. In: Insights from negative results workshop, EMNLP 2020

  • Faris H, Aljarah I, Habib M, Castillo PA (2020) Hate speech detection using word embedding and deep learning in the Arabic language context. In: Proceedings of the 9th international conference on pattern recognition applications and methods (ICPRAM), pp 453–460

  • Gambäck B, Sikdar UK (2017) Using convolutional neural networks to classify hate-speech. In: Proceedings of the first workshop on abusive language online, pp 85–90

  • García-Díaz JA, Jiménez-Zafra SM, García-Cumbreras MA, Valencia-García R (2023) Evaluating feature combination strategies for hate-speech detection in Spanish using linguistic features and transformers. Complex Intell Syst 9(3):2893–2914

    Article  Google Scholar 

  • Ghosal S, Jain A, Tayal DK, Menon VG, Kumar A (2023) Inculcating context for emoji powered Bengali hate speech detection using extended fuzzy SVM and text embedding models. ACM Trans Asian Low-Resour Lang Inf Process. https://doi.org/10.1145/3589001

    Article  Google Scholar 

  • Gokhale O, Kane A, Patankar S, Chavan T, Joshi R (2022) Spread love not hate: undermining the importance of hateful pre-training for hate speech detection. arXiv preprint arXiv:2210.04267

  • Jahan MS, Oussalah M (2023) A systematic review of hate speech automatic detection using natural language processing. Neurocomputing 546:126232

    Article  Google Scholar 

  • MacAvaney S, Yao H, Yang E, Russell K, Goharian N, Frieder O (2019) Hate speech detection: challenges and solutions. PLoS ONE 14(8):e0221152

    Article  Google Scholar 

  • Malmasi S, Zampieri M (2017) Detecting hate speech in social media. arXiv:1712.06427

  • Malmasi S, Zampieri M (2018) Challenges in discriminating profanity from hate speech. J Exp Theor Artif Intell 30(2):187–202

    Article  Google Scholar 

  • Markov I, Daelemans W (2021) Improving cross-domain hate speech detection by reducing the false positive rate. In: Proceedings of the fourth workshop on NLP for internet freedom: censorship, disinformation, and propaganda, pp 17–22

  • Martins R, Gomes M, Almeida JJ, Novais P, Henriques P (2018) Hate speech classification in social media using emotional analysis. In: Proceedings of the 7th Brazilian conference on intelligent systems (BRACIS), pp 61–66

  • Modha S, Mandl T, Shahi GK, Madhu H, Satapara S, Ranasinghe T, Zampieri M (2021) Overview of the HASOC track at fire 2021: hate speech and offensive content identification in English and Indo-Aryan languages. In: FIRE (working notes), pp 1–6

  • Mollas I, Chrysopoulou Z, Karlos S, Tsoumakas G (2020) Ethos: an online hate speech detection dataset. arXiv preprint. arXiv:2006.08328

  • Mossie Z, Wang J-H (2020) Vulnerable community identification using hate speech detection on social media. Inf Process Manag 57:102087

    Article  Google Scholar 

  • Mozafari M, Farahbakhsh R, Crespi N (2019) A bert-based transfer learning approach for hate speech detection in online social media. In: Proceedings of the international conference on complex networks and their applications. Springer, Cham, pp 928–940

  • Mozafari M, Farahbakhsh R, Crespi N (2020) A bert-based transfer learning approach for hate speech detection in online social media. In: Complex networks and their applications VIII, pp 928–940

  • Nagar S, Barbhuiya FA, Dey K (2023) Towards more robust hate speech detection: using social context and user data. Soc Netw Anal Min 13(1):47

    Article  Google Scholar 

  • Nobata C, Tetreault J, Thomas A, Mehdad Y, Chang Y (2015) Abusive language detection in online user content. In: Proceedings of the 25th international conference on world wide web, pp 145–153

  • Ousidhoum N, Lin Z, Zhang H, Song Y, Yeung D-Y (2019) Multilingual and multi-aspect hate speech analysis. In: Proceedings of the 2019 conference on empirical methods in natural language processing and 9th international joint conference on natural language processing (EMNLP-IJCNLP), pp 4675–4684

  • Park JH, Fung P (2017) One-step and two-step classification for abusive language detection on twitter. arXiv:1706.01206

  • Quadri SMK (2024) Hate speech detection on social media using machine learning and deep learning: a review. Grenze Int J Eng Technol GIJET 10(1):1–27

    Google Scholar 

  • Rai N, Meena P, Agrawal C (2020) Improving the hate speech analysis through dimensionality reduction approach. In: Proceedings of the 6th international conference on advanced computing and communication systems (ICACCS), pp 321–325

  • Rajput G, Punn NS, Sonbhadra SK, Agarwal S (2021) Hate speech detection using static bert embeddings. In: International conference on big data analytics. Springer, pp 67–77

  • Rasel RI, Sultana N, Akhter S, Meesad P (2018) Detection of cyber aggressive comments on social media networks: a machine learning and text mining approach. In: Proceedings of the 2nd international conference on natural language processing and information retrieval, pp 37–41

  • Rathpisey H, Adji TB (2019) Handling imbalance issue in hate speech classification using sampling-based methods. In: Proceedings of the 5th international conference on computer science and information technology (ICSITech), pp 193–198

  • Ribeiro M, Calais P, Santos Y, Almeida V, Meira W Jr (2018) Characterizing and detecting hateful users on twitter. Proceedings of the international AAAI conference on web social media 12:1–10

    Article  Google Scholar 

  • Rizos G, Hemker K, Schuller B (2019) Augment to prevent: short-text data augmentation in deep learning for hate-speech classification. In: Proceedings of the 28th ACM international conference on information and knowledge management, pp 991–1000

  • Saleh H, Alhothali A, Moria K (2023) Detection of hate speech using bert and hate speech word embedding with deep model. Appl Artif Intell 37(1):2166719

    Article  Google Scholar 

  • Samoshyn A (2020) Hate speech and offensive language dataset. https://www.kaggle.com/datasets/mrmorj/hate-speech-and-offensive-language-dataset

  • Schmidt A, Wiegand M (2017) A survey on hate speech detection using natural language processing. In: Proceedings of the fifth international workshop on natural language processing for social media, pp 1–10

  • Soliman AB, Eissa K, El-Beltagy SR (2017) AraVec: a set of Arabic word embedding models for use in Arabic NLP. Procedia Comput Sci 117:256–265

    Article  Google Scholar 

  • Sultan D, Toktarova A, Zhumadillayeva A, Aldeshov S, Mussiraliyeva S, Beissenova G, Tursynbayev A, Baenova G, Imanbayeva A (2023) Cyberbullying-related hate speech detection using shallow-to-deep learning. Comput Mater Contin 75(1):2115–2131

    Google Scholar 

  • Tesfaye SG, Kakeba K (2020) Automated Amharic hate speech posts and comments detection model using recurrent neural network

  • Waseem Z, Hovy D (2016) Hateful symbols or hateful people? Predictive features for hate speech detection on twitter. In: Proceedings of the NAACL student research workshop, pp 88–93

  • Yadav D, Sain MK (2023) Comparative analysis and assessment on different hate speech detection learning techniques. J Algebraic Stat 14(1):29–48

    Google Scholar 

  • Yaosheng Z, Tiegang Z, Tingjun Y, Li H (2024) Domain-enhanced prompt learning for Chinese implicit hate speech detection. IEEE Access 12:13773–13782

    Article  Google Scholar 

  • Yuan L, Wang T, Ferraro G, Suominen H, Rizoiu M-A (2023) Transfer learning for hate speech detection in social media. J Comput Soc Sci 6(2):1081–1101

    Article  Google Scholar 

  • Zhang Z, Luo L (2019) Hate speech detection: a solved problem? The challenging case of long tail on twitter. Semant Web 10:925–945

    Article  Google Scholar 

Download references

Funding

The funding information is not available.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to S. Shilpashree.

Ethics declarations

Conflict of interest

The authors declare that there is no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Shilpashree, S., Ashoka, D.V. F-DenseCNN: feature-based dense convolutional neural networks and swift text word embeddings for enhanced hate speech prediction. Soc. Netw. Anal. Min. 14, 192 (2024). https://doi.org/10.1007/s13278-024-01345-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s13278-024-01345-3

Keywords

Navigation