Abstract
Accurately detecting hate speech with supervised classification depends on data annotated by humans. Attaining high agreement among annotators is difficult, however, owing to the subjective nature of the task and the differing cultural, geographic, and social backgrounds of the annotators. Furthermore, existing datasets capture only single types of hate speech, such as sexism or racism, or single demographics, such as people living in the United States, which hurts recall on data not represented in the training examples. Because unseen forms of hate speech, or hate speech towards unseen groups, are not captured during training, end users of websites where hate speech may occur risk being exposed to explicit content that automatic detection systems fail to flag.

In this paper, we investigate methods for bridging differences in the annotation and collection of abusive-language tweets, such as different annotation schemes, label sets, and the geographic and cultural influences of data sampling. We consider three distinct sets of annotations, namely those provided by Talat (2016), Talat and Hovy (2016), and Davidson et al. (2017). Specifically, we train a machine learning model in a multi-task learning (MTL) framework, in which an auxiliary task is typically learned alongside a main task in order to improve performance on the latter. Our approach differs from most previous work in that we aim to train a model that is robust across data originating from different distributions and labeled under differing annotation guidelines, and in that we treat these datasets as different learning objectives, in the way that classical work in multi-task learning treats different tasks. Here, we experiment with using fine-grained tags for annotation. Aided by the predictions of our models as well as the baseline models, we seek to show that distinct domains can be utilized jointly for classification, and to show how cultural context influences classifier performance, since the datasets we use are collected either exclusively from the U.S. (Davidson et al. 2017) or globally with no geographic restriction (Talat 2016; Talat and Hovy 2016).

Our choice of a multi-task learning set-up is motivated by a number of factors. Most importantly, MTL allows us to share knowledge between two or more objectives, such that information encoded in one dataset can be leveraged to better fit another. As shown by Bingel and Søgaard (2017) and Martínez Alonso and Plank (2017), this is particularly promising when the auxiliary task has a more coarse-grained label set than the main task. Another benefit of MTL is that it lets us learn lower-level representations from larger amounts of data than a single-task setup would allow. Together with MTL's known regularizing effect, this not only helps the model fit the training data but also helps prevent overfitting, especially when dealing with small datasets.
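To make the hard parameter-sharing setup described above concrete, the following is a minimal sketch (not the authors' implementation): a shared feed-forward encoder over bag-of-words features with one output head per annotation scheme. The class name, dimensions, and label counts are illustrative assumptions, and PyTorch is used only for convenience.

```python
import torch
import torch.nn as nn

class SharedHateSpeechMTL(nn.Module):
    """Hard parameter sharing: one shared encoder, one output head per dataset."""

    def __init__(self, vocab_size, hidden_dim, label_counts):
        super().__init__()
        # Lower-level representations are shared across all annotation schemes.
        self.shared = nn.Sequential(
            nn.Linear(vocab_size, hidden_dim), nn.ReLU(),
            nn.Linear(hidden_dim, hidden_dim), nn.ReLU(),
        )
        # One task-specific output layer per dataset / label set.
        self.heads = nn.ModuleList([nn.Linear(hidden_dim, n) for n in label_counts])

    def forward(self, bow_features, task_id):
        return self.heads[task_id](self.shared(bow_features))

# Training alternates mini-batches drawn from the different datasets, so the
# shared layers are updated on all of them while each head only sees its own labels.
model = SharedHateSpeechMTL(vocab_size=10_000, hidden_dim=128, label_counts=[3, 3, 3])
loss_fn = nn.CrossEntropyLoss()
optimizer = torch.optim.Adam(model.parameters())
```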
The original version of this chapter was revised: the co-author name "Zeerak Waseem" has been changed to read "Zeerak Talat". The correction to this chapter can be found at https://doi.org/10.1007/978-3-319-78583-7_12. All the authors contributed equally.
Notes
- 1.
After re-annotation to unify class labels, if necessary.
- 2.
The fact that in MTL we tend to learn both tasks simultaneously rather than in succession weakens this analogy to some degree. In fact, the simultaneous learning of two languages could actually make learning harder for humans. For a machine, however, the temporal order is less critical given its far superior memory when compared to humans.
- 3.
Such choices include the number and width of the hidden layers, input representations, task-specific learning rates, training schedules, among others.
- 4.
Context is not defined more clearly in their paper.
- 5.
Note that in principle, hard parameter sharing also allows us to predict the different tasks at different depths of the model, e.g. to compute the output for task A from some hidden representation \(h_m\) and task B from \(h_n\) (with \(m \ne n\)); a sketch of this variant follows these notes. Yet another possible variation is to compute further hidden representations that are task-specific and not shared, but that ultimately draw on some common lower-level representation.
- 6.
A one-hot vector is a binary vector of indicator features: an entry is 1 if the corresponding feature occurs in the document and 0 if it does not (see the indicator-vector sketch following these notes).
- 7.
Emoticons used in the text are removed, URLs are replaced with the “<url>” token, and usernames are replaced with “@user” (see the preprocessing sketch following these notes).
- 8.
“My n*ggah my n*ggah” is a reference to Denzel Washington’s character in the movie Training Day.
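As mentioned in note 5, hard parameter sharing also allows the different tasks to be predicted at different depths of the model. The following is a minimal, illustrative sketch of that variant (not the chapter's architecture); layer and head names are assumptions.

```python
import torch.nn as nn

class DifferentDepthHeads(nn.Module):
    """Hard parameter sharing where task A is predicted from an earlier
    hidden representation (h_m) than task B (h_n), with m != n."""

    def __init__(self, in_dim, hidden_dim, n_labels_a, n_labels_b):
        super().__init__()
        self.layer_m = nn.Sequential(nn.Linear(in_dim, hidden_dim), nn.ReLU())
        self.layer_n = nn.Sequential(nn.Linear(hidden_dim, hidden_dim), nn.ReLU())
        self.head_a = nn.Linear(hidden_dim, n_labels_a)  # reads the shallower h_m
        self.head_b = nn.Linear(hidden_dim, n_labels_b)  # reads the deeper h_n

    def forward(self, x):
        h_m = self.layer_m(x)
        h_n = self.layer_n(h_m)
        return self.head_a(h_m), self.head_b(h_n)
```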
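Note 6 describes the one-hot (indicator) representation of a document. A small sketch of how such a vector can be built; the vocabulary and example tokens are made up for illustration.

```python
def indicator_vector(document_tokens, vocabulary):
    """Binary bag-of-words: entry i is 1 iff vocabulary[i] occurs in the document."""
    present = set(document_tokens)
    return [1 if term in present else 0 for term in vocabulary]

vocab = ["hate", "tweet", "speech", "user"]
indicator_vector("this tweet mentions speech".split(), vocab)  # -> [0, 1, 1, 0]
```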
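Note 7 summarises the preprocessing applied to tweets. One possible implementation sketch is given below; the exact rules (e.g. how emoticons are detected) are not specified in the note, so the emoji-stripping regex is an assumption.

```python
import re

def preprocess(tweet):
    """Replace URLs and usernames with placeholder tokens and strip emoji."""
    tweet = re.sub(r"https?://\S+", "<url>", tweet)   # URLs -> "<url>"
    tweet = re.sub(r"@\w+", "@user", tweet)           # usernames -> "@user"
    # Drop characters outside the Basic Multilingual Plane (covers most emoji);
    # the note does not spell out the emoticon-removal rule, so this is illustrative.
    return re.sub(r"[\U00010000-\U0010FFFF]", "", tweet)

preprocess("@someone look at this http://t.co/xyz 😡")
# -> '@user look at this <url> '
```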
References
Allen WR, Epps EG, Guillory EA, Suh SA, Bonous-Hammarth M (2000) The black academic: faculty status among African Americans in U.S. higher education. J Negro Educ 69(1/2):112–127. http://www.jstor.org/stable/2696268
Badjatiya P, Gupta S, Gupta M, Varma V (2017) Deep learning for hate speech detection in tweets. In: Proceedings of the 26th international conference on world wide web companion, WWW ’17 Companion, pp 759–760. International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland. https://doi.org/10.1145/3041021.3054223
Bingel J, Søgaard A (2017) Identifying beneficial task relations for multi-task learning in deep neural networks. In: Proceedings of the 15th conference of the European chapter of the association for computational linguistics, short papers, vol 2, pp 164–169. Association for Computational Linguistics, Valencia, Spain. http://www.aclweb.org/anthology/E17-2026
Bjerva J (2017) One model to rule them all: multitask and multilingual modelling for lexical analysis. arXiv:1711.01100
Bjerva J (2017) Will my auxiliary tagging task help? Estimating auxiliary tasks effectivity in multi-task learning. In: Proceedings of the 21st Nordic conference on computational linguistics, NoDaLiDa, 22–24 May 2017, Gothenburg, Sweden, 131, pp 216–220. Linköping University Electronic Press
Boeckmann RJ, Liew J (2002) Hate speech: Asian American students' justice judgments and psychological responses. J Soc Issues 58(2):363–381. https://doi.org/10.1111/1540-4560.00265
Bollmann M, Bingel J, Søgaard A (2017) Learning attention for historical text normalization by learning to pronounce. In: Proceedings of the 55th annual meeting of the association for computational linguistics, long papers, vol 1, pp 332–344
Boyle K (2001) Hate speech - the United States versus the rest of the world. Maine Law Rev 53(2):487–502
Caruana R (1998) Multitask learning. Learning to learn, pp 95–133. Springer
Caruana RA (1993) Multitask connectionist learning. In: Proceedings of the 1993 connectionist models summer school. CiteSeer
Chandrasekharan E, Samory M, Srinivasan A, Gilbert E (2017) The bag of communities: identifying abusive behavior online with preexisting internet data. In: Proceedings of the 2017 CHI conference on human factors in computing systems, CHI ’17, ACM, New York, NY, USA, pp 3175–3187. https://doi.org/10.1145/3025453.3026018
Cohen PN, Huffman ML (2007) Black under-representation in management across U.S. labor markets. Ann Am Acad Polit Soc Sci 609(1):181–199. https://doi.org/10.1177/0002716206296734
Crawford K, Gillespie T (2014) What is a flag for? social media reporting tools and the vocabulary of complaint. New Media Soc 18(3):410–428. https://doi.org/10.1177/1461444814543163
Crenshaw K (1989) Demarginalizing the intersection of race and sex: a black feminist critique of antidiscrimination doctrine, feminist theory and antiracist politics. Univ Chicago Legal Forum 1989(1)
Crenshaw K (2016) The urgency of intersectionality. https://www.ted.com/talks/kimberle_crenshaw_the_urgency_of_intersectionality
Davidson T, Warmsley D, Macy M, Weber I (2017) Automated hate speech detection and the problem of offensive language. In: Proceedings of ICWSM
Desmond-Harris J (2015) Are black communities overpoliced or underpoliced? both. https://www.vox.com/2015/4/14/8411733/black-community-policing-crime
Dixon T, Linz D (2000) Overrepresentation and underrepresentation of African Americans and latinos as lawbreakers on television news. J Commun 50(2):131–154. https://doi.org/10.1111/j.1460-2466.2000.tb02845.x
European Commission (2016) Code of conduct on countering illegal hate speech online. Technical report
Gambäck B, Sikdar UK (2017) Using convolutional neural networks to classify hate-speech. In: Proceedings of the first workshop on abusive language online, pp 85–90. Association for Computational Linguistics. http://aclweb.org/anthology/W17-3013
Girshick R (2015) Fast R-CNN. In: Proceedings of the IEEE international conference on computer vision, pp 1440–1448
Heinzerling B, Strube M (2017) BPEmb: tokenization-free pre-trained subword embeddings in 275 languages. CoRR abs/1710.02187. http://arxiv.org/abs/1710.02187
Home Office (2016) Action against hate: the UK government's plan for tackling hate crime. Technical report
Jha A, Mamidi R (2017) When does a compliment become sexist? analysis and classification of ambivalent sexism using twitter data. In: Proceedings of the second workshop on NLP and computational social science, pp 7–16. Association for Computational Linguistics. http://aclweb.org/anthology/W17-2902
Jørgensen A, Hovy D, Søgaard A (2016) Learning a POS tagger for AAVE-like language. In: Proceedings of the 2016 conference of the North American chapter of the association for computational linguistics: human language technologies, pp 1115–1120. Association for Computational Linguistics, San Diego, California. http://www.aclweb.org/anthology/N16-1130
Kim Y (2014) Convolutional neural networks for sentence classification. CoRR abs/1408.5882. http://arxiv.org/abs/1408.5882
Klerke S, Goldberg Y, Søgaard A (2016) Improving sentence compression by learning to predict gaze. In: Proceedings of NAACL-HLT, pp 1528–1533
Levin S (2017) Moderators who had to view child abuse content sue Microsoft, claiming PTSD
Luong MT, Le QV, Sutskever I, Vinyals O, Kaiser L (2015) Multi-task sequence to sequence learning. arXiv:1511.06114
Martínez Alonso H, Plank B (2017) When is multitask learning effective? semantic sequence prediction under varying data conditions. In: Proceedings of the 15th conference of the European chapter of the association for computational linguistics, long papers, vol 1, pp 44–53. Association for Computational Linguistics, Valencia, Spain. http://www.aclweb.org/anthology/E17-1005
McIntosh P (1988) White privilege and male privilege: a personal account of coming to see correspondences through work in women’s studies
Müller K, Schwarz C (2017) Fanning the flames of hate: social media and hate crime
Nobata C, Tetreault J, Thomas A, Mehdad Y, Chang Y (2016) Abusive language detection in online user content. In: Proceedings of the 25th international conference on world wide web, WWW ’16, pp 145–153. International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland. https://doi.org/10.1145/2872427.2883062
Park JH, Fung P (2017) One-step and two-step classification for abusive language detection on twitter. In: Proceedings of the first workshop on abusive language online, pp 41–45. Association for Computational Linguistics. http://aclweb.org/anthology/W17-3006
Pew Research Center (2017) Online harassment. http://www.pewinternet.org/2014/10/22/online-harassment/
Rahman J (2012) The n word: its history and use in the African American community. J English Linguist 40(2):137–171. https://doi.org/10.1177/0075424211414807
Ramsundar B, Kearnes S, Riley P, Webster D, Konerding D, Pande V (2015) Massively multitask networks for drug discovery. arXiv:1502.02072
Roberts DE (2004) The social and moral cost of mass incarceration in African American communities. Stanf Law Rev 56(5):1271–1306
Ross B, Rist M, Carbonell G, Cabrera B, Kurowsky N, Wojatzki M (2016) Measuring the reliability of hate speech annotations: the case of the European refugee crisis. In: Beißwenger M, Wojatzki M, Zesch T (eds) Proceedings of NLP4CMC III: 3rd workshop on natural language processing for computer-mediated communication, Bochumer Linguistische Arbeitsberichte, vol 17, pp 6–9. Bochum
Smith PK, Mahdavi J, Carvalho M, Fisher S, Russell S, Tippett N (2008) Cyberbullying: its nature and impact in secondary school pupils. J Child Psychol Psychiatry 49(4):376–385. https://doi.org/10.1111/j.1469-7610.2007.01846.x
Socher R, Perelygin A, Wu J, Chuang J, Manning CD, Ng AY, Potts C (2013) Recursive deep models for semantic compositionality over a sentiment treebank. In: Proceedings of the 2013 conference on empirical methods in natural language processing, pp 1631–1642. Association for Computational Linguistics, Stroudsburg, PA
The Guardian (2017) Germany approves plans to fine social media firms up to €50m
Talat Z (2016) Are you a racist or am i seeing things? annotator influence on hate speech detection on twitter. In: Proceedings of the first workshop on NLP and computational social science, pp 138–142. Association for Computational Linguistics, Austin, Texas. http://aclweb.org/anthology/W16-5618
Talat Z, Davidson T, Warmsley D, Weber I (2017) Understanding abuse: a typology of abusive language detection subtasks. In: Proceedings of the first workshop on abusive language online. Association for Computational Linguistics
Talat Z, Hovy D (2016) Hateful symbols or hateful people? predictive features for hate speech detection on twitter. In: Proceedings of the NAACL student research workshop. Association for Computational Linguistics, San Diego, California
Wulczyn E, Thain N, Dixon L (2017) Ex machina: personal attacks seen at scale. In: Proceedings of the 26th international conference on world wide web, WWW ’17, pp 1391–1399. International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland. https://doi.org/10.1145/3038912.3052591
Yu J, Jiang J (2016) Learning sentence embeddings with auxiliary tasks for cross-domain sentiment classification. Association for Computational Linguistics
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this chapter
Cite this chapter
Talat, Z., Thorne, J., Bingel, J. (2018). Bridging the Gaps: Multi Task Learning for Domain Transfer of Hate Speech Detection. In: Golbeck, J. (eds) Online Harassment. Human–Computer Interaction Series. Springer, Cham. https://doi.org/10.1007/978-3-319-78583-7_3
DOI: https://doi.org/10.1007/978-3-319-78583-7_3
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-78582-0
Online ISBN: 978-3-319-78583-7
eBook Packages: Computer Science (R0)