More Web Proxy on the site http://driver.im/

research-article

Public Access

Learning from Fact-checkers: Analysis and Generation of Fact-checking Language

Authors:

Kyumin LeeAuthors Info & Claims

SIGIR'19: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pages 335 - 344

https://doi.org/10.1145/3331184.3331248

Published: 18 July 2019 Publication History

Abstract

In fighting against fake news, many fact-checking systems comprised of human-based fact-checking sites (e.g., snopes.com and politifact.com) and automatic detection systems have been developed in recent years. However, online users still keep sharing fake news even when it has been debunked. It means that early fake news detection may be insufficient and we need another complementary approach to mitigate the spread of misinformation. In this paper, we introduce a novel application of text generation for combating fake news. In particular, we (1) leverage online users named fact-checkers, who cite fact-checking sites as credible evidences to fact-check information in public discourse; (2) analyze linguistic characteristics of fact-checking tweets; and (3) propose and build a deep learning framework to generate responses with fact-checking intention to increase the fact-checkers' engagement in fact-checking activities. Our analysis reveals that the fact-checkers tend to refute misinformation and use formal language (e.g. few swear words and Internet slangs). Our framework successfully generates relevant responses, and outperforms competing models by achieving up to 30% improvements. Our qualitative study also confirms that the superiority of our generated responses compared with responses generated from the existing models.

Supplementary Material

MP4 File (cite3-17h40-d1.mp4)

Download
511.86 MB

References

[1]

Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2015. Neural machine translation by jointly learning to align and translate. ICLR (2015).

[2]

Satanjeev Banerjee and Alon Lavie. 2005. METEOR: An automatic metric for MT evaluation with improved correlation with human judgments. In ACL.

[3]

Guanliang Chen, Jie Yang, Claudia Hauff, and Geert-Jan Houben. 2018. LearningQ: a large-scale dataset for educational question generation. In ICWSM.

[4]

J. Cheng, M. Bernstein, C. Danescu-Niculescu-Mizil, and J. Leskovec. 2017. Anyone Can Become a Troll: Causes of Trolling Behavior in Online Discussions. In CSCW.

Digital Library

[5]

Kyunghyun Cho, Bart Van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014. Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078 (2014).

[6]

Dipanjan Das, Nathan Schneider, Desai Chen, and Noah A Smith. 2010. Probabilistic frame-semantic parsing. In NAACL. 948--956.

Digital Library

[7]

Thomas Davidson, Dana Warmsley, Michael Macy, and Ingmar Weber. 2017. Automated hate speech detection and the problem of offensive language. arXiv preprint arXiv:1703.04009 (2017).

[8]

Ullrich K. H. Ecker, Stephan Lewandowsky, and David T. W. Tang. 2010. Explicit warnings reduce but do not eliminate the continued influence of misinformation. Memory & cognition, Vol. 38, 8 (2010), 1087--1100.

[9]

Adrien Friggeri, Lada A. Adamic, Dean Eckles, and Justin Cheng. 2014. Rumor Cascades. In ICWSM.

[10]

Aditi Gupta, Hemank Lamba, Ponnurangam Kumaraguru, and Anupam Joshi. 2013. Faking sandy: characterizing and identifying fake images on twitter during hurricane sandy. In WWW. ACM, 729--736.

Digital Library

[11]

Aniko Hannak, Drew Margolin, Brian Keegan, and Ingmar Weber. 2014. Get Back! You Don't Know Me Like That: The Social Mediation of Fact Checking Interventions in Twitter Conversations. In ICWSM.

[12]

Karl Moritz Hermann, Tomas Kocisky, Edward Grefenstette, Lasse Espeholt, Will Kay, Mustafa Suleyman, and Phil Blunsom. 2015. Teaching machines to read and comprehend. In Advances in Neural Information Processing Systems. 1693--1701.

Digital Library

[13]

Benjamin D. Horne and Sibel Adali. 2017. This just in: fake news packs a lot in title, uses simpler, repetitive content in text body, more similar to satire than real news. NECO Workshop (2017).

[14]

Business Insider. 2016. Microsoft is deleting its AI chatbot's incredibly racist tweets. https://read.bi/2DgeRkN. (2016).

[15]

Shan Jiang and Christo Wilson. 2018. Linguistic Signals under Misinformation and Fact-Checking: Evidence from User Comments on Social Media. HCI (2018).

Digital Library

[16]

Jooyeon Kim, Dongkwan Kim, and Alice Oh. 2019. Homogeneity-Based Transmissive Process to Model True and False News in Social Networks. In WSDM.

Digital Library

[17]

Jooyeon Kim, Behzad Tabibian, Alice Oh, Bernhard Schölkopf, and Manuel Gomez-Rodriguez. 2018. Leveraging the Crowd to Detect and Reduce the Spread of Fake News and Misinformation. In WSDM.

Digital Library

[18]

Yoon Kim, Yacine Jernite, David Sontag, and Alexander M. Rush. 2016. Character-Aware Neural Language Models. In AAAI.

Digital Library

[19]

Diederik P. Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).

[20]

Sejeong Kwon, Meeyoung Cha, Kyomin Jung, Wei Chen, and Yajun Wang. 2013. Aspects of rumor spreading on a microblog network. SocInfo.

Digital Library

[21]

Reporter Lab. 2018. Fact-checking triples over four years. https://reporterslab.org/fact-checking-triples-over-four-years/. (2018).

[22]

Hung Le, Truyen Tran, Thin Nguyen, and Svetha Venkatesh. 2018. Variational memory encoder-decoder. In NIPS.

Digital Library

[23]

Quoc Le and Tomas Mikolov. 2014. Distributed representations of sentences and documents. In ICML.

Digital Library

[24]

Kyumin Lee, James Caverlee, and Steve Webb. 2010. Uncovering social spammers: social honeypots+ machine learning. In SIGIR.

Digital Library

[25]

Jiwei Li, Will Monroe, Tianlin Shi, Sébastien Jean, Alan Ritter, and Dan Jurafsky. 2017. Adversarial learning for neural dialogue generation. In EMNLP.

[26]

Chin-Yew Lin. 2004. Rouge: A package for automatic evaluation of summaries. Text Summarization Branches Out (2004).

[27]

Chia-Wei Liu, Ryan Lowe, Iulian V. Serban, Michael Noseworthy, Laurent Charlin, and Joelle Pineau. 2016. How not to evaluate your dialogue system: An empirical study of unsupervised evaluation metrics for dialogue response generation. arXiv preprint arXiv:1603.08023 (2016).

[28]

Minh-Thang Luong, Hieu Pham, and Christopher D. Manning. 2015. Effective approaches to attention-based neural machine translation. In EMNLP.

[29]

Jing Ma, Wei Gao, Prasenjit Mitra, Sejeong Kwon, Bernard J. Jansen, Kam-Fai Wong, and Meeyoung Cha. 2016. Detecting Rumors from Microblogs with Recurrent Neural Networks. In IJCAI. 3818--3824.

Digital Library

[30]

Jing Ma, Wei Gao, Zhongyu Wei, Yueming Lu, and Kam-Fai Wong. 2015. Detect rumors using time series of social context information on microblogging websites. In CIKM.

Digital Library

[31]

Jim Maddock, Kate Starbird, Haneen J. Al-Hassani, Daniel E. Sandoval, Mania Orand, and Robert M. Mason. 2015. Characterizing online rumoring behavior using multi-dimensional signatures. In CSCW.

Digital Library

[32]

Tsvetomila Mihaylova, Preslav Nakov, Lluis Marquez, Alberto Barron-Cedeno, Mitra Mohtarami, Georgi Karadzhov, and James Glass. 2018. Fact checking in community forums. In AAAI.

[33]

Tomáš Mikolov, Martin Karafiát, Lukáš Burget, Jan Černockỳ, and Sanjeev Khudanpur. 2010. Recurrent neural network based language model. In ISCA.

[34]

An T. Nguyen, Aditya Kharosekar, Matthew Lease, and Byron Wallace. 2018. An interpretable joint graphical model for fact-checking from crowds. In AAAI.

[35]

Brendan Nyhan and Jason Reifler. 2010. When corrections fail: The persistence of political misperceptions. Political Behavior, Vol. 32, 2 (2010), 303--330.

[36]

Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. BLEU: a method for automatic evaluation of machine translation. In ACL.

Digital Library

[37]

James W. Pennebaker, Ryan L. Boyd, Kayla Jordan, and Kate Blackburn. 2015. The development and psychometric properties of LIWC2015. Technical Report.

[38]

Kashyap Popat, Subhabrata Mukherjee, Jannik Strötgen, and Gerhard Weikum. 2016. Credibility assessment of textual claims on the web. In CIKM.

Digital Library

[39]

Kashyap Popat, Subhabrata Mukherjee, Andrew Yates, and Gerhard Weikum. 2018. DeClarE: Debunking Fake News and False Claims using Evidence-Aware Deep Learning. In EMNLP.

[40]

Vahed Qazvinian, Emily Rosengren, Dragomir R. Radev, and Qiaozhu Mei. 2011. Rumor has it: Identifying misinformation in microblogs. In EMNLP.

Digital Library

[41]

Feng Qian, ChengYue Gong, Karishma Sharma, and Yan Liu. 2018. Neural User Response Generator: Fake News Detection with Collective User Intelligence. In IJCAI. 3834--3840.

Digital Library

[42]

Hannah Rashkin, Eunsol Choi, Jin Yea Jang, Svitlana Volkova, and Yejin Choi. 2017. Truth of varying shades: Analyzing language in fake news and political fact-checking. In EMNLP.

[43]

Iulian V. Serban, Alessandro Sordoni, Yoshua Bengio, Aaron Courville, and Joelle Pineau. 2015. Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models. arXiv preprint arXiv:1507.04808 (2015).

Digital Library

[44]

Iulian Vlad Serban, Alessandro Sordoni, Ryan Lowe, Laurent Charlin, Joelle Pineau, Aaron C. Courville, and Yoshua Bengio. 2017. A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues. In AAAI.

[45]

Lifeng Shang, Zhengdong Lu, and Hang Li. 2015. Neural Responding Machine for Short-Text Conversation. In ACL.

[46]

Chengcheng Shao, Giovanni Luca Ciampaglia, Alessandro Flammini, and Filippo Menczer. 2016. Hoaxy: A platform for tracking online misinformation. In WWW.

Digital Library

[47]

Prashant Shiralkar, Alessandro Flammini, Filippo Menczer, and Giovanni Luca Ciampaglia. 2017. Finding streams in knowledge graphs to support fact checking. In ICDM.

[48]

Kai Shu, Suhang Wang, Thai Le, Dongwon Lee, and Huan Liu. 2018. Deep headline generation for clickbait detection. ICDM.

[49]

Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. 2014. Sequence to sequence learning with neural networks. In NIPS. 3104--3112.

Digital Library

[50]

James Thorne, Andreas Vlachos, Christos Christodoulopoulos, and Arpit Mittal. 2018. FEVER: a large-scale dataset for Fact Extraction and VERification. In EMNLP.

[51]

Oriol Vinyals and Quoc Le. 2015. A neural conversational model. ICML (2015).

[52]

Nguyen Vo and Kyumin Lee. 2018. The rise of guardians: Fact-checking url recommendation to combat fake news. In SIGIR.

Digital Library

[53]

Svitlana Volkova, Kyle Shaffer, Jin Yea Jang, and Nathan Hodas. 2017. Separating facts from fiction: Linguistic models to classify suspicious and trusted news posts on twitter. In ACL.

[54]

Soroush Vosoughi, Deb Roy, and Sinan Aral. 2018. The spread of true and false news online. Science, Vol. 359, 6380 (2018), 1146--1151.

[55]

Wenjie Wang, Minlie Huang, Xin-Shun Xu, Fumin Shen, and Liqiang Nie. 2018. Chat More: Deepening and Widening the Chatting Topic via A Deep Model. In SIGIR.

Digital Library

[56]

William Yang Wang. 2017. "Liar, Liar Pants on Fire": A New Benchmark Dataset for Fake News Detection. In ACL.

[57]

Yaqing Wang, Fenglong Ma, Zhiwei Jin, Ye Yuan, Guangxu Xun, Kishlay Jha, Lu Su, and Jing Gao. 2018. EANN: Event Adversarial Neural Networks for Multi-Modal Fake News Detection. In KDD.

Digital Library

[58]

Yuanshun Yao, Bimal Viswanath, Jenna Cryan, Haitao Zheng, and Ben Y Zhao. 2017. Automated Crowdturfing Attacks and Defenses in Online Review Systems. In SIGSAC.

Digital Library

[59]

Zhe Zhao, Paul Resnick, and Qiaozhu Mei. 2015. Enquiring minds: Early detection of rumors in social media from enquiry posts. In WWW.

Digital Library

Cited By

Wang LHu Y(2025)Topic-guided multi-domain fake news detectionMultimedia Systems10.1007/s00530-024-01636-x31:1Online publication date: 1-Feb-2025
https://dl.acm.org/doi/10.1007/s00530-024-01636-x
Marcondes FBarbosa MGala AAlmeida JNovais P(2024)Emotional and Mental Nuances and Technological Approaches: Optimising Fact-Check Dissemination through Cognitive Reinforcement TechniqueElectronics10.3390/electronics1301024013:1(240)Online publication date: 4-Jan-2024
https://doi.org/10.3390/electronics13010240
He BHu YLee YOh SVerma GKumar S(2024)A Survey on the Role of Crowds in Combating Online Misinformation: Annotators, Evaluators, and CreatorsACM Transactions on Knowledge Discovery from Data10.1145/369498019:1(1-30)Online publication date: 29-Nov-2024
https://dl.acm.org/doi/10.1145/3694980
Show More Cited By

Index Terms

Learning from Fact-checkers: Analysis and Generation of Fact-checking Language
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Natural language generation

Recommendations

Linguistic Signals under Misinformation and Fact-Checking: Evidence from User Comments on Social Media

Misinformation and fact-checking are opposite forces in the news environment: the former creates inaccuracies to mislead people, while the latter provides evidence to rebut the former. These news articles are often posted on social media and attract user ...
Are Fact Checkers Effective in the Post Truth World? Assessing Impact of Fact Checkers Cross Medium and Platforms
Web Information Systems Engineering – WISE 2024
Abstract
Social media platforms help users share opinions and find new information but also spread rumors, which misinforms the public. These rumour threads often prompt users (called guardians) to respond with fact-checking articles to debunk or verify ...
Why Doesn't Fact-Checking Work?: The Mis-Framing of Division on Social Media in Japan
SMSociety'20: International Conference on Social Media and Society

With the increasing popularity of fact-checking practices, concerns have grown over fact-checking that is either concentrated on one side of the political spectrum or partisan content masquerading as a fact-checking resource. If deployed by partisans, ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

SIGIR'19: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

July 2019

1512 pages

ISBN:9781450361729

DOI:10.1145/3331184

General Chairs:
Benjamin Piwowarski
CNRS - Sorbonne Universite, France
,
Max Chevalier
Universite de Toulouse, CNRS, France
,
Eric Gaussier
Universite Grenoble Alpes, CNRS, France
,
Program Chairs:
Yoelle Maarek
Amazon Research, Israel
,
Jian-Yun Nie
University of Montreal, Canada
,
Falk Scholer
RMIT University, Australia

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGIR: ACM Special Interest Group on Information Retrieval

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 18 July 2019

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

NSF

Conference

SIGIR '19

Sponsor:

SIGIR

SIGIR '19: The 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

July 21 - 25, 2019

Paris, France

Acceptance Rates

SIGIR'19 Paper Acceptance Rate 84 of 426 submissions, 20%;

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

40
Total Citations
View Citations
2,121
Total Downloads

Downloads (Last 12 months)426
Downloads (Last 6 weeks)25

Reflects downloads up to 03 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Wang LHu Y(2025)Topic-guided multi-domain fake news detectionMultimedia Systems10.1007/s00530-024-01636-x31:1Online publication date: 1-Feb-2025
https://dl.acm.org/doi/10.1007/s00530-024-01636-x
Marcondes FBarbosa MGala AAlmeida JNovais P(2024)Emotional and Mental Nuances and Technological Approaches: Optimising Fact-Check Dissemination through Cognitive Reinforcement TechniqueElectronics10.3390/electronics1301024013:1(240)Online publication date: 4-Jan-2024
https://doi.org/10.3390/electronics13010240
He BHu YLee YOh SVerma GKumar S(2024)A Survey on the Role of Crowds in Combating Online Misinformation: Annotators, Evaluators, and CreatorsACM Transactions on Knowledge Discovery from Data10.1145/369498019:1(1-30)Online publication date: 29-Nov-2024
https://dl.acm.org/doi/10.1145/3694980
Liu HDas ABoltz AZhou DPinaroc DLease MLee M(2024)Human-centered NLP Fact-checking: Co-Designing with Fact-checkers using Matchmaking for AIProceedings of the ACM on Human-Computer Interaction10.1145/36869628:CSCW2(1-44)Online publication date: 8-Nov-2024
https://dl.acm.org/doi/10.1145/3686962
He BMa YAhamad MKumar S(2024)Corrective or Backfire: Characterizing and Predicting User Response to Social CorrectionProceedings of the 16th ACM Web Science Conference10.1145/3614419.3644004(149-158)Online publication date: 21-May-2024
https://dl.acm.org/doi/10.1145/3614419.3644004
Park HAhn D(2024)The Promise and Peril of ChatGPT in Higher Education: Opportunities, Challenges, and Design ImplicationsProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642785(1-21)Online publication date: 11-May-2024
https://dl.acm.org/doi/10.1145/3613904.3642785
Al-Quayed FJaved DJhanjhi NHumayun MAlnusairi T(2024)A Hybrid Transformer-Based Model for Optimizing Fake News DetectionIEEE Access10.1109/ACCESS.2024.347643212(160822-160834)Online publication date: 2024
https://doi.org/10.1109/ACCESS.2024.3476432
Farhangian FCruz RCavalcanti G(2024)Fake news detection: Taxonomy and comparative studyInformation Fusion10.1016/j.inffus.2023.102140103(102140)Online publication date: Mar-2024
https://doi.org/10.1016/j.inffus.2023.102140
Ahmed KKhan MHaq IMazroa AM.S. SInnab NAlajmi MAlkahtani H(2024)Social media’s dark secrets: A propagation, lexical and psycholinguistic oriented deep learning approach for fake news proliferationExpert Systems with Applications10.1016/j.eswa.2024.124650255(124650)Online publication date: Dec-2024
https://doi.org/10.1016/j.eswa.2024.124650
Mu YNiu PBontcheva KAletras N(2024)Predicting and analyzing the popularity of false rumors in WeiboExpert Systems with Applications10.1016/j.eswa.2023.122791243(122791)Online publication date: Jun-2024
https://doi.org/10.1016/j.eswa.2023.122791
Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Figures

Tables

Media

View Table of Conten