[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3486622.3493978acmconferencesArticle/Chapter ViewAbstractPublication PageswiConference Proceedingsconference-collections
short-paper

Credible Text Summarization in Social Media

Published: 13 April 2022 Publication History

Abstract

In the past decade, social media have become increasingly popular for information consumption, due to its easy access, fast dissemination, and low-cost production. However, compared to this explosion of information content, social media presents two major problems. The first concerns the potential propagation of misinformation, often in the form of fake news, which can have significant negative societal effects. The second concerns the extreme difficulty that users may have in finding information of interest in a context characterized by information overload. In this paper, both of these problems are considered and analyzed together. In fact, the main goal of this work is to study the problem of generating automatic summaries of texts (around specific topics of interest) that are at the same time credible. This task has been very little studied in the literature, so this contribution can be considered one of the first works about credible text summarization.

References

[1]
Ahmet Aker, Kevin Vincentius, and Kalina Bontcheva. 2019. Credibility and Transparency of News Sources: Data Collection and Feature Analysis. In NewsIR@ SIGIR. CEUR-WS, Paris, France, 15–20.
[2]
Hareesh Bahuleyan and Olga Vechtomova. 2017. UWaterloo at SemEval-2017 Task 8: Detecting Stance towards Rumours with Topic Independent Features. In Proceedings of SemEval-2017. ACL, Vancouver, Canada, 461–464.
[3]
Bartomiej Balcerzak, Wojciech Jaworski, and Adam Wierzbicki. 2014. Application of TextRank algorithm for credibility assessment. In 2014 IEEE/WIC/ACM WI-IAT International Joint Conferences, Vol. 1. IEEE, Warsaw, Poland, 451–454.
[4]
Elisa Bassignana, Valerio Basile, and Viviana Patti. 2018. Hurtlex: A multilingual lexicon of words to hurt. In 5th Italian Conference on Computational Linguistics, CLiC-it 2018, Vol. 2253. CEUR-WS, Turin, Italy, 1–6.
[5]
David Bawden, Clive Holtham, and Nigel Courtney. 1999. Perspectives on information overload. In Aslib proceedings. MCB UP Ltd, 249–255.
[6]
Erik Cambria, Daniel Olsher, and Dheeraj Rajagopal. 2014. SenticNet 3: A Common and Common-Sense Knowledge Base for Cognition-Driven Sentiment Analysis. In AAAI’14: Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, Vol. 2. AAAI Press, Québec City, Québec, Canada, 1515––1521.
[7]
Yoonjung Choi and Janyce Wiebe. 2014. +/-EffectWordNet: Sense-level Lexicon Acquisition for Opinion Inference. In Proceedings of the 2014 EMNLP Conference. ACL, Doha, Qatar, 1181–1191.
[8]
Limeng Cui and Dongwon Lee. 2020. CoAID: COVID-19 Healthcare Misinformation Dataset. arxiv:2006.00885 [cs.SI]
[9]
Hal Daumé III and Daniel Marcu. 2006. Bayesian Query-Focused Summarization. In Proceedings COLING-ACL Joint Conferences. ACL, Sydney, Australia, 305–312.
[10]
Jorge Carrillo de Albornoz, Laura Plaza, and Pablo Gervás. 2012. SentiSense: An easily scalable concept-based affective lexicon for sentiment analysis. In Proceedings of the Eighth LREC Conference). ELRA, Istanbul, Turkey, 3562–3567.
[11]
Leon Derczynski and Kalina Bontcheva. 2014. Pheme: Veracity in Digital Social Networks. In UMAP workshops. CEUR-WS, 4 pages.
[12]
Ted Dunning. 1993. Accurate Methods for the Statistics of Surprise and Coincidence. Computational Linguistics 19, 1 (1993), 61–74.
[13]
Günes Erkan and Dragomir R. Radev. 2011. LexRank: Graph-based Lexical Centrality as Salience in Text Summarization. CoRR abs/1109.2128(2011), 457–479. arxiv:1109.2128
[14]
Soheil Esmaeilzadeh, Gao Xian Peh, and Angela Xu. 2019. Neural abstractive text summarization and fake news detection. arXiv preprint arXiv:1904.00788(2019), 10 pages.
[15]
Andrea Esuli and Fabrizio Sebastiani. 2007. SENTIWORDNET: A high-coverage lexical resource for opinion mining. Evaluation 9, 43 (01 2007), 26 pages.
[16]
Ethan Fast, Binbin Chen, and Michael S. Bernstein. 2016. Empath: Understanding Topic Signals in Large-Scale Text. In Proceedings of the 2016 CHI Conference. ACM, New York, NY, USA, 4647–4657.
[17]
Siva Charan Reddy Gangireddy, Cheng Long, and Tanmoy Chakraborty. 2020. Unsupervised fake news detection: A graph-based approach. In Proceedings of the 31st ACM Conference on Hypertext and Social Media. ACM, Online, USA, 75–83.
[18]
Lorraine Goeuriot 2020. Overview of the CLEF eHealth evaluation lab 2020. In International Conference of the Cross-Language Evaluation Forum for European Languages. Springer, Online, Greece, 255–271.
[19]
Yihong Gong and Xin Liu. 2001. Generic Text Summarization Using Relevance Measure and Latent Semantic Analysis. In SIGIR Forum (ACM Special Interest Group on Information Retrieval). ACM, 19–25.
[20]
Som Gupta and SK Gupta. 2019. Abstractive summarization: An overview of the state of the art. Expert Systems with Applications 121 (2019), 49–65.
[21]
Mohammad Hossin and Md Nasir Sulaiman. 2015. A review on evaluation metrics for data classification evaluations. International journal of data mining & knowledge management process 5, 2(2015), 1.
[22]
D. Inouye and J. K. Kalita. 2011. Comparing Twitter Summarization Algorithms for Multiple Post Summaries. In 2011 IEEE SocialCom-PASSAT Joint Conferences. IEEE, Boston, MA, USA, 298–306.
[23]
Shu-juan Ji, Qi Zhang, Jinpeng Li, Dickson KW Chiu, Shaohua Xu, Lei Yi, and Maoguo Gong. 2020. A burst-based unsupervised method for detecting review spammer groups. Information Sciences 536(2020), 454–469.
[24]
PM Lerman. 1980. Fitting segmented regression models by grid search. Journal of the Royal Statistical Society: Series C (Applied Statistics) 29, 1(1980), 77–84.
[25]
Chin-Yew Lin. 2004. Rouge: A package for automatic evaluation of summaries. In Text summarization branches out. ACL, Barcelona, Spain, 74–81.
[26]
H. P. Luhn. 1958. The Automatic Creation of Literature Abstracts. IBM J. Res. Dev. 2, 2 (April 1958), 159–165. https://doi.org/10.1147/rd.22.0159
[27]
Rada Mihalcea and Paul Tarau. 2004. TextRank: Bringing Order into Text. In Proceedings of the 2004 EMNLP Conference. ACL, Barcelona, Spain, 404–411.
[28]
Saif Mohammad and Peter Turney. 2010. Emotions Evoked by Common Words and Phrases: Using Mechanical Turk to Create an Emotion Lexicon. In Proceedings of the NAACL HLT 2010 Workshop. ACL, Los Angeles, CA, 26–34.
[29]
Ani Nenkova and Kathleen McKeown. 2011. Automatic summarization. Now Publishers Inc.
[30]
Finn Nielsen. 2011. A new ANEW: Evaluation of a word list for sentiment analysis in microblogs. CoRR (03 2011).
[31]
Lawrence Page, Sergey Brin, Rajeev Motwani, and Terry Winograd. 1999. The PageRank citation ranking: Bringing order to the web.Technical Report. Stanford.
[32]
Gabriella Pasi and Marco Viviani. 2020. Information credibility in the social web: Contexts, approaches, and open issues. arXiv preprint arXiv:2001.09473(2020), 11 pages.
[33]
S. Poria, A. Gelbukh, A. Hussain, N. Howard, D. Das, and S. Bandyopadhyay. 2013. Enhanced SenticNet with Affective Labels for Concept-Based Opinion Mining. IEEE Intelligent Systems 28, 2 (2013), 31–38.
[34]
Ajay Rastogi and Monica Mehrotra. 2017. Opinion Spam Detection in Online Reviews. Journal of Information & Knowledge Management 16 (09 2017).
[35]
Chengcheng Shao, Giovanni Luca Ciampaglia, Onur Varol, Kai-Cheng Yang, Alessandro Flammini, and Filippo Menczer. 2018. The spread of low-credibility content by social bots. Nature communications 9, 1 (2018), 1–9.
[36]
Hideyuki Shibuki 2010. A method for automatically generating a mediatory summary to verify credibility of information on the web. In Coling 2010: Posters. COLING, Beijing, China, 1140–1148.
[37]
Jae-Seung Shim, Ha-Ram Won, and Hyunchul Ahn. 2019. A Study on the Effect of the Document Summarization Technique on the Fake News Detection Model. Journal of Intelligence and Information Systems 25, 3 (2019), 201–220.
[38]
Lucy Vanderwende, Hisami Suzuki, Chris Brockett, and Ani Nenkova. 2007. Beyond SumBasic: Task-Focused Summarization with Sentence Simplification and Lexical Expansion. Inf. Process. Manage. 43 (11 2007), 1606–1618.
[39]
Marco Viviani and Gabriella Pasi. 2017. Credibility in social media: opinions, news, and health information—a survey. Wiley interdisciplinary reviews: Data mining and knowledge discovery 7, 5(2017), e1209.
[40]
Xinyi Zhou and Reza Zafarani. 2020. A survey of fake news: Fundamental theories, detection methods, and opportunities. ACM Computing Surveys (CSUR) 53, 5 (2020), 1–40.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
WI-IAT '21: IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology
December 2021
698 pages
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 April 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Credible Text Summarization
  2. Fake News
  3. Information Overload
  4. Misinformation
  5. Social Media.

Qualifiers

  • Short-paper
  • Research
  • Refereed limited

Conference

WI-IAT '21
Sponsor:
WI-IAT '21: IEEE/WIC/ACM International Conference on Web Intelligence
December 14 - 17, 2021
VIC, Melbourne, Australia

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 110
    Total Downloads
  • Downloads (Last 12 months)21
  • Downloads (Last 6 weeks)2
Reflects downloads up to 13 Dec 2024

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media