[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3626772.3657901acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
short-paper

From Text to Context: An Entailment Approach for News Stakeholder Classification

Published: 11 July 2024 Publication History

Abstract

Navigating the complex landscape of news articles involves understanding the various actors or entities involved, referred to as news stakeholders. These stakeholders, ranging from policymakers to opposition figures, citizens, and more, play pivotal roles in shaping news narratives. Recognizing their stakeholder types, reflecting their roles, political alignments, social standing, and more, is paramount for a nuanced comprehension of news content. Despite existing works focusing on salient entity extraction, coverage variations, and political affiliations through social media data, the automated detection of stakeholder roles within news content remains an underexplored domain. In this paper, we bridge this gap by introducing an effective approach to classify stakeholder types in news articles. Our method involves transforming the stakeholder classification problem into a natural language inference task, utilizing contextual information from news articles and external knowledge to enhance the accuracy of stakeholder type detection. Moreover, our proposed model showcases efficacy in zero-shot settings, further extending its applicability to diverse news contexts.

References

[1]
Khudran Alzhrani. 2020. Ideology Detection of Personalized Political News Coverage: A New Dataset. In Proceedings of the 2020 4th International Conference on Compute and Data Analysis (Silicon Valley, CA, USA) (ICCDA '20). Association for Computing Machinery, New York, NY, USA, 10--15. https://doi.org/10.1145/3388142.3388149
[2]
Adrian Benton, Raman Arora, and Mark Dredze. 2016. Learning multiview embeddings of twitter users. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). 14--19.
[3]
Amar Budhiraja, Ankur Sharma, Rahul Agrawal, Monojit Choudhury, and Joyojeet Pal. 2021. American politicians diverge systematically, Indian politicians do so chaotically: text embeddings as a window into party polarization. In Proceedings of the International AAAI Conference on Web and Social Media, Vol. 15. 1054--1058.
[4]
Ming-Wei Chang, Lev-Arie Ratinov, Dan Roth, and Vivek Srikumar. 2008. Importance of Semantic Representation: Dataless Classification. In Aaai, Vol. 2. 830--835.
[5]
Xingyuan Chen, Yunqing Xia, Peng Jin, and John Carroll. 2015. Dataless text classification with descriptive LDA. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 29.
[6]
Jesse Dunietz and Dan Gillick. 2014. A new entity salience task with millions of training examples. In Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, volume 2: Short Papers. 205--209.
[7]
Jakob-Moritz Eberl, Hajo G Boomgaarden, and Markus Wagner. 2017. One bias fits all? Three types of media bias and their effects on party preferences. Communication Research, Vol. 44, 8 (2017), 1125--1148.
[8]
Michael Gamon, Tae Yano, Xinying Song, Johnson Apacible, and Patrick Pantel. 2013a. Identifying salient entities in web pages. In Proceedings of the 22nd ACM International Conference on Information & Knowledge Management (San Francisco, California, USA) (CIKM '13). Association for Computing Machinery, New York, NY, USA, 2375--2380. https://doi.org/10.1145/2505515.2505602
[9]
Michael Gamon, Tae Yano, Xinying Song, Johnson Apacible, and Patrick Pantel. 2013b. Understanding document aboutness-step one: Identifying salient entities. Microsoft Research (2013), 2.
[10]
David Grönberg and Marco Kuhlmann. 2021. Extracting Salient Named Entities from Financial News Articles. https://api.semanticscholar.org/CorpusID:263675377
[11]
Jie Gu, Feng Wang, Qinghui Sun, Zhiquan Ye, Xiaoxiao Xu, Jingmin Chen, and Jun Zhang. 2021. Exploiting behavioral consistence for universal user representation. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. 4063--4071.
[12]
Ehsan-Ul Haq, Haris Bin Zia, Reza Hadi Mogavi, Gareth Tyson, Yang K Lu, Tristan Braud, and Pan Hui. 2023. A Twitter Dataset for Pakistani Political Discourse. arXiv preprint arXiv:2301.06316 (2023).
[13]
P Sol Hart, Sedona Chinn, and Stuart Soroka. 2020. Politicization and polarization in COVID-19 news coverage. Science communication, Vol. 42, 5 (2020), 679--697.
[14]
Salomon Kabongo, Jennifer D'Souza, and Sören Auer. 2023. Zero-shot Entailment of Leaderboards for Empirical AI Research. arXiv preprint arXiv:2303.16835 (2023).
[15]
Mike Lewis, Yinhan Liu, Naman Goyal, Marjan Ghazvininejad, Abdel rahman Mohamed, Omer Levy, Veselin Stoyanov, and Luke Zettlemoyer. 2019. BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension. In Annual Meeting of the Association for Computational Linguistics. https://api.semanticscholar.org/CorpusID:204960716
[16]
Xiao Liu, Yanan Zheng, Zhengxiao Du, Ming Ding, Yujie Qian, Zhilin Yang, and Jie Tang. 2021. GPT Understands, Too. arXiv:2103.10385 (2021).
[17]
Shon Otmazgin, Arie Cattan, and Yoav Goldberg. 2023. LingMess: Linguistically Informed Multi Expert Scorers for Coreference Resolution. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, Andreas Vlachos and Isabelle Augenstein (Eds.). Association for Computational Linguistics, Dubrovnik, Croatia, 2752--2760. https://doi.org/10.18653/v1/2023.eacl-main.202
[18]
Shimei Pan and Tao Ding. 2019. Social media-based user embedding: A literature review. arXiv preprint arXiv:1907.00725 (2019).
[19]
Marco Ponza, Diego Ceccarelli, Paolo Ferragina, Edgar Meij, and Sambhav Kothari. 2021. Contextualizing trending entities in news stories. In Proceedings of the 14th ACM International Conference on Web Search and Data Mining. 346--354.
[20]
Tim Repke and Ralf Krestel. 2021. Extraction and representation of financial entities from text. In Data Science for Economics and Finance: Methodologies and Applications. Springer International Publishing Cham, 241--263.
[21]
Anirban Sen, A Agarwal, Aditya Guru, A Choudhuri, G Singh, Imran Mohammed, J Goyal, K Mittal, Manpreet Singh, Mridul Goel, et al. 2018. Leveraging Web Data to Monitor Changes in Corporate-Government Interlocks in India. In Proceedings of the 1st ACM SIGCAS Conference on Computing and Sustainable Societies. 1--11.
[22]
Anirban Sen, Priya Chhillar, Pooja Aggarwal, Sravan Verma, Debanjan Ghatak, Priya Kumari, Manpreet Singh Agandh, Aditya Guru, and Aaditeshwar Seth. 2019. An attempt at using mass media data to analyze the political economy around some key ICTD policies in India. In Proceedings of the Tenth International Conference on Information and Communication Technologies and Development. 1--11.
[23]
Ankur Sharma, Navreet Kaur, Anirban Sen, and Aaditeshwar Seth. 2020. Ideology detection in the indian mass media. In 2020 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM). IEEE, 627--634.
[24]
Yangqiu Song and Dan Roth. 2014. On dataless hierarchical text classification. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 28.
[25]
Lei Tang and Huan Liu. 2010. Community detection and mining in social media. Morgan & Claypool Publishers.
[26]
Di Wang, Marcus Thint, and Ahmad Al-Rubaie. 2012. Semi-supervised latent Dirichlet allocation and its application for document classification. In 2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology, Vol. 3. IEEE, 306--310.
[27]
John Wilkerson and Andreu Casas. 2017. Large-scale computerized text analysis in political science: Opportunities and challenges. Annual Review of Political Science, Vol. 20 (2017), 529--544.
[28]
Chuan Wu, Evangelos Kanoulas, and Maarten de Rijke. 2020. It all starts with entities: A Salient entity topic model. Natural Language Engineering, Vol. 26, 5 (2020), 531--549.
[29]
Wenpeng Yin, Jamaal Hay, and Dan Roth. 2019. Benchmarking zero-shot text classification: Datasets, evaluation and entailment approach. arXiv preprint arXiv:1909.00161 (2019).
[30]
Liu Zhuang, Lin Wayne, Shi Ya, and Zhao Jun. 2021. A Robustly Optimized BERT Pre-training Approach with Post-training. In Proceedings of the 20th Chinese National Conference on Computational Linguistics, Sheng Li, Maosong Sun, Yang Liu, Hua Wu, Kang Liu, Wanxiang Che, Shizhu He, and Gaoqi Rao (Eds.). Chinese Information Processing Society of China, Huhhot, China, 1218--1227. https://aclanthology.org/2021.ccl-1.108

Index Terms

  1. From Text to Context: An Entailment Approach for News Stakeholder Classification

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGIR '24: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval
    July 2024
    3164 pages
    ISBN:9798400704314
    DOI:10.1145/3626772
    Publication rights licensed to ACM. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 11 July 2024

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. natural language inference
    2. news content analysis
    3. news stakeholders
    4. zero-shot classification

    Qualifiers

    • Short-paper

    Conference

    SIGIR 2024
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 792 of 3,983 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 67
      Total Downloads
    • Downloads (Last 12 months)67
    • Downloads (Last 6 weeks)15
    Reflects downloads up to 10 Dec 2024

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media