[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3508230.3508232acmotherconferencesArticle/Chapter ViewAbstractPublication PagesnlpirConference Proceedingsconference-collections
research-article

Aspect-Based Sentiment Analysis of Social Media Data With Pre-Trained Language Models

Published: 08 March 2022 Publication History

Abstract

There is a great scope in utilizing the increasing content expressed by users on social media platforms such as Twitter. This study explores the application of Aspect-based Sentiment Analysis (ABSA) of tweets to retrieve fine-grained sentiment insights. The Plant-based food domain is chosen as an area of focus. To the best of our knowledge this is the first time ABSA task is done for this sector and it is distinct from standard food products because different and controversial aspects arise and opinions are polarized. The choice is relevant because these products can help in meeting the sustainable development goals and improve the welfare of millions of animals. Pre-trained BERT,”Bidirectional Encoder Representations with transformers”, is fine-tuned for this task and stands out because it was trained to learn from all the words in the sentence simultaneously using transformers. The aim was to develop methods to be applied on real life cases, therefore lowering the dependency on labeled data and improving performance were the key objectives. This research contributes to existing approaches of ABSA by proposing data processing techniques to adapt social media data for ABSA. The scope of this project presents a new method for the aspect category detection task (ACD) which does not rely on labeled data by using regular expressions (Regex). For aspect the sentiment classification task (ASC) a semi-supervised learning technique is explored. Additionally Part-of-Speech (POS) tags are incorporated into the predictions. The findings show that Regex is a solution to eliminate the dependency on labeled data for ACD. For ASC fine-tuning BERT on a small subset of data was the most accurate method to lower the dependency on aspect level sentiment data.

References

[1]
Miklós Véha Márk Szakály Zoltán Szakály András Fehér, MichałVéha Gazdecki. 2020. A Comprehensive Review of the Benefits of and the Barriers to the Switch to a Plant-Based Diet. Sustainability (2020).
[2]
Dogu Araci. 2019. Finbert: Financial sentiment analysis with pre-trained language models. arXiv preprint arXiv:1908.10063(2019).
[3]
Udayan Khurana Elias B. Khalil Authors: Fatemeh Nargesian, Horst Samulowitz and Deepak Turaga. 2017. Learning Feature Engineering for Classification. ACM Digital Library (2017).
[4]
Jamshid Bagherzadeh and Hasan Asil. 2019. A review of various semi-supervised learning models with a deep learning and memory approach. Iran Journal of Computer Science 2, 2 (2019), 65–80.
[5]
The World Bank. 2020. Population, total. https://data.worldbank.org/indicator/SP.POP.TOTL. Accessed: 04/12/2021.
[6]
Samuel R. Bowman, Ellie Pavlick, Edouard Grave, Benjamin Van Durme, Alex Wang, Jan Hula, Patrick Xia, Raghavendra Pappagari, R. Thomas McCoy, Roma Patel, Najoung Kim, Ian Tenney, Yinghui Huang, Katherin Yu, Shuning Jin, and Berlin Chen. 2019. Looking for ELMo’s friends: Sentence-Level Pretraining Beyond Language Modeling. https://openreview.net/forum?id=Bkl87h09FX
[7]
Paweł Budzianowski and Ivan Vulić. 2019. Hello, It’s GPT-2 – How Can I Help You? Towards the Use of Pretrained Language Models for Task-Oriented Dialogue Systems. arxiv:1907.05774 [cs.CL]
[8]
Bingli Clark Chai, Johannes Reidar van der Voort, Kristina Grofelnik, Helga Gudny Eliasdottir, Ines Klöss, and Federico J. A. Perez-Cueto. 2019. Which diet has the least environmental impact on our planet? A systematic review of vegan, vegetarian and omnivorous diets. Sustainability 11, 15 (2019), 4110.
[9]
Saranlita Chotirata and PhayungMeesad. 2021. Part-of-Speech tagging enhancement to natural language processing for Thai wh-question classification with deep learning. Science Direct (2021).
[10]
Anirban Choudhury. 2019. A Deep Dive Analysis of Customer Sentiments in the Food Service Industry | Quantzig’s New Success Story. https://apnews.com/press-release/pr-businesswire/58df0387e8dc46479849a6cb3078eb29. Accessed: 2020-11-15.
[11]
Junyoung Chung, Caglar Gulcehre, KyungHyun Cho, and Yoshua Bengio. 2014. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling. arxiv:1412.3555 [cs.NE]
[12]
Jessica Clement. 2019. Twitter: monthly active users worldwide | Statista. https://www.statista.com/statistics/282087/number-of-monthly-active-twitter-users/. Accessed: 2020-11-13.
[13]
Brian Dean. 2021. Social Network Usage & Growth Statistics: How Many People Use Social Media in 2021?https://backlinko.com/social-media-users#social-media-usage-stats. Accessed: 04/12/2021.
[14]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805(2018).
[15]
Hai Ha Do, P. W. C. Prasad, Angelika Maag, and Abeer Alsadoon. 2019. Deep learning for aspect-based sentiment analysis: a comparative review. Expert Systems with Applications 118 (2019), 272–299.
[16]
Mickel Hoang, Oskar Alija Bihorac, and Jacobo Rouces. 2019. Aspect-based sentiment analysis using bert. In NEAL Proceedings of the 22nd Nordic Conference on Computional Linguistics (NoDaLiDa), September 30-October 2, Turku, Finland. Linköping University Electronic Press, 187–196.
[17]
Minqing Hu and Bing Liu. 2004. Mining and summarizing customer reviews. In Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining. 168–177.
[18]
Cristela Maia Bairrada Isabel Miguel, Arnaldo Coelho. 2021. Modelling Attitude towards Consumption of Vegan Products. Sustainability (2021).
[19]
Ahmet Iscen, Giorgos Tolias, Yannis Avrithis, and Ondrej Chum. 2019. Label Propagation for Deep Semi-supervised Learning. arxiv:1904.04717 [cs.CV]
[20]
Zhao Jianqiang and Gui Xiaolin. 2017. Comparison research on text pre-processing methods on twitter sentiment analysis. IEEE Access 5(2017), 2870–2879.
[21]
A.M. Kuchling. [n.d.]. Regular Expression HOWTO. Technical Report. Python.
[22]
Mary l. McHugh. 2012. Interrater reliability: the kappa statistic. Biochemia Medica (2012).
[23]
Ning Li, Chi-Yin Chow, and Jia-Dong Zhang. 2020. SEML: A Semi-Supervised Multi-Task Learning Framework for Aspect-Based Sentiment Analysis. IEEE Access 8(2020), 189287–189297.
[24]
I. Montan M. Honnibal. 2017. spaCy2: Natural language understanding with bloom embeddings, convolutional neural networks and incremental parsing. Zenodo (2017).
[25]
Esteban Ortiz-Ospina. 2019. The rise of social media. https://ourworldindata.org/rise-of-social-media. Accessed: 2020-11-12.
[26]
Tal Perry. 2020. Context is King! Why Deep Learning matters for NLP. https://www.lighttag.io/blog/context-is-king/. Accessed: 2020-11-09.
[27]
Maria Pontiki, Dimitrios Galanis, Haris Papageorgiou, Ion Androutsopoulos, Suresh Manandhar, Mohammad Al-Smadi, Mahmoud Al-Ayyoub, Yanyan Zhao, Bing Qin, Orphée De Clercq, 2016. Semeval-2016 task 5: Aspect based sentiment analysis. In 10th International Workshop on Semantic Evaluation (SemEval 2016).
[28]
Maria Pontiki, Dimitris Galanis, John Pavlopoulos, Harris Papageorgiou, Ion Androutsopoulos, and Suresh Manandhar. 2014. SemEval-2014 Task 4: Aspect Based Sentiment Analysis. In Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014). Association for Computational Linguistics, Dublin, Ireland, 27–35. https://doi.org/10.3115/v1/S14-2004
[29]
Paramita Ray and Amlan Chakrabarti. 2020. A mixed approach of deep learning method and rule-based method to improve aspect level sentiment analysis. Applied Computing and Informatics(2020).
[30]
Sebastian Ruder, Matthew E Peters, Swabha Swayamdipta, and Thomas Wolf. 2019. Transfer learning in natural language processing. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Tutorials. 15–18.
[31]
Marzieh Saeidi, Guillaume Bouchard, Maria Liakata, and Sebastian Riedel. 2016. SentiHood: Targeted Aspect Based Sentiment Analysis Dataset for Urban Neighbourhoods. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers. The COLING 2016 Organizing Committee, Osaka, Japan, 1546–1556. https://www.aclweb.org/anthology/C16-1146
[32]
Yasuhiro Fujiwara Sekitoshi Kana and Sotetsu Iwamura. 2017. Preventing Gradient Explosionsin Gated Recurrent Units. ACM Digital Library (2017).
[33]
Peng Shi and Jimmy Lin. 2019. Simple bert models for relation extraction and semantic role labeling. arXiv preprint arXiv:1904.05255(2019).
[34]
Sam Shleifer. 2019. Low Resource Text Classification with ULMFit and Backtranslation. arxiv:1903.09244 [cs.CL]
[35]
Chi Sun, Luyao Huang, and Xipeng Qiu. 2019. Utilizing BERT for aspect-based sentiment analysis via constructing auxiliary sentence. arXiv preprint arXiv:1903.09588(2019).
[36]
Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. 2014. Sequence to Sequence Learning with Neural Networks. arxiv:1409.3215 [cs.CL]
[37]
Bo Wang and Min Liu. 2015. Deep learning for aspect-based sentiment analysis. Stanford University report(2015).
[38]
Hu Xu, Bing Liu, Lei Shu, and Philip S. Yu. 2019. Bert post-training for review reading comprehension and aspect-based sentiment analysis. arXiv preprint arXiv:1904.02232(2019).
[39]
Zhilin Yang, Zihang Dai, Yiming Yang, Jaime Carbonell, Ruslan Salakhutdinov, and Quoc V. Le. 2020. XLNet: Generalized Autoregressive Pretraining for Language Understanding. arxiv:1906.08237 [cs.CL]
[40]
Paul R. Yarnold. 2016. ODA vs. ϕ and κ: Paradoxes of Kappa. Optimal Data Analysis(2016).
[41]
Danny Zhu. 2020. Sentiment Analysis for Impossible Burger. https://rpubs.com/DannyZhu/Impossible_Burger. Accessed: 2020-11-15.

Cited By

View all
  • (2024)Enhanced Local and Global Context Focus Mechanism Using BART Model for Aspect Based Sentiment Analysis of Social Media DataIEEE Access10.1109/ACCESS.2024.347181612(145768-145781)Online publication date: 2024
  • (2023)Unveiling Sustainability in Ecommerce: GPT-Powered Software for Identifying Sustainable Product FeaturesSustainability10.3390/su15151201515:15(12015)Online publication date: 4-Aug-2023
  • (2023)Sentiment Analysis for the Natural Environment: A Systematic ReviewACM Computing Surveys10.1145/360460556:4(1-37)Online publication date: 10-Nov-2023

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
NLPIR '21: Proceedings of the 2021 5th International Conference on Natural Language Processing and Information Retrieval
December 2021
175 pages
ISBN:9781450387354
DOI:10.1145/3508230
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 March 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. ABSA
  2. Aspect-Based Sentiment Analysis
  3. BERT
  4. POS tags
  5. plant based Domain
  6. semi-supervised
  7. social media data.

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

NLPIR 2021

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)93
  • Downloads (Last 6 weeks)9
Reflects downloads up to 18 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Enhanced Local and Global Context Focus Mechanism Using BART Model for Aspect Based Sentiment Analysis of Social Media DataIEEE Access10.1109/ACCESS.2024.347181612(145768-145781)Online publication date: 2024
  • (2023)Unveiling Sustainability in Ecommerce: GPT-Powered Software for Identifying Sustainable Product FeaturesSustainability10.3390/su15151201515:15(12015)Online publication date: 4-Aug-2023
  • (2023)Sentiment Analysis for the Natural Environment: A Systematic ReviewACM Computing Surveys10.1145/360460556:4(1-37)Online publication date: 10-Nov-2023
  • (2023)A novel framework for aspect based sentiment analysis using a hybrid BERT (HybBERT) modelMultimedia Tools and Applications10.1007/s11042-023-17647-1Online publication date: 21-Nov-2023

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media