[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.3115/1073083.1073153dlproceedingsArticle/Chapter ViewAbstractPublication PagesaclConference Proceedingsconference-collections
Article
Free access

Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews

Published: 06 July 2002 Publication History

Abstract

This paper presents a simple unsupervised learning algorithm for classifying reviews as recommended (thumbs up) or not recommended (thumbs down). The classification of a review is predicted by the average semantic orientation of the phrases in the review that contain adjectives or adverbs. A phrase has a positive semantic orientation when it has good associations (e.g., "subtle nuances") and a negative semantic orientation when it has bad associations (e.g., "very cavalier"). In this paper, the semantic orientation of a phrase is calculated as the mutual information between the given phrase and the word "excellent" minus the mutual information between the given phrase and the word "poor". A review is classified as recommended if the average semantic orientation of its phrases is positive. The algorithm achieves an average accuracy of 74% when evaluated on 410 reviews from Epinions, sampled from four different domains (reviews of automobiles, banks, movies, and travel destinations). The accuracy ranges from 84% for automobile reviews to 66% for movie reviews.

References

[1]
Agresti, A. 1996. An introduction to categorical data analysis. New York: Wiley.]]
[2]
Brill, E. 1994. Some advances in transformation-based part of speech tagging. Proceedings of the Twelfth National Conference on Artificial Intelligence (pp. 722--727). Menlo Park, CA: AAAI Press.]]
[3]
Church, K. W., & Hanks, P. 1989. Word association norms, mutual information and lexicography. Proceedings of the 27th Annual Conference of the ACL (pp. 76--83). New Brunswick, NJ: ACL.]]
[4]
Frank, E., & Hall, M. 2001. A simple approach to ordinal classification. Proceedings of the Twelfth European Conference on Machine Learning (pp. 145--156). Berlin: Springer-Verlag.]]
[5]
Hatzivassiloglou, V., & McKeown, K. R. 1997. Predicting the semantic orientation of adjectives. Proceedings of the 35th Annual Meeting of the ACL and the 8th Conference of the European Chapter of the ACL (pp. 174--181). New Brunswick, NJ: ACL.]]
[6]
Hatzivassiloglou, V., & Wiebe, J. M. 2000. Effects of adjective orientation and gradability on sentence subjectivity. Proceedings of 18th International Conference on Computational Linguistics. New Brunswick, NJ: ACL.]]
[7]
Hearst, M. A. 1992. Direction-based text interpretation as an information access refinement. In P. Jacobs (Ed.), Text-Based Intelligent Systems: Current Research and Practice in Information Extraction and Retrieval. Mahwah, NJ: Lawrence Erlbaum Associates.]]
[8]
Landauer, T. K., & Dumais, S. T. 1997. A solution to Plato's problem: The latent semantic analysis theory of the acquisition, induction, and representation of knowledge. Psychological Review, 104, 211--240.]]
[9]
Santorini, B. 1995. Part-of-Speech Tagging Guidelines for the Penn Treebank Project (3rd revision, 2nd printing). Technical Report, Department of Computer and Information Science, University of Pennsylvania.]]
[10]
Spertus, E. 1997. Smokey: Automatic recognition of hostile messages. Proceedings of the Conference on Innovative Applications of Artificial Intelligence (pp. 1058--1065). Menlo Park, CA: AAAI Press.]]
[11]
Tong, R. M. 2001. An operational system for detecting and tracking opinions in on-line discussions. Working Notes of the ACM SIGIR 2001 Workshop on Operational Text Classification (pp. 1--6). New York, NY: ACM.]]
[12]
Turney, P. D. 2001. Mining the Web for synonyms: PMI-IR versus LSA on TOEFL. Proceedings of the Twelfth European Conference on Machine Learning (pp. 491--502). Berlin: Springer-Verlag.]]
[13]
Wiebe, J. M. 2000. Learning subjective adjectives from corpora. Proceedings of the 17th National Conference on Artificial Intelligence. Menlo Park, CA: AAAI Press.]]
[14]
Wiebe, J. M., Bruce, R., Bell, M., Martin, M., & Wilson, T. 2001. A corpus study of evaluative and speculative language. Proceedings of the Second ACL SIG on Dialogue Workshop on Discourse and Dialogue. Aalborg, Denmark.]]

Cited By

View all
  1. Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image DL Hosted proceedings
      ACL '02: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
      July 2002
      543 pages

      Publisher

      Association for Computational Linguistics

      United States

      Publication History

      Published: 06 July 2002

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate 85 of 443 submissions, 19%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)250
      • Downloads (Last 6 weeks)37
      Reflects downloads up to 11 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Topic Modeling for Mining Opinion Aspects from a Customer Feedback CorpusAutomatic Documentation and Mathematical Linguistics10.3103/S000510552401006058:1(63-79)Online publication date: 2-Apr-2024
      • (2024)TextVista: NLP-Enriched Time-Series Text Data VisualizationsProceedings of the 50th Graphics Interface Conference10.1145/3670947.3670971(1-12)Online publication date: 3-Jun-2024
      • (2024)User satisfaction with Arabic COVID-19 appsInformation Processing and Management: an International Journal10.1016/j.ipm.2024.10364461:3Online publication date: 2-Jul-2024
      • (2023)Multimodal Sentiment Analysis: A Survey of Methods, Trends, and ChallengesACM Computing Surveys10.1145/358607555:13s(1-38)Online publication date: 13-Jul-2023
      • (2023)Improving sentiment domain adaptation for Arabic using an unsupervised self-labeling frameworkInformation Processing and Management: an International Journal10.1016/j.ipm.2023.10333860:3Online publication date: 1-May-2023
      • (2023)Identifying equipment health status from maintenance records using Lexicon based Unsupervised Sentiment Analysis Adjusted for Negation (LUSAA-N)Computers and Industrial Engineering10.1016/j.cie.2023.109693186:COnline publication date: 1-Dec-2023
      • (2023)Exploring Emotion Trends in Product Reviews: A Multi-modal Analysis with Malicious Comment Filtering and User Privacy ProtectionInformation Security and Cryptology10.1007/978-981-97-0942-7_19(379-396)Online publication date: 9-Dec-2023
      • (2022)Building an enhanced sentiment classification framework based on natural language processingJournal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology10.3233/JIFS-21927843:2(1771-1777)Online publication date: 1-Jan-2022
      • (2022)Survey on Aspect Category DetectionACM Computing Surveys10.1145/354455755:7(1-37)Online publication date: 15-Dec-2022
      • (2022)Text-to-Speech Synthesis: Literature Review with an Emphasis on Malayalam LanguageACM Transactions on Asian and Low-Resource Language Information Processing10.1145/350139721:4(1-56)Online publication date: 19-Jan-2022
      • Show More Cited By

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media