[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.3115/1218955.1218989dlproceedingsArticle/Chapter ViewAbstractPublication PagesaclConference Proceedingsconference-collections
Article
Free access

The sentimental factor: improving review classification via human-provided information

Published: 21 July 2004 Publication History

Abstract

Sentiment classification is the task of labeling a review document according to the polarity of its prevailing opinion (favorable or unfavorable). In approaching this problem, a model builder often has three sources of information available: a small collection of labeled documents, a large collection of unlabeled documents, and human understanding of language. Ideally, a learning method will utilize all three sources. To accomplish this goal, we generalize an existing procedure that uses the latter two.We extend this procedure by re-interpreting it as a Naive Bayes model for document sentiment. Viewed as such, it can also be seen to extract a pair of derived features that are linearly combined to predict sentiment. This perspective allows us to improve upon previous methods, primarily through two strategies: incorporating additional derived features into the model and, where possible, using labeled data to estimate their relative influence.

References

[1]
Kushal Dave, Steve Lawrence, and David M. Pennock. 2003. Mining the peanut gallery: Opinion extraction and semantic classification of product reviews.
[2]
C. Fellbaum. 1998. Wordnet an electronic lexical database.
[3]
T. Hastie, R. Tibshirani, and J. Friedman. 2001. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer-Verlag.
[4]
Vasileios Hatzivassiloglou and Kathleen R. McKeown. 1997. Predicting the semantic orientation of adjectives. In Philip R. Cohen and Wolfgang Wahlster, editors, Proceedings of the Thirty-Fifth Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics, pages 174--181, Somerset, New Jersey. Association for Computational Linguistics.
[5]
Satoshi Morinaga, Kenji Yamanishi, Kenji Tateishi, and Toshikazu Fukushima. 2002. Mining product reputations on the web.
[6]
Ng, A. Y. and Jordan, M. 2002. On discriminative vs. generative classifiers: A comparison of logistic regression and naive bayes. Advances in Neural Information Processing Systems, 14.
[7]
Kamal Nigam, Andrew K. McCallum, Sebastian Thrun, and Tom M. Mitchell. 2000. Text classification from labeled and unlabeled documents using EM. Machine Learning, 39(2/3): 103--134.
[8]
Bo Pang, Lillian Lee, and Shivakumar Vaithyanathan. 2002. Thumbs up? sentiment classification using machine learning techniques. In Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing (EMNLP).
[9]
P. D. Turney and M. L. Littman. 2002. Unsupervised learning of semantic orientation from a hundred-billion-word corpus.
[10]
Peter Turney. 2002. Thumbs up or thumbs down? semantic orientation applied to unsupervised classification of reviews. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL'02), pages 417--424, Philadelphia, Pennsylvania. Association for Computational Linguistics.
[11]
Janyce Wiebe. 2000. Learning subjective adjectives from corpora. In Proc. 17th National Conference on Artificial Intelligence (AAAI-2000), Austin, Texas.
[12]
Jian Zhang and Yiming Yang. 2003. "robustness of regularized linear classification methods in text categorization". In Proceedings of the 26th Annual International ACM SIGIR Conference (SIGIR 2003).

Cited By

View all
  • (2017)Document-level sentiment classification using hybrid machine learning approachKnowledge and Information Systems10.1007/s10115-017-1055-z53:3(805-831)Online publication date: 1-Dec-2017
  • (2016)Classification of sentiment reviews using n-gram machine learning approachExpert Systems with Applications: An International Journal10.1016/j.eswa.2016.03.02857:C(117-126)Online publication date: 15-Sep-2016
  • (2014)Review-based measurement of customer satisfaction in mobile serviceExpert Systems with Applications: An International Journal10.1016/j.eswa.2013.07.10141:4(1041-1050)Online publication date: 1-Mar-2014
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
ACL '04: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
July 2004
729 pages

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 21 July 2004

Qualifiers

  • Article

Acceptance Rates

Overall Acceptance Rate 85 of 443 submissions, 19%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)31
  • Downloads (Last 6 weeks)4
Reflects downloads up to 19 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2017)Document-level sentiment classification using hybrid machine learning approachKnowledge and Information Systems10.1007/s10115-017-1055-z53:3(805-831)Online publication date: 1-Dec-2017
  • (2016)Classification of sentiment reviews using n-gram machine learning approachExpert Systems with Applications: An International Journal10.1016/j.eswa.2016.03.02857:C(117-126)Online publication date: 15-Sep-2016
  • (2014)Review-based measurement of customer satisfaction in mobile serviceExpert Systems with Applications: An International Journal10.1016/j.eswa.2013.07.10141:4(1041-1050)Online publication date: 1-Mar-2014
  • (2010)Cross-Domain Contextualization of Sentiment LexiconsProceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence10.5555/1860967.1861118(771-776)Online publication date: 4-Aug-2010
  • (2009)Sentiment classification of online Cantonese reviews by supervised machine learning approachesInternational Journal of Web Engineering and Technology10.1504/IJWET.2009.0322545:4(382-397)Online publication date: 1-Mar-2009
  • (2009)Domain-specific sentiment analysis using contextual feature generationProceedings of the 1st international CIKM workshop on Topic-sentiment analysis for mass opinion10.1145/1651461.1651469(37-44)Online publication date: 6-Nov-2009
  • (2009)A survey on sentiment detection of reviewsExpert Systems with Applications: An International Journal10.1016/j.eswa.2009.02.06336:7(10760-10773)Online publication date: 1-Sep-2009
  • (2009)Sentiment classification of online reviews to travel destinations by supervised machine learning approachesExpert Systems with Applications: An International Journal10.1016/j.eswa.2008.07.03536:3(6527-6535)Online publication date: 1-Apr-2009
  • (2008)Sentiment analysis in multiple languagesACM Transactions on Information Systems10.1145/1361684.136168526:3(1-34)Online publication date: 20-Jun-2008
  • (2006)Predicting the political sentiment of web log posts using supervised machine learning techniques coupled with feature selectionProceedings of the 8th Knowledge discovery on the web international conference on Advances in web mining and web usage analysis10.5555/1784815.1784826(187-206)Online publication date: 20-Aug-2006
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media