[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
article

Feature selection for sentiment analysis based on content and syntax models

Published: 01 November 2012 Publication History

Abstract

Recent solutions for sentiment analysis have relied on feature selection methods ranging from lexicon-based approaches where the set of features are generated by humans, to approaches that use general statistical measures where features are selected solely on empirical evidence. The advantage of statistical approaches is that they are fully automatic, however, they often fail to separate features that carry sentiment from those that do not. In this paper we propose a set of new feature selection schemes that use a Content and Syntax model to automatically learn a set of features in a review document by separating the entities that are being reviewed from the subjective expressions that describe those entities in terms of polarities. By focusing only on the subjective expressions and ignoring the entities, we can choose more salient features for document-level sentiment analysis. The results obtained from using these features in a maximum entropy classifier are competitive with the state-of-the-art machine learning approaches.

References

[1]
Das, Sanjiv R. and Chen, Mike Y., Yahoo! for Amazon: sentiment extraction from small talk on the Web. Management Science. v53 i9. 1375-1388.
[2]
A holistic lexicon-based approach to opinion mining. In: Proceedings of the Conference on Web Search and Web Data Mining (WSDM),
[3]
Griffiths, Thomas L., Steyvers, Mark, Blei, David M. and Tenenbaum, Joshua B., Integrating topics and syntax. In: In Advances in Neural Information Processing Systems, 17. MIT Press. pp. 537-544.
[4]
Predicting the semantic orientation of adjectives. In: Proceedings of the Joint ACL/EACL Conference, pp. 174-181.
[5]
Hatzivassiloglou, Vasileios and Wiebe, Janyce, Effects of adjective orientation and gradability on sentence subjectivity. In: Proceedings of the International Conference on Computational Linguistics (COLING),
[6]
Horrigan, John A., Online shopping. In: Pew Internet & American Life Project Report,
[7]
Hu, Minqing and Liu, Bing, Mining opinion features in customer reviews. In: Proceedings of AAAI, pp. 755-760.
[8]
Hurst, Matthew and Nigam, Kamal, Retrieving topical sentiments from online document collections. In: Document Recognition and Retrieval XI, pp. 27-34.
[9]
Kim, Soo-Min and Hovy, Eduard, Determining the sentiment of opinions. In: Proceedings of the International Conference on Computational Linguistics (COLING),
[10]
Foundations of Statistical Natural Language Processing. 1999. MIT Press, Cambridge, MA, USA.
[11]
Pang, Bo and Lee, Lillian, A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In: Proceedings of the Association for Computational Linguistics (ACL), pp. 271-278.
[12]
Pang, Bo and Lee, Lillian, Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval. v2. 1-135.
[13]
Pang, Bo, Lee, Lillian and Vaithyanathan, Shivakumar, Thumbs up? Sentiment classification using machine learning techniques. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 79-86.
[14]
Riloff, Ellen, Wiebe, Janyce and Wilson, Theresa, Learning subjective nouns using extraction pattern bootstrapping. In: Proceedings of the Conference on Natural Language Learning (CoNLL), pp. 25-32.
[15]
Sebastiani, Fabrizio, Machine learning in automated text categorization. ACM Computing Surveys. v34 i1. 1-47.
[16]
Steyvers, Mark and Griffiths, Tom, Probabilistic topic models. In: Landauer, T., Mcnamara, D., Dennis, S., Kintsch, W. (Eds.), Latent Semantic Analysis: A Road to Meaning, Laurence Erlbaum.
[17]
Turney, Peter, Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In: Proceedings of the Association for Computational Linguistics (ACL), pp. 417-424.
[18]
Whitelaw, Casey, Garg, Navendu and Argamon, Shlomo, Using appraisal groups for sentiment analysis. In: Proceedings of the ACM SIGIR Conference on Information and Knowledge Management (CIKM), ACM. pp. 625-631.
[19]
Wiebe, Janyce, Learning subjective adjectives from corpora. In: Proceedings of AAAI,
[20]
Wilson, Theresa, Wiebe, Janyce and Hoffmann, Paul, Recognizing contextual polarity in phrase-level sentiment analysis. In: Proceedings of the Human Language Technology Conference and the Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP), pp. 347-354.
[21]
Yi, Jeonghee, Nasukawa, Tetsuya, Bunescu, Razvan and Niblack, Wayne, Sentiment analyzer: Extracting sentiments about a given topic using natural language processing techniques. In: Proceedings of the IEEE International Conference on Data Mining (ICDM),

Cited By

View all
  • (2024)Product Redesign and Innovation Based on Online ReviewsINFORMS Journal on Computing10.1287/ijoc.2022.033336:3(742-765)Online publication date: 1-May-2024
  • (2023)Deep Recurrent Neural Model for Multi Domain Sentiment Analysis with Attention MechanismWireless Personal Communications: An International Journal10.1007/s11277-023-10274-x130:1(43-60)Online publication date: 22-Mar-2023
  • (2023)Emotion and sentiment analysis for intelligent customer service conversation using a multi-task ensemble frameworkCluster Computing10.1007/s10586-023-04073-z27:2(2099-2115)Online publication date: 22-Jun-2023
  • Show More Cited By
  1. Feature selection for sentiment analysis based on content and syntax models

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image Decision Support Systems
      Decision Support Systems  Volume 53, Issue 4
      November, 2012
      229 pages

      Publisher

      Elsevier Science Publishers B. V.

      Netherlands

      Publication History

      Published: 01 November 2012

      Author Tags

      1. Content and Syntax models
      2. Feature selection
      3. Maximum entropy modeling
      4. Sentiment analysis
      5. Text classification
      6. Topic modeling

      Qualifiers

      • Article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 18 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Product Redesign and Innovation Based on Online ReviewsINFORMS Journal on Computing10.1287/ijoc.2022.033336:3(742-765)Online publication date: 1-May-2024
      • (2023)Deep Recurrent Neural Model for Multi Domain Sentiment Analysis with Attention MechanismWireless Personal Communications: An International Journal10.1007/s11277-023-10274-x130:1(43-60)Online publication date: 22-Mar-2023
      • (2023)Emotion and sentiment analysis for intelligent customer service conversation using a multi-task ensemble frameworkCluster Computing10.1007/s10586-023-04073-z27:2(2099-2115)Online publication date: 22-Jun-2023
      • (2022)Review on sentiment analysis for text classification techniques from 2010 to 2021Multimedia Tools and Applications10.1007/s11042-022-14112-382:6(8137-8193)Online publication date: 1-Dec-2022
      • (2022)Identifying Structural Holes for Sentiment ClassificationInformation Systems Frontiers10.1007/s10796-021-10185-x24:5(1735-1751)Online publication date: 1-Oct-2022
      • (2022)An Investigation of Peripheral and Central Cues of Online Customer Review Voting and Helpfulness through the Lens of Elaboration Likelihood ModelInformation Systems Frontiers10.1007/s10796-020-10069-624:1(211-231)Online publication date: 1-Feb-2022
      • (2022)A survey on sentiment analysis methods, applications, and challengesArtificial Intelligence Review10.1007/s10462-022-10144-155:7(5731-5780)Online publication date: 7-Feb-2022
      • (2021)Information-Theoretic Feature Selection via Tensor Decomposition and SubmodularityIEEE Transactions on Signal Processing10.1109/TSP.2021.312514769(6195-6205)Online publication date: 1-Jan-2021
      • (2021)360 degree view of cross-domain opinion classification: a surveyArtificial Intelligence Review10.1007/s10462-020-09884-954:2(1385-1506)Online publication date: 1-Feb-2021
      • (2020)A Weakly Supervised WordNet-Guided Deep Learning Approach to Extracting Aspect Terms from Online ReviewsACM Transactions on Management Information Systems10.1145/339963011:3(1-22)Online publication date: 21-Jul-2020
      • Show More Cited By

      View Options

      View options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media