[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/1273073.1273132dlproceedingsArticle/Chapter ViewAbstractPublication PagescolingConference Proceedingsconference-collections
Article
Free access

Automatic construction of polarity-tagged corpus from HTML documents

Published: 17 July 2006 Publication History

Abstract

This paper proposes a novel method of building polarity-tagged corpus from HTML documents. The characteristics of this method is that it is fully automatic and can be applied to arbitrary HTML documents. The idea behind our method is to utilize certain layout structures and linguistic pattern. By using them, we can automatically extract such sentences that express opinion. In our experiment, the method could construct a corpus consisting of 126,610 sentences.

References

[1]
Kushal Dave, Steve Lawrence, and David M. Pennock. 2003. Mining the peanut gallery: Opinion extraction and semantic classification of product revews. In Proceedings of the WWW, pages 519--528.
[2]
Andrea Esuli and Fabrizio Sebastiani. 2005. Determining the semantic orientation of terms throush gloss classification. In Proceedings of the CIKM.
[3]
Vasileios Hatzivassiloglou and Katheleen R. McKeown. 1997. Predicting the semantic orientation of adjectives. In Proceedings of the ACL, pages 174--181.
[4]
Minqing Hu and Bing Liu. 2004. Mining and summarizing customer reviews. In Proceedings of the KDD, pages 168--177.
[5]
Jaap Kamps, Maarten Marx, Robert J. Mokken, and Maarten de Rijke. 2004. Using wordnet to measure semantic orientations of adjectives. In Proceedings of the LREC.
[6]
Satoshi Morinaga, Kenji Yamanishi, Kenji Tateishi, and Toshikazu Fukushima. 2002. Mining product reputations on the web. In Proceedings of the KDD.
[7]
Bo Pang, Lillian Lee, and Shivakumar Vaihyanathan. 2002. Thumbs up? sentiment classification using machine learning techniques. In Proceedings of the EMNLP.
[8]
Ellen Riloff and Janyce Wiebe. 2003. Learning extraction patterns for subjective expressions. In Proceedings of the EMNLP.
[9]
Ellen Riloff, Janyce Wiebe, and Theresa Wilson. 2003. Learning subjective nouns using extraction pattern bootstrapping. In Proceedings of the CoNLL.
[10]
Hiroya Takamura, Takashi Inui, and Manabu Okumura. 2005. Extracting semantic orientation of words using spin model. In Proceedings of the ACL, pages 133--140.
[11]
Peter D. Turney. 2002. Thumbs up or thumbs down? senmantic orientation applied to unsupervised classification of reviews. In Proceedings of the ACL, pages 417--424.
[12]
Janyce Wiebe and Ellen Riloff. 2005. Creating subjective and objective sentence classifiers from unannotated texts. In Proceedings of the CICLing.
[13]
Janyce M. Wiebe. 2000. Learning subjective adjectives from corpora. In Proceedings of the AAAI.
[14]
Theresa Wilson, Janyce Wiebe, and Paul Hoffmann. 2005. Recognizing contextual polarity in phrase-level sentiment analysis. In Proceedings of the HLT/EMNLP.
[15]
Hong Yu and Yasileios Hatzivassiloglou. 2003. Towards answering opinion questions: Separating facts from opinions and identifying the polarity of opinion sentences. In Proceedings of the EMNLP.

Cited By

View all
  1. Automatic construction of polarity-tagged corpus from HTML documents

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image DL Hosted proceedings
      COLING-ACL '06: Proceedings of the COLING/ACL on Main conference poster sessions
      July 2006
      992 pages

      Publisher

      Association for Computational Linguistics

      United States

      Publication History

      Published: 17 July 2006

      Qualifiers

      • Article

      Acceptance Rates

      COLING-ACL '06 Paper Acceptance Rate 126 of 126 submissions, 100%;
      Overall Acceptance Rate 1,537 of 1,537 submissions, 100%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)33
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 19 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2017)Current State of Text Sentiment Analysis from Opinion to Emotion MiningACM Computing Surveys10.1145/305727050:2(1-33)Online publication date: 25-May-2017
      • (2016)Opinion mining based on fuzzy domain ontology and Support Vector MachineApplied Soft Computing10.1016/j.asoc.2016.06.00347:C(235-250)Online publication date: 1-Oct-2016
      • (2015)Type-2 fuzzy ontology-based opinion mining and information extractionApplied Intelligence10.1007/s10489-014-0609-y42:3(481-500)Online publication date: 1-Apr-2015
      • (2010)A comparative study of Bayesian models for unsupervised sentiment detectionProceedings of the Fourteenth Conference on Computational Natural Language Learning10.5555/1870568.1870586(144-152)Online publication date: 15-Jul-2010
      • (2010)Dependency tree-based sentiment classification using CRFs with hidden variablesHuman Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics10.5555/1857999.1858119(786-794)Online publication date: 2-Jun-2010
      • (2009)Sentiment analysis of conditional sentencesProceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1 - Volume 110.5555/1699510.1699534(180-189)Online publication date: 6-Aug-2009
      • (2009)Joint sentiment/topic model for sentiment analysisProceedings of the 18th ACM conference on Information and knowledge management10.1145/1645953.1646003(375-384)Online publication date: 2-Nov-2009
      • (2008)Looking for troubleProceedings of the 22nd International Conference on Computational Linguistics - Volume 110.5555/1599081.1599105(185-192)Online publication date: 18-Aug-2008
      • (2008)Opinion Mining and Sentiment AnalysisFoundations and Trends in Information Retrieval10.1561/15000000112:1-2(1-135)Online publication date: 1-Jan-2008
      • (2008)A holistic lexicon-based approach to opinion miningProceedings of the 2008 International Conference on Web Search and Data Mining10.1145/1341531.1341561(231-240)Online publication date: 11-Feb-2008

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media