[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1835449.1835517acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Proximity-based opinion retrieval

Published: 19 July 2010 Publication History

Abstract

Blog post opinion retrieval aims at finding blog posts that are relevant and opinionated about a user's query. In this paper we propose a simple probabilistic model for assigning relevant opinion scores to documents. The key problem is how to capture opinion expressions in the document, that are related to the query topic. Current solutions enrich general opinion lexicons by finding query-specific opinion lexicons using pseudo-relevance feedback on external corpora or the collection itself. In this paper we use a general opinion lexicon and propose using proximity information in order to capture opinion term relatedness to the query. We propose a proximity-based opinion propagation method to calculate the opinion density at each point in a document. The opinion density at the position of a query term in the document can then be considered as the probability of opinion about the query term at that position. The effect of different kernels for capturing the proximity is also discussed. Experimental results on the BLOG06 dataset show that the proposed method provides significant improvement over standard TREC baselines and achieves a 2.5% increase in MAP over the best performing run in the TREC 2008 blog track.

References

[1]
K. Dave, S. Lawrence, and D. M. Pennock. Mining the peanut gallery: opinion extraction and semantic classification of product reviews. In Proceedings of WWW '03, pages 519--528, 2003.
[2]
K. Eguchi and V. Lavrenko. Sentiment retrieval using generative models. In Proceedings of EMNLP'06, pages 345--354, 2006.
[3]
A. Esuli and F. Sebastiani. Sentiwordnet: A publicly available lexical resource for opinion mining. In Proceedings of LREC '06, pages 417--422, 2006.
[4]
N. Fuhr. Probabilistic models in information retrieval. Proceedings of Comput. J., 35(3):243--255, 1992.
[5]
M. Gamon. Sentiment classification on customer feedback data: noisy data, large feature vectors, and the role of linguistic analysis. In COLING '04, page 841, 2004.
[6]
B. He, C. Macdonald, I. Ounis, J. Peng, and R. L. Santos. University of glasgow at TREC 2008: Experiments in blog, enterprise, and relevance feedback tracks with terrier. In Proceedings of TREC'08, 2008.
[7]
M. Hurst and K. Nigam. Retrieving topical sentiments from online document collections. In Document Recognition and Retrieval XI, pages 27--34, 2004.
[8]
L. Jia, C. T. Yu, and W. Zhang. UIC at TREC 2008 blog track. In Proceedings of TREC'08.
[9]
Y. Lee, S.-H. Na, J. Kim, S.-H. Nam, H.-Y. Jung, and J.-H. Lee. KLE at TREC 2008 blog track: Blog post and feed retrieval. In Proceedings of TREC'08, 2008.
[10]
Y. Lv and C. Zhai. Positional language models for information retrieval. In SIGIR '09, pages 299--306, 2009.
[11]
C. Macdonald, B. He, I. Ounis, and I. Soboro. Limits of opinion-finding baseline systems. In SIGIR '08, pages 747--748, 2008.
[12]
C. Macdonald, I. Ounis, and I. Soboro. Overview of the TREC-2007 blog track. In Proceedings of TREC'07, 2007.
[13]
T. Mullen and N. Collier. Sentiment analysis using support vector machines with diverse information sources. In Proceedings of EMNLP'04, pages 412--418, 2004.
[14]
S.-H. Na, Y. Lee, S.-H. Nam, and J.-H. Lee. Improving opinion retrieval based on query-specific sentiment lexicon. In ECIR '09, pages 734--738, 2009.
[15]
I. Ounis, M. de Rijke, C. Macdonald, G. Mishne, and I. Soboro. Overview of the TREC-2006 blog track. In Proceedings of TREC'06, 2006.
[16]
I. Ounis, C. Macdonald, and I. Soboro. Overview of the TREC-2008 blog track. In Proceedings of TREC'08, 2008.
[17]
B. Pang, L. Lee, and S. Vaithyanathan. Thumbs up?: sentiment classification using machine learning techniques. In Proceedings of EMNLP '02, pages 79--86, 2002.
[18]
J. M. Ponte and W. B. Croft. A language modeling approach to information retrieval. In Proceedings of SIGIR '98, pages 275--281, 1998.
[19]
R. L. Santos, B. He, C. Macdonald, and I. Ounis. Integrating proximity to subjective sentences for blog opinion retrieval. In Proceedings of ECIR'09, pages 325--336, 2003.
[20]
P. D. Turney. Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In ACL'02, pages 417--424, 2002.
[21]
P. D. Turney and M. L. Littman. Measuring praise and criticism: Inference of semantic orientation from association. ACM Trans. Inf. Syst., 21(4):315--346, 2003.
[22]
O. Vechtomova. Facet-based opinion retrieval from blogs. Inf. Process. Manage., 46(1):71--88, 2010.
[23]
K. Yang. WIDIT in TREC 2008 blog track: Leveraging multiple sources of opinion evidence. In Proceedings of TREC'08, 2008.
[24]
J. Yi, T. Nasukawa, R. Bunescu, and W. Niblack. Sentiment analyzer: Extracting sentiments about a given topic using natural language processing techniques. In ICDM '03, page 427, 2003.
[25]
M. Zhang and X. Ye. A generation model to unify topic relevance and lexicon-based sentiment for opinion retrieval. In SIGIR '08, pages 411--418, 2008.
[26]
W. Zhang, C. Yu, and W. Meng. Opinion retrieval from blogs. In CIKM '07, pages 831--840, 2007.

Cited By

View all
  • (2021)Term position‐based language model for information retrievalJournal of the Association for Information Science and Technology10.1002/asi.2443172:5(627-642)Online publication date: 10-Apr-2021
  • (2018)Review on Recent Advances in Information Mining From Big Consumer Opinion Data for Product DesignJournal of Computing and Information Science in Engineering10.1115/1.404108719:1Online publication date: 17-Sep-2018
  • (2017)A language-model-based approach for subjectivity detectionJournal of Information Science10.1177/016555151664181843:3(356-377)Online publication date: 1-Jun-2017
  • Show More Cited By

Index Terms

  1. Proximity-based opinion retrieval

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGIR '10: Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
    July 2010
    944 pages
    ISBN:9781450301534
    DOI:10.1145/1835449
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 19 July 2010

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. blog
    2. opinion
    3. proximity
    4. retrieval
    5. sentiment

    Qualifiers

    • Research-article

    Conference

    SIGIR '10
    Sponsor:

    Acceptance Rates

    SIGIR '10 Paper Acceptance Rate 87 of 520 submissions, 17%;
    Overall Acceptance Rate 792 of 3,983 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)5
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 04 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2021)Term position‐based language model for information retrievalJournal of the Association for Information Science and Technology10.1002/asi.2443172:5(627-642)Online publication date: 10-Apr-2021
    • (2018)Review on Recent Advances in Information Mining From Big Consumer Opinion Data for Product DesignJournal of Computing and Information Science in Engineering10.1115/1.404108719:1Online publication date: 17-Sep-2018
    • (2017)A language-model-based approach for subjectivity detectionJournal of Information Science10.1177/016555151664181843:3(356-377)Online publication date: 1-Jun-2017
    • (2017)Time Sensitivity for Personalized Search2017 IEEE/ACS 14th International Conference on Computer Systems and Applications (AICCSA)10.1109/AICCSA.2017.77(585-592)Online publication date: Oct-2017
    • (2015)A Semantic Social Recommender System Using Ontologies Based Approach For Tunisian TourismADCAIJ: Advances in Distributed Computing and Artificial Intelligence Journal10.14201/ADCAIJ201541901064:1(90-106)Online publication date: 6-Oct-2015
    • (2015)Improving opinion retrieval in social media by combining features-based coreferencing and memory-based learningInformation Sciences: an International Journal10.1016/j.ins.2014.12.021299:C(20-31)Online publication date: 1-Apr-2015
    • (2015)An effective approach to tweets opinion retrievalWorld Wide Web10.1007/s11280-013-0268-718:3(545-566)Online publication date: 1-May-2015
    • (2015)Learning Ranked Sentiment LexiconsComputational Linguistics and Intelligent Text Processing10.1007/978-3-319-18117-2_3(35-48)Online publication date: 2015
    • (2014)GeoTime-based tag ranking model for automatic image annotationProceedings of the 29th Annual ACM Symposium on Applied Computing10.1145/2554850.2554866(896-901)Online publication date: 24-Mar-2014
    • (2014)Opinion retrieval through unsupervised topological learning2014 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN.2014.6889934(3130-3134)Online publication date: Jul-2014
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media