[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1772690.1772712acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

A generalized framework of exploring category information for question retrieval in community question answer archives

Published: 26 April 2010 Publication History

Abstract

Community Question Answering (CQA) has emerged as a popular type of service where users ask and answer questions and access historical question-answer pairs. CQA archives contain very large volumes of questions organized into a hierarchy of categories. As an essential function of CQA services, question retrieval in a CQA archive aims to retrieve historical question-answer pairs that are relevant to a query question. In this paper, we present a new approach to exploiting category information of questions for improving the performance of question retrieval, and we apply the approach to existing question retrieval models, including a state-of-the-art question retrieval model. Experiments conducted on real CQA data demonstrate that the proposed techniques are capable of outperforming a variety of baseline methods significantly.

References

[1]
E. Agichtein, C. Castillo, D. Donato, A. Gionis, and G. Mishne. Finding high-quality content in social media. In WSDM, pp. 183--194, 2008.
[2]
A. Berger, R. Caruana, D. Cohn, D. Freitag, and V. Mittal. Bridging the lexical chasm: statistical approaches to answer-finding. In SIGIR, pp. 192--199, 2000.
[3]
J. Bian, Y. Liu, E. Agichtein, and H. Zha. Finding the right facts in the crowd: factoid question answering over social media. In WWW, pp. 467--476, 2008.
[4]
R. D. Burke, K. J. Hammond, V. A. Kulyukin, S. L. Lytinen, N. Tomuro, and S. Schoenberg. Question answering from frequently asked question files: Experiences with the faq finder system. AI Magazine, 18(2):57--66, 1997.
[5]
X. Cao, G. Cong, B. Cui, C. S. Jensen, and C. Zhang. The use of categorization information in language models for question retrieval. In CIKM, pp. 265--274, 2009.
[6]
C. Chekuri, M. H. Goldwasser, P. Raghavan, and E. Upfal. Web search using automatic classification. In WWW, 1997.
[7]
H. Duan, Y. Cao, C.-Y. Lin, and Y. Yu. Searching questions by identifying question topic and question focus. In ACL-HLT, pp. 156--164, 2008.
[8]
H. Fang, T. Tao, and C. Zhai. A formal study of information retrieval heuristics. In SIGIR, pp. 49--56, 2004.
[9]
J. Jeon, W. B. Croft, and J. H. Lee. Finding semantically similar questions based on their answers. In SIGIR, pp. 617--618, 2005.
[10]
J. Jeon, W. B. Croft, and J. H. Lee. Finding similar questions in large question and answer archives. In CIKM, pp. 84--90,2005.
[11]
V. Jijkoun and M. de Rijke. Retrieving answers from frequently asked questions pages on the web. In CIKM, pp. 76--83, 2005.
[12]
W. Lam, M. Ruiz, and P. Srinivasan. Automatic text categorization and its application to text retrieval. IEEE TKDE, 11(6):865--879, 1999.
[13]
Y. Liu, J. Bian, and E. Agichtein. Predicting information seeker satisfaction in community question answering. In SIGIR, pp. 483--490, 2008.
[14]
S. Riezler, A. Vasserman, I. Tsochantaridis, V. O. Mittal, and Y. Liu. Statistical machine translation for query expansion in answer retrieval. In ACL, pp. 464--471, 2007.
[15]
S. Robertson, S. Walker, S. Jones, M. Hancock-Beaulieu, and M. Gatford. Okapi at trec-3. In TREC, pp. 109--126, 1994.
[16]
A. Singhal, C. Buckley, and M. Mitra. Pivoted document length normalization. In SIGIR, pp. 21--29, 1996.
[17]
R. Soricut and E. Brill. Automatic question answering: Beyond the factoid. In HLT-NAACL, pp. 57--64, 2004.
[18]
K. Wang, Z. Ming, and T.-S. Chua. A syntactic tree matching approach to finding similar questions in community-based qa services. In SIGIR, pp. 187--194, 2009.
[19]
X. Xue, J. Jeon, and W. B. Croft. Retrieval models for question and answer archives. In SIGIR, pp. 475--482, 2008.
[20]
C. Zhai and J. Lafferty. A study of smoothing methods for language models applied to information retrieval. ACM TOIS, 22(2):179--214, 2004.
[21]
J. Zobel and A. Moffat. Inverted files for text search engines. ACM Computing Surveys, 38(2), 56 pages, 2006.

Cited By

View all
  • (2023)Aligning Image Semantics and Label Concepts for Image Multi-Label ClassificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/355027819:2(1-23)Online publication date: 6-Feb-2023
  • (2023)I Know What You Are Searching for: Code Snippet Recommendation from Stack Overflow PostsACM Transactions on Software Engineering and Methodology10.1145/355015032:3(1-42)Online publication date: 26-Apr-2023
  • (2023)Robust Searching-Based Gradient Collaborative Management in Intelligent Transportation SystemACM Transactions on Multimedia Computing, Communications, and Applications10.1145/354993920:2(1-23)Online publication date: 27-Sep-2023
  • Show More Cited By

Index Terms

  1. A generalized framework of exploring category information for question retrieval in community question answer archives

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    WWW '10: Proceedings of the 19th international conference on World wide web
    April 2010
    1407 pages
    ISBN:9781605587998
    DOI:10.1145/1772690

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 26 April 2010

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. categorization
    2. question retrieval
    3. question-answering services

    Qualifiers

    • Research-article

    Conference

    WWW '10
    WWW '10: The 19th International World Wide Web Conference
    April 26 - 30, 2010
    North Carolina, Raleigh, USA

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)16
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 01 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Aligning Image Semantics and Label Concepts for Image Multi-Label ClassificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/355027819:2(1-23)Online publication date: 6-Feb-2023
    • (2023)I Know What You Are Searching for: Code Snippet Recommendation from Stack Overflow PostsACM Transactions on Software Engineering and Methodology10.1145/355015032:3(1-42)Online publication date: 26-Apr-2023
    • (2023)Robust Searching-Based Gradient Collaborative Management in Intelligent Transportation SystemACM Transactions on Multimedia Computing, Communications, and Applications10.1145/354993920:2(1-23)Online publication date: 27-Sep-2023
    • (2023)The Statistics of Eye Movements and Binocular Disparities during VR Gaming: Implications for Headset DesignACM Transactions on Graphics10.1145/354952942:1(1-15)Online publication date: 19-Jan-2023
    • (2023)Exploiting Rateless Codes and Cross-layer Optimization for Low-power Wide-area NetworksACM Transactions on Sensor Networks10.1145/354456018:4(1-24)Online publication date: 31-Jan-2023
    • (2023)A Large-Scale Synthetic Gait Dataset Towards in-the-Wild Simulation and Comparison StudyACM Transactions on Multimedia Computing, Communications, and Applications10.1145/351719919:1(1-23)Online publication date: 5-Jan-2023
    • (2022)On the Analysis and Evaluation of Proximity-based Load-balancing PoliciesACM Transactions on Modeling and Performance Evaluation of Computing Systems10.1145/35499337:2-4(1-27)Online publication date: 26-Nov-2022
    • (2022)A Novel Time-Interval Based Modulation for Large-Scale, Low-Power, Wide-Area-NetworksACM Transactions on Sensor Networks10.1145/354954318:4(1-30)Online publication date: 29-Nov-2022
    • (2022)Power Converter Circuit Design Automation Using Parallel Monte Carlo Tree SearchACM Transactions on Design Automation of Electronic Systems10.1145/354953828:2(1-33)Online publication date: 24-Dec-2022
    • (2022)FENCE: Feasible Evasion Attacks on Neural Networks in Constrained EnvironmentsACM Transactions on Privacy and Security10.1145/354474625:4(1-34)Online publication date: 21-Jul-2022
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    EPUB

    View this article in ePub.

    ePub

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media