More Web Proxy on the site http://driver.im/

Article

Exploiting redundancy in question answering

Authors:

Charles L. A. Clarke,

Gordon V. Cormack,

Thomas R. LynamAuthors Info & Claims

SIGIR '01: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval

Pages 358 - 365

https://doi.org/10.1145/383952.384024

Published: 01 September 2001 Publication History

Abstract

Our goal is to automatically answer brief factual questions of the form ``When was the Battle of Hastings?'' or ``Who wrote The Wind in the Willows?''. Since the answer to nearly any such question can now be found somewhere on the Web, the problem reduces to finding potential answers in large volumes of data and validating their accuracy. We apply a method for arbitrary passage retrieval to the first half of the problem and demonstrate that answer redundancy can be used to address the second half. The success of our approach depends on the idea that the volume of available Web data is large enough to supply the answer to most factual questions multiple times and in multiple contexts. A query is generated from a question and this query is used to select short passages that may contain the answer from a large collection of Web data. These passages are analyzed to identify candidate answers. The frequency of these candidates within the passages is used to ``vote'' for the most likely answer. The approach is experimentally tested on questions taken from the TREC-9 question-answering test collection. As an additional demonstration, the approach is extended to answer multiple choice trivia questions of the form typically asked in trivia quizzes and television game shows.

References

[1]

Eric Breck, John Burger, David House, Marc Light, and Inderjeet Mani. Question answering from large document collections. In 1999 AAAI Fall Symposium on Question Answering Systems, North Falmouth, MA, 1999.

[2]

Sergey Brin and Lawrence Page. The anatomy ofa large-scale hypertextual web search engine. In Seventh World Wide Web Conference, Brisbane, Australia, April 1998.

Digital Library

[3]

Claire Cardie. Empirical methods in information extraction. AI Magazine, 18(4):65-79, Winter 1997.

Digital Library

[4]

Claire Cardie, Vincent Ng, David Pierce, and Chris Buckley. Examining the role of statistical and linguistic knowledge sources in a general-knowledge question-answering system. In Sixth Applied Natural Language Processing Conference, pages 180-187, 2000.

Digital Library

[5]

C. L. A. Clarke, G. V. Cormack, D. I. E. Kisman, and T. R. Lynam. Question answering by passage selection. In 9th Text REtrieval Conference, Gaithersburg, MD, 2000.

[6]

Charles L. A. Clarke, Gordon V. Cormack, and Elizabeth A. Tudhope. Relevance ranking for one to three term queries. Information Processing and Management, 36(2):291-311, 2000.

Digital Library

[7]

G. V. Cormack, C. L. A. Clarke, C. R. Palmer, and D. I. E. Kisman. Fast automatic passage ranking. In 8th Text REtrieval Conference, Gaithersburg, MD, November 1999.

[8]

Anne Diekema, Xiaoyong Liu, Jiangping Chen, Hudong Wang, Nancy McCracken, Ozgur Yilmazel, and Elizabeth D. Liddy. Question answering: CNLP at the TREC-9 question answering track. In 9th Text REtrieval Conference, Gaithersburg, MD, 2000.

[9]

Ralph Grishman and Beth Sundheim. Design of the MUC-6 evaluation. In 6th Message Understanding Conference, Columbia, MD, 1995.

Digital Library

[10]

Sanda M. Harabagiu and Steven J. Maiorano. Finding answers in large collections of texts: Paragraph indexing + abductive inference. In 1999 AAAI Fall Symposium on Question Answering Systems, pages 63-71, North Falmouth, MA, 1999.

[11]

Sanda M. Harabagiu, Dan Moldovan, Marius Pacsca, Rada Mihalcea, Mihai Surdeanu, Razvan Bunescu, Roxana Girju, Vasile Rus, and Paul Morarescu. FALCON: Boosting knowledge for answer engines. In 9th Text REtrieval Conference, Gaithersburg, MD, 2000.

[12]

David Hawking, Nick Craswell, Paul Thistlewaite, and Donna Harman. Results and challenges in Web search evaluation. In 8th World Wide Web Conference, pages 243-252, Toronto, Canada, May 1999.

Digital Library

[13]

Eduard Hovy, Ulf Hermjakob, Chin-Yew Lin, Mike Junk, and Laurie Gerber. The Webclopedia. In 9th Text REtrieval Conference, Gaithersburg, MD, 2000.

[14]

Abraham Ittycheriah, Martin Franz, Wei-Jing Zhu, and Adwait Ratnaparkhi. IBM's statistical question answering system. In 9th Text REtrieval Conference, Gaithersburg, MD, 2000.

[15]

Julian Kupiec. MURAX: A robust linguistic approach for question answering using an on-line encyclopedia. In 16th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 181-190, Pittsburgh, 1993.

Digital Library

[16]

John Prager, Eric Brown, Amni Coden, and Dragomir Radev. Question-answering by predictive annotation. In 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 184-191, Athens, August 2000.

Digital Library

[17]

John Prager, Dragomir R Radev, Eric Brown, Amni Coden, and Valerie Samn. The use of predictive annotation for question answering in TREC-8. In 8th Text REtrieval Conference, Gaithersburg, MD, 1999.

[18]

Dragomir R Radev, John Prager, and Valerie Samn. Ranking suspected answers to natural language questions using predictive annotation. In 6th Conference on Applied Natural Language Processing, Seattle, May 2000.

Digital Library

[19]

Alan F. Smeaton. Information Retrieval: Still butting heads with natural language processing. In Information Extraction: A Multidisciplinary Approach to an Emerging Information Technology, pages 115-138, Frascati, Italy, July 1997. Reprinted as Springer-Verlag Lecture Notes in Artificial Intelligence 1299.

[20]

Rohini Srihari and Wei Li. Information extraction supported question answering. In 8th Text REtrieval Conference, Gaithersburg, MD, 1999.

[21]

Ellen M. Voorhees and Donna Harman, editors. Proceedings of the Ninth Text REtrieval Conference, Gaithersburg, MD, 2000. See trec.nist.goc.

[22]

Ellen M. Voorhees and Dawn Tice. Building a question answering test collection. In 23rd Annual International ACM SIGIR Conference onResearch and Development in Information Retrieval, pages 200-207, Athens, August 2000.

Digital Library

[23]

W. A. Woods, Stephen Green, Paul Martin, and Ann Houston. Halfway to question answering. In 9th Text REtrieval Conference, Gaithersburg, MD, 2000.

[24]

Xiaolan Zhu and Susan Gauch. Incorporating quality metrics in centralized/distributed information retrieval on the WWW. In 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 288-295, Athens, August 2000.

Digital Library

Cited By

Kamalloo EClarke CRafiei DChen HDuh WHuang HKato MMothe JPoblete B(2023)Limitations of Open-Domain Question Answering Benchmarks for Document-level ReasoningProceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3539618.3592011(2123-2128)Online publication date: 19-Jul-2023
https://dl.acm.org/doi/10.1145/3539618.3592011
Yadegari MKamalloo ERafiei DAmigo ECastells PGonzalo JCarterette BCulpepper JKazai G(2022)Detecting Frozen Phrases in Open-Domain Question AnsweringProceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3477495.3531793(1990-1996)Online publication date: 6-Jul-2022
https://dl.acm.org/doi/10.1145/3477495.3531793
Su LZhang RGuo JFan YChen JLan YCheng XLewin-Eytan LCarmel DYom-Tov EAgichtein EGabrilovich E(2021)Beyond Relevance: Trustworthy Answer Selection via Consensus VerificationProceedings of the 14th ACM International Conference on Web Search and Data Mining10.1145/3437963.3441781(562-570)Online publication date: 8-Mar-2021
https://dl.acm.org/doi/10.1145/3437963.3441781
Show More Cited By

Index Terms

Exploiting redundancy in question answering
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
2. Information systems
  1. Information retrieval

Recommendations

Quality-aware collaborative question answering: methods and evaluation
WSDM '09: Proceedings of the Second ACM International Conference on Web Search and Data Mining

Community Question Answering (QA) portals contain questions and answers contributed by hundreds of millions of users. These databases of questions and answers are of great value if they can be used directly to answer questions from any user. In this ...
A community question-answering refinement system
HT '11: Proceedings of the 22nd ACM conference on Hypertext and hypermedia

Community Question Answering (CQA) websites, which archive millions of questions and answers created by CQA users to provide a rich resource of information that is missing at web search engines and QA websites, have become increasingly popular. Web users ...
Will my question be answered? predicting "Question Answerability" in community question-answering sites
ECMLPKDD'13: Proceedings of the 2013th European Conference on Machine Learning and Knowledge Discovery in Databases - Volume Part III

All askers who post questions in Community-based Question Answering (CQA) sites such as Yahoo! Answers, Quora or Baidu's Zhidao, expect to receive an answer, and are frustrated when their questions remain unanswered. We propose to provide a type of "...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

SIGIR '01: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval

September 2001

454 pages

ISBN:1581133316

DOI:10.1145/383952

Chairmen:
Donald H. Kraft
Louisiana State Univ.
,
W. Bruce Croft
University of Massachusetts, (For the Americas)
,
David J. Harper
The Robert Gordon University, (For Europe and Africa)
,
Justin Zobel
RMIT University, (For Asia and Australasia)

Copyright © 2001 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGIR: ACM Special Interest Group on Information Retrieval

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 September 2001

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Qualifiers

Article

Conference

SIGIR01

Sponsor:

SIGIR

SIGIR01: 24th ACM/SIGIR International Conference on Research and Development in Information Retrieval

Louisiana, New Orleans, USA

Acceptance Rates

SIGIR '01 Paper Acceptance Rate 47 of 201 submissions, 23%;

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

118
Total Citations
View Citations
938
Total Downloads

Downloads (Last 12 months)13
Downloads (Last 6 weeks)7

Reflects downloads up to 23 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

Kamalloo EClarke CRafiei DChen HDuh WHuang HKato MMothe JPoblete B(2023)Limitations of Open-Domain Question Answering Benchmarks for Document-level ReasoningProceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3539618.3592011(2123-2128)Online publication date: 19-Jul-2023
https://dl.acm.org/doi/10.1145/3539618.3592011
Yadegari MKamalloo ERafiei DAmigo ECastells PGonzalo JCarterette BCulpepper JKazai G(2022)Detecting Frozen Phrases in Open-Domain Question AnsweringProceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3477495.3531793(1990-1996)Online publication date: 6-Jul-2022
https://dl.acm.org/doi/10.1145/3477495.3531793
Su LZhang RGuo JFan YChen JLan YCheng XLewin-Eytan LCarmel DYom-Tov EAgichtein EGabrilovich E(2021)Beyond Relevance: Trustworthy Answer Selection via Consensus VerificationProceedings of the 14th ACM International Conference on Web Search and Data Mining10.1145/3437963.3441781(562-570)Online publication date: 8-Mar-2021
https://dl.acm.org/doi/10.1145/3437963.3441781
Garg ABadal TMukhopadhyay D(2021)QA System: Business Intelligence in HealthcareAdvanced Computing10.1007/978-981-16-0401-0_16(212-223)Online publication date: 11-Feb-2021
https://doi.org/10.1007/978-981-16-0401-0_16
Clarke C(2018)Web Question AnsweringEncyclopedia of Database Systems10.1007/978-1-4614-8265-9_1363(4637-4643)Online publication date: 7-Dec-2018
https://doi.org/10.1007/978-1-4614-8265-9_1363
Zhang HRao JLin JSmucker MKando NSakai TJoho HLi Hde Vries AWhite R(2017)Automatically Extracting High-Quality Negative Examples for Answer Selection in Question AnsweringProceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3077136.3080645(797-800)Online publication date: 7-Aug-2017
https://dl.acm.org/doi/10.1145/3077136.3080645
Bachrach YZukov-Gregoric ACoope STovell EMaksak BRodriguez JMcMurtie CBordbar M(2017)An Attention Mechanism for Neural Answer Selection Using a Combined Global and Local View2017 IEEE 29th International Conference on Tools with Artificial Intelligence (ICTAI)10.1109/ICTAI.2017.00072(425-432)Online publication date: Nov-2017
https://doi.org/10.1109/ICTAI.2017.00072
Wang XLi ZTang J(2017)Multimedia news QAImage and Vision Computing10.1016/j.imavis.2017.01.00460:C(162-170)Online publication date: 1-Apr-2017
https://dl.acm.org/doi/10.1016/j.imavis.2017.01.004
Amarger FChanet JHaemmerlé OHernandez NRoussey C(2016)Knowledge Engineering Method Based on Consensual Knowledge and Trust Computation: The MUSCKA SystemGraph-Based Representation and Reasoning10.1007/978-3-319-40985-6_14(177-190)Online publication date: 10-Jun-2016
https://doi.org/10.1007/978-3-319-40985-6_14
Clarke C(2016)Web Question AnsweringEncyclopedia of Database Systems10.1007/978-1-4899-7993-3_1363-2(1-7)Online publication date: 28-Dec-2016
https://doi.org/10.1007/978-1-4899-7993-3_1363-2
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents