[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.3115/1218955.1218982dlproceedingsArticle/Chapter ViewAbstractPublication PagesaclConference Proceedingsconference-collections
Article
Free access

An empirical study of information synthesis tasks

Published: 21 July 2004 Publication History

Abstract

This paper describes an empirical study of the "Information Synthesis" task, defined as the process of (given a complex information need) extracting, organizing and inter-relating the pieces of information contained in a set of relevant documents, in order to obtain a comprehensive, non redundant report that satisfies the information need.Two main results are presented: a) the creation of an Information Synthesis testbed with 72 reports manually generated by nine subjects for eight complex topics with 100 relevant documents each; and b) an empirical comparison of similarity metrics between reports, under the hypothesis that the best metric is the one that best distinguishes between manual and automatically generated reports. A metric based on key concepts overlap gives better results than metrics based on n-gram overlap (such as ROUGE) or sentence overlap.

References

[1]
P. Clarkson and R. Rosenfeld. 1997. Statistical language modeling using the CMU-Cambridge toolkit. In Proceeding of Eurospeech '97, Rhodes, Greece.
[2]
J. Goldstein, V. O. Mittal, J. G. Carbonell, and J. P. Callan. 2000. Creating and Evaluating Multi-Document Sentence Extract Summaries. In Proceedings of Ninth International Conferences on Information Knowledge Management (CIKM'00), pages 165--172, McLean, VA.
[3]
H. V. Halteren and S. Teufel. 2003. Examining the Consensus between Human Summaries: Initial Experiments with Factoids Analysis. In HLT/NAACL-2003 Workshop on Automatic Summarization, Edmonton, Canada.
[4]
V. Khandelwal, R. Gupta, and J. Allan. 2001. An Evaluation Corpus for Temporal Summarization. In Proceedings of the First International Conference on Human Language Technology Research (HLT 2001), Tolouse, France.
[5]
C. Lin and E. H. Hovy. 2003. Automatic Evaluation of Summaries Using N-gram Co-ocurrence Statistics. In Proceeding of the 2003 Language Technology Conference (HLT-NAACL 2003), Edmonton, Canada.
[6]
I. Mani. 2001. Automatic Summarization, volume 3 of Natural Language Processing. John Benjamins Publishing Company, Amsterdam/Philadelphia.
[7]
C. D. Manning and H. Schutze. 1999. Foundations of statistical natural language processing. MIT Press, Cambridge Mass.
[8]
P. Over. 2003. Introduction to DUC-2003: An Intrinsic Evaluation of Generic News Text Summarization Systems. In Proceedings of Workshop on Automatic Summarization (DUC 2003).
[9]
K. Papineni, S. Roukos, T. Ward, and W. Zhu. 2002. Bleu: a method for automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL), pages 311--318, Philadelphia.
[10]
C. Peters, M. Braschler, J. Gonzalo, and M. Kluck, editors. 2002. Evaluation of Cross-Language Information Retrieval Systems, volume 2406 of Lecture Notes in Computer Science. Springer-Verlag, Berlin-Heidelberg-New York.
[11]
D. R. Radev, J. Hongyan, and M. Budzikowska. 2000. Centroid-Based Summarization of Multiple Documents: Sentence Extraction, Utility-Based Evaluation, and User Studies. In Proceedings of the Workshop on Automatic Summarization at the 6th Applied Natural Language Processing Conference and the 1st Conference of the North American Chapter of the Association for Computational Linguistics, Seattle, WA, April.

Cited By

View all
  • (2011)Using semantic information to answer complex questionsProceedings of the 24th Canadian conference on Advances in artificial intelligence10.5555/2018192.2018201(68-73)Online publication date: 25-May-2011
  • (2009)Automatic summarization of MEDLINE citations for evidence-based medical treatmentJournal of Biomedical Informatics10.1016/j.jbi.2008.10.00242:5(801-813)Online publication date: 1-Oct-2009
  • (2008)UNED at WebCLEF 2008Proceedings of the 9th Cross-language evaluation forum conference on Evaluating systems for multilingual and multimodal information access10.5555/1813809.1813930(798-801)Online publication date: 17-Sep-2008
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
ACL '04: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
July 2004
729 pages

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 21 July 2004

Qualifiers

  • Article

Acceptance Rates

Overall Acceptance Rate 85 of 443 submissions, 19%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)45
  • Downloads (Last 6 weeks)10
Reflects downloads up to 11 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2011)Using semantic information to answer complex questionsProceedings of the 24th Canadian conference on Advances in artificial intelligence10.5555/2018192.2018201(68-73)Online publication date: 25-May-2011
  • (2009)Automatic summarization of MEDLINE citations for evidence-based medical treatmentJournal of Biomedical Informatics10.1016/j.jbi.2008.10.00242:5(801-813)Online publication date: 1-Oct-2009
  • (2008)UNED at WebCLEF 2008Proceedings of the 9th Cross-language evaluation forum conference on Evaluating systems for multilingual and multimodal information access10.5555/1813809.1813930(798-801)Online publication date: 17-Sep-2008
  • (2006)DUC 2005Proceedings of the Workshop on Task-Focused Summarization and Question Answering10.5555/1654679.1654689(48-55)Online publication date: 23-Jul-2006
  • (2006)Dimensionality reduction aids term co-occurrence based multi-document summarizationProceedings of the Workshop on Task-Focused Summarization and Question Answering10.5555/1654679.1654681(1-7)Online publication date: 23-Jul-2006
  • (2006)The role of information retrieval in answering complex questionsProceedings of the COLING/ACL on Main conference poster sessions10.5555/1273073.1273141(523-530)Online publication date: 17-Jul-2006
  • (2006)Will pyramids built of nuggets topple over?Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics10.3115/1220835.1220884(383-390)Online publication date: 4-Jun-2006
  • (2006)Answer extraction, semantic clustering, and extractive summarization for clinical question answeringProceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics10.3115/1220175.1220281(841-848)Online publication date: 17-Jul-2006
  • (2005)Using random walks for question-focused sentence retrievalProceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing10.3115/1220575.1220690(915-922)Online publication date: 6-Oct-2005
  • (2005)QARLAProceedings of the 43rd Annual Meeting on Association for Computational Linguistics10.3115/1219840.1219875(280-289)Online publication date: 25-Jun-2005
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media