[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1007/11863878_10guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Towards next generation citeseer: a flexible architecture for digital library deployment

Published: 17 September 2006 Publication History

Abstract

CiteSeer began as the first search engine for scientific literature to incorporate Autonomous Citation Indexing, and has since grown to be a well-used, open archive for computer and information science publications, currently indexing over 730,000 academic documents. However, CiteSeer currently faces significant challenges that must be overcome in order to improve the quality of the service and guarantee that CiteSeer will continue to be a valuable, up-to-date resource well into the foreseeable future. This paper describes a new architectural framework for CiteSeer system deployment, named CiteSeer Plus. The new framework supports distributed indexing and storage for load balancing and fault-tolerance as well as modular service deployment to increase system flexibility and reduce maintenance costs. In order to facilitate novel approaches to information extraction, a blackboard framework is built into the architecture.

References

[1]
B. L. Buteau. A generic framework for distributed, cooperating blackboard systems. Proceedings of the 1990 ACM annual conference on Cooperation, p.358-365, February 20-22, 1990.
[2]
H. Chen, V. Dhar. A knowledge-based approach to the design of document-based retrieval systems. ACM SIGOIS Bulletin, v.11 n.2-3, p.281-290, Apr. 1990.
[3]
E. Garfield. Science Citation Index - A new dimension in indexing. Science, 144, pp. 649-654, 1964.
[4]
C.L. Giles, K. Bollacker and S. Lawrence. CiteSeer: An Automatic Citation Indexing System, Digital Libraries 98: Third ACM Conf. on Digital Libraries, ACM Press. New York, 1998, pp. 89-98.
[5]
C.L. Giles and I.G. Councill. Who gets acknowledged: measuring scientific contributions through automatic acknowledgement indexing. PNAS, 101, Number 51, pp. 17599-17604, 2004.
[6]
H. Han, C. Lee Giles, E. Manavoglu, H. Zha, Z. Zhang, E. A. Fox. Automatic Document Metadata Extraction using Support Vector Machines. Proceedings of the 2003 Joint Conference on Digital Libraries (JCDL03), 2003.
[7]
J. Lafferty, A. McCallum, and F. Pereira. Conditional Random Fields: Probabilistic models for segmenting and labeling sequence data. In International Conference on Machine Learning, 2001.
[8]
S. Lawrence, C. Lee Giles. Searching the World Wide Web. Science, 280, Number 5360, pp. 98-100, 1998.
[9]
T. R. Leek. Information extraction using hidden Markov models. Masters thesis, UC San Diego, 1997.
[10]
H. Penny Nii. Blackboard systems: The blackboard model of problem solving and the evolution of blackboard architectures. The AI Magazine, VII(2):38-53, Summer 1986.
[11]
T. O'Reilly. What Is Web 2.0 Design Patterns and Business Models for the Next Generation of Software. http://www.oreillynet.com/pub/a/oreilly/tim/news /2005/09/30/what-is-web-20.html
[12]
F. Peng and A. McCallum. Accurate information extraction from research papers using conditional random fields. Proceedings of Human Language Technology Conference and North American Chapter of the Association for Computational Linguistics(HLT-NAACL), pages 329336 (2004).
[13]
Y. Petinot, C. Lee Giles, V. Bhatnagar, P. B. Teregowda, H. Han, I. Councill. A Service-Oriented Architecture for Digital Libraries. ICSOC04, November 15-19, 2004.
[14]
K. Seymore, A. McCallum and R. Rosenfeld. Learning hidden Markov model structure for information extraction. In Papers from the AAAI-99 Workshop on Machine Learning for Information Extration, pages 3742, July 1999.
[15]
J. Stribling, I.G. Councill, M.F. Kaashoek, R. Morris, and S. Shenker. Overcite: A cooperative digital research library. In Proceedings of The International Workshop on Peer-To-Peer Systems (IPTPS 05), Ithaca, NY, 2005 .
[16]
H. Van de Sompel, P. Hochstenbach. Reference linking in a hybrid library environment. Part 1: Frameworks for linking. D-Lib Magazine, v.5 n.4, 1999.

Cited By

View all
  1. Towards next generation citeseer: a flexible architecture for digital library deployment

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image Guide Proceedings
    ECDL'06: Proceedings of the 10th European conference on Research and Advanced Technology for Digital Libraries
    September 2006
    566 pages
    ISBN:3540446362
    • Editors:
    • Julio Gonzalo,
    • Costantino Thanos,
    • M. Felisa Verdejo,
    • Rafael C. Carrasco

    Publisher

    Springer-Verlag

    Berlin, Heidelberg

    Publication History

    Published: 17 September 2006

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 11 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2019)CiteSeerXProceedings of the Conference on Artificial Intelligence for Data Discovery and Reuse10.1145/3359115.3359119(1-4)Online publication date: 13-May-2019
    • (2016)Research-paper recommender systemsInternational Journal on Digital Libraries10.1007/s00799-015-0156-017:4(305-338)Online publication date: 1-Nov-2016
    • (2010)SeerSuiteProceedings of the 2010 USENIX conference on Web application development10.5555/1863166.1863180(14-14)Online publication date: 23-Jun-2010
    • (2009)Conceptual recommender system for CiteSeerXProceedings of the third ACM conference on Recommender systems10.1145/1639714.1639758(241-244)Online publication date: 23-Oct-2009
    • (2007)ChemXSeerProceedings of the ACM first workshop on CyberInfrastructure: information management in eScience10.1145/1317353.1317356(7-10)Online publication date: 9-Nov-2007

    View Options

    View options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media