[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/133160.133203acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
Article
Free access

A loosely-coupled integration of a text retrieval system and an object-oriented database system

Published: 01 June 1992 Publication History

Abstract

Document management systems are needed for many business applications. This type of system would combine the functionality of a database system, (for describing, storing and maintaining documents with complex structure and relationships) with a text retrieval system (for effective retrieval based on full text). The retrieval model for a document management system is complicated by the variety and complexity of the objects that are represented. In this paper, we describe an approach to complex object retrieval using a probabilistic inference net model, and an implementation of this approach using a loose coupling of an object-oriented database system (IRIS) and a text retrieval system based on inference nets (INQUERY). The resulting system is used to store long, structured documents and can retrieve document components (sections, figures, etc.) based on their contents or the contents of related components. The lessons learnt from the implementation are discussed.

References

[1]
J. Annevelink. Database programming languages: A functional approach. In 1991 A GM SIGMOD I~ter~ational Conference on Manageraent of Data, pages 318-327, 1991.
[2]
J. Banerjee, H. Chou, J.F. Garza, W. Kim, D. Woelk, and N. BaUou. Data model issues for object-oriented applications. A CM Transactions on Office lr~formatior~ Systems, 5(1):3-26, 1987.
[3]
David C. Blair. An extended relational retrieval model. Information Processing and Management, 24(3):349-371, 19s8
[4]
J. P. Callan, W.B. Croft, and S.M. Harding. The INQUEKY retrieval system. Technical report, Department of Computer Science, University of Massachusetts, Amherst, MA 01003, 1992.
[5]
M.P. Consens and A.O. Mendelzon. Expressing structural hypertext queries in graphlog. In Proceedings of Hypertezt 89, pages 269-292, 1989.
[6]
W. B. Croft, H.R. Turtle, and D.D. Lewis. The use of phrases and structured queries in information retrieval. In Proceedings of the A CM SIGIR Conference on Research and Development in Information Retrieval, pages 32-45, 1991.
[7]
W. Bruce Croft, R. Krovetz, and H. R. Turtle. Interactive retrieval of complex documents. Information Processing and Management, 26(5):593-613, 1990.
[8]
W. Bruce Croft and Howard Turtle. Retrieval of complex objects. In Proceedings of EDB T 92, 1991. (to appear).
[9]
D.H. Fishman. Overview of the Iris dbms. Hewlett Packard Technical Report, HPL-SAL-89-15, 1989.
[10]
Norbert Fuhr. A probabilistic framework for vague queries and imprecise information in databases. In Proceedings of VZDB 90, pages 696-707, 1990.
[11]
H. Garcia-Mohna and D. Porter. Supporting probabilistic data in n relational system. In Proceedings of EDBT, pages 60-74, 1990.
[12]
C. A. Lynch and M. Stonebraker. Extended user-defined indexing with apphcations to textual databases. In Proceedings of the Very Large Database Conference, pages 306-317, 1988.
[13]
I.A. Macleod and R.G. Crawford. Document retrieval as a database apphcation. Information Technology: Research and Development, 2:43-60, 1983.
[14]
D. Mater and :1. Stein. Development and implementation of an object-oriented dbms. In B. Shriver and P. Wegner, editors, Research Directions in Object.Oriented Programming, pages 355- 392. MIT Press, 1987.
[15]
Amihai Motro. VAGUE: A user interface to relational databases that permits vague queries. A CM Transactions of Office Information Systems, 6(3):187-214, July 1988.
[16]
Gerard Salton and Chris Buckley. Global text matching for information retrieval.Sczen.c~, 253-1012-1015, 1991.
[17]
Gerard Salton and Michael 3. McGi}l. Introduction to Modern Information Retrieval. McGraw-Hill, 1983.
[18]
H.3. Schek. Methods for the administration of textual data in database systems. In C.J. Van Rijsbergen, R.N. Oddy, and P.W. Williams, editors, Research and Development in Information l~etrieval, pages 218-235, t981.
[19]
Howard R. Turtle. Inference Networks for Document Retrieval. PhD thesis, University of Massachusetts at Amherst, 1990.
[20]
H.R. Turtle and W.B. Croft. Evaluation of an inference network-based retrieval model. ACM Transactions on Information Systems, 9(3):187- 222, 1991.
[21]
K. Wilkinson, P. Lyngbaek, and W. Hasan. The Iris architecture and implementation. IEEE Transactions on Knowledge and Data Engineering, 2(1)'63-75, 1990.
[22]
S. B. Zdonik and D. Mater. Readings in Object- Oriented Database Systems. Morgan Kaufmann, San Mateo, CA, 1990.

Cited By

View all
  • (2017)Improving Document Retrieval in Large Domain Specific Textual Databases Using Lexical ResourcesTransactions on Computational Collective Intelligence XXVI10.1007/978-3-319-59268-8_8(162-185)Online publication date: 15-Jun-2017
  • (2012)Improving Customer Relationship Management through Integrated Mining of Heterogeneous DataInternational Journal of Computer Theory and Engineering10.7763/IJCTE.2012.V4.523(518-522)Online publication date: 2012
  • (2006)The PENG SystemProceedings of the 17th International Conference on Database and Expert Systems Applications10.1109/DEXA.2006.136(445-449)Online publication date: 4-Sep-2006
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '92: Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
June 1992
352 pages
ISBN:0897915232
DOI:10.1145/133160
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 June 1992

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Conference

SIGIR92
Sponsor:
  • SIGIR
  • Royal School of Lib.

Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)57
  • Downloads (Last 6 weeks)16
Reflects downloads up to 15 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2017)Improving Document Retrieval in Large Domain Specific Textual Databases Using Lexical ResourcesTransactions on Computational Collective Intelligence XXVI10.1007/978-3-319-59268-8_8(162-185)Online publication date: 15-Jun-2017
  • (2012)Improving Customer Relationship Management through Integrated Mining of Heterogeneous DataInternational Journal of Computer Theory and Engineering10.7763/IJCTE.2012.V4.523(518-522)Online publication date: 2012
  • (2006)The PENG SystemProceedings of the 17th International Conference on Database and Expert Systems Applications10.1109/DEXA.2006.136(445-449)Online publication date: 4-Sep-2006
  • (2005)The TEXTURE benchmarkProceedings of the 31st international conference on Very large data bases10.5555/1083592.1083631(313-324)Online publication date: 30-Aug-2005
  • (2002)Mapping DTDs to object-oriented schemasProceedings of the Second International Conference on Web Information Systems Engineering10.1109/WISE.2001.996477(161-170)Online publication date: 2002
  • (2002)Combining Pat-Trees and Signature Files for Query Evaluation in Document DatabasesDatabase and Expert Systems Applications10.1007/3-540-48309-8_44(473-484)Online publication date: 18-Jun-2002
  • (1998)Layered index structures in document database systemsProceedings of the seventh international conference on Information and knowledge management10.1145/288627.288688(406-413)Online publication date: 1-Nov-1998
  • (1998)Removal of redundancy in documents retrieved from different resourcesProceedings Tenth IEEE International Conference on Tools with Artificial Intelligence (Cat. No.98CH36294)10.1109/TAI.1998.744799(112-119)Online publication date: 1998
  • (1998)A model and a visual query language for structured textProceedings. String Processing and Information Retrieval: A South American Symposium (Cat. No.98EX207)10.1109/SPIRE.1998.712977(7-13)Online publication date: 1998
  • (1997)A data modeling approach to the seamless information exchange among structured documents and databasesProceedings of the 1997 ACM symposium on Applied computing10.1145/331697.331712(78-87)Online publication date: 1-Apr-1997
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media