Abstract
A major problem that arises in many large application domains is the discrepancy among terminologies of different information systems. The terms used by the information systems of one organization may not agree with the terms used by another organization even when they are in the same domain. Such a situation clearly impedes communication and the sharing of information, and decreases the efficiency of doing business. Problems of this nature can be overcome using a controlled vocabulary (CV), a system of concepts that consolidates and unifies the terminologies of a domain. However, CVs are large and complex and difficult to comprehend. This paper presents a methodology for representing a semantic network-based CV as an object-oriented database (OODB). We call such a representation an Object-Oriented Vocabulary Repository (OOVR). The methodology is based on a structural analysis and partitioning of the source CV. The representation of a CV as an OOVR offers both the level of support typical of database management systems and an abstract view which promotes comprehension of the CV's structure and content. After discussing the theoretical aspects of the methodology, we apply it to the MED and InterMED, two existing CVs from the medical field. A program, called the OOVR Generator, for automatically carrying out our methodology is described. Both the MED-OOVR and the InterMED-OOVR have been created using the OOVR Generator, and each exists on top of ONTOS, a commercial OODBMS. The OOVR derived from the InterMED is presently available on the Web.
Similar content being viewed by others
References
American Society of Hospital Pharmacists, Bethesda, MD. American Hospital Formulary Service Drug Information, 1997. Updated annually.
R. J. Brachman. On the epistemological status of semantic networks. In N. V. Findler, editor, Associative Networks: Representation and Use of Knowledge by Computers, pages 3-50. Academic Press, Inc., New York, NY, 1979.
S. K. Card. User limits and the VDT computer interface (excerpt). In R. M. Baecker and W. A. S. Buxton, editors, Readings in Human-Computer Interaction, pages 180-191. Morgan Kaufmann Publishers, Inc., Los Altos, CA, 1987.
M. J. Carey, D. J. DeWitt, J. E. Richardson, and E. J. Shekita. Storage management for objects in EXODUS. In W. Kim and F. H. Lochovsky, editors, Object-Oriented Concepts, Databases, and Applications, pages 341-369. ACM Press, New York, NY, 1989.
R. G. G. Cattell and D. K. Barry, editors. The Object Database Standard: ODMG 2.0. Morgan Kaufmann Publishers, Inc., San Francisco, CA, 1997.
P. P.-S. Chen. The Entity-Relationship Model: Toward a unified view of data. ACM Trans. Database Syst., 1(1):9-36, 1976.
J. J. Cimino. Personal communication, 1997.
J. J. Cimino, P. D. Clayton, G. Hripcsak, and S. B. Johnson. Knowledge-based approaches to the maintenance of a large controlled medical terminology. JAMIA, 1(1):35-50, 1994.
J. J. Cimino, G. Hripcsak, S. B. Johnson, and P. D. Clayton. Designing an introspective, multipurpose, controlled medical vocabulary. In Proc. Thirteenth Annual Symposium on Computer Applications in Medical Care, pages 513-517, Washington, DC, Nov. 1989.
College of American Pathologists, Skokie, IL. Systematized Nomenclature of Medicine, second edition, 1982.
O. Deux et al. The O2 system. Commun. ACM, 34(10):34-48, Oct. 1991.
D. H. Fischer. Consistency rules and triggers for thesauri. Int. Classif., 18(4):212-225, 1991.
D. H. Fischer. Consistency rules and triggers for multilingual terminology. In Proc. TKE'93, Terminology and Knowledge Engineering, pages 333-342, 1993.
GemStone Systems, Inc. URL: http://www.gemstone.com.
C. A. Goble, A. J. Glowinski, W. A. Nolan, and A. L. Rector. A descriptive semantic formalism for medicine. In Proc. 9th ICDE, pages 624-631, Vienna, Austria, 1993.
H. Gu, J. Cimino, M. Halper, J. Geller, and Y. Perl. Utilizing OODB schema modeling for vocabulary management. In J. Cimino, editor, Proc. 1996 AMIA Annual Fall Symposium, pages 274-278, Washington, DC, Oct. 1996.
H. Gu, Y. Perl, J. Geller, M. Halper, J. Cimino, and M. Singh. Partitioning a vocabulary's IS-A hierarchy into trees. In D. R. Masys, editor, Proc. 1997 AMIA Annual Fall Symposium, pages 630-634, Nashville, TN, Oct. 1997.
M. Halper, R. Galnares, J. Geller, and Y. Perl. An analogical, Web-based interface to an OODB medical vocabulary. In preparation.
M. Hammer and D. McLeod. Database description with SDM: A semantic database model. ACM Trans. Database Syst., 6(3):351-386, 1981.
R. Hull and R. King. Semantic database modeling: Survey, applications, and research issues. ACM Comput. Surv., 19(3):201-260, Sept. 1987.
N. Ide, J. L. Maitre, and J. Véronis. Outline of a model for lexical databases. Information Processing and Management, 29(2):159-186, 1993.
P. D. Karp, K. Myers, and T. Gruber. The generic frame protocol. In Proc. IJCAI-95, pages 768-774, Montreal, Canada, 1995.
P. D. Karp and S. M. Paley. Knowledge representation in the large. In Proc. IJCAI-95, pages 751-758, Montreal, Canada, 1995.
M. Kifer, W. Kim, and Y. Sagiv. Querying object-oriented databases. In Proc. 1992 ACM SIGMOD Conference on Management of Data, San Diego, CA, June 1992.
W. Kim and F. H. Lochovsky, editors. Object-Oriented Concepts, Databases, and Applications. ACM Press, New York, NY, 1989.
C. Lamb, G. Landis, J. Orenstein, and D. Weinreb. The ObjectStore database system. Commun. ACM, 34(10):50-63, Oct. 1991.
F. Lehmann. Semantic networks. In [28], pages 1-50.
F. Lehmann, editor. Semantic Networks in Artificial Intelligence. Pergamon Press, Tarrytown, NY, 1992.
D. B. Lenat and R. V. Guha. Building Large Knowledge-Based Systems: Representation and Inference in the Cyc Project. Addison-Wesley Publishing Co., Inc., Reading, MA, 1990.
L. Liu, M. Halper, H. Gu, J. Geller, and Y. Perl. Modeling a vocabulary in an object-oriented database. In K. Barker and M. T. Özsu, editors, CIKM-96, Proc. 5th Int'l Conference on Information and Knowledge Management, pages 179-188, Rockville, MD, Nov. 1996.
E. Mays, C. Apte, J. Griesmer, and J. Kastner. Experience with K-Rep: An object-centered knowledge representation language. In Proc. IEEE AI Application Conference, San Diego, CA, Mar. 1988.
G. A. Miller. The magical number seven, plus or minus two: Some limits on our capacity for processing information. Psych. Rev., 63, 1956.
G. A. Miller. WordNet: A lexical database for English. Commun. ACM, 38(11):39-41, 1995.
W. Möhr and L. Rostek. TEDI: An object-oriented terminology editor. In Proc. TKE'93, Terminology and Knowledge Engineering, pages 363-374, 1993.
J. Mylopoulos, A. Borgida, M. Jarke, and M. Koubarakis. Telos: Representing knowledge about information systems. TOIS, 8(4):325-362, 1990.
J. Mylopoulos, V. Chaudhri, D. Plexousakis, A. Shrufi, and T. Topaloglou. Building knowledge base management systems: A progress report. Technical Report DKBS-TR-94-4, Department of Computer Science, University of Toronto, 1994.
The National Drug Code Directory. URL: http://www.fda.gov/cder/ndc/index.htm.
N. F. Noy and C. D. Hafner. The state of the art in ontology design: A survey and comparative review. AI Magazine, 18(3):53-74, Fall 1997.
Welcome to ODI. URL: http://www.odi.com.
D. E. Oliver, E. H. Shortliffe, and InterMed Collaboratory. Collaborative model development for vocabulary and guidelines. In J. Cimino, editor, Proc. 1996 AMIA Annual Fall Symposium, page 826, Washington, DC, Oct. 1996.
ONTOS Home Page. URL: http://www.ontos.com.
ONTOS, Inc. Lowell, MA. ONTOS DB 3.1 Reference Manual, 1995.
The OOVR Browser. URL: http://object.njit.edu:2000/~newoohvr/JBI/INTERMED/InterTerms.html.
Object Technology by Ardent Software (O2 System). URL: http://www.ardentsoftware.com/object/index.html.
S. C. Shapiro and W. J. Rapaport. The SNePS family. In [28], pages 243-275.
E. H. Shortliffe, G. O. Barnett, J. Cimino, R. A. Greenes, S. M. Huff, and V. L. Patel. Collaborative medical informatics research using the Internet and the World Wide Web. In J. Cimino, editor, Proc. 1996 AMIA Annual Fall Symposium, pages 125-129, Washington, DC, Oct. 1996.
V. Soloviev. An overview of three commercial object-oriented database management systems: ONTOS, ObjectStore, and O2. SIGMOD Record, 21(1):93-104, Mar. 1992.
J. F. Sowa. Principles of Semantic Networks, Explorations in the Representation of Knowledge. Morgan Kaufmann Publishers, Inc., San Mateo, CA, 1991.
U. S. Department of Health and Human Services, National Institutes of Health, National Library of Medicine. Unified Medical Language System, 1996.
United States National Center for Health Statistics, Washington, DC. International Classification of Diseases: Ninth Revision, with Clinical Modifications, 1980.
P. Valduriez, S. Khoshafian, and G. Copeland. Implementation techniques of complex objects. In Proc. VLDB '86, pages 101-109, Kyoto, Japan, Aug. 1986.
Versant. URL: http://www.versant.com.
W. A. Woods. What's in a link: Foundations for semantic networks. In R. J. Brachman and H. J. Levesque, editors, Readings in Knowledge Representation, pages 218-241. Morgan Kaufmann Publishers, Inc., San Mateo, CA, 1985.
S. B. Zdonik and D. Maier, editors. Readings in Object-Oriented Database Systems. Morgan Kaufmann Publishers, Inc., San Mateo, CA, 1990.
J. Zhang. Application of OODB and SGML techniques in text database: An electronic dictionary system. SIGMOD Record, 24(1):3-8, Mar. 1995.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Liu, Lm., Halper, M., Geller, J. et al. Controlled Vocabularies in OODBs: Modeling Issues and Implementation. Distributed and Parallel Databases 7, 37–65 (1999). https://doi.org/10.1023/A:1008682210559
Issue Date:
DOI: https://doi.org/10.1023/A:1008682210559