Abstract
Most existing statistical databases are mere collections of statistical files gathered for specific purposes. Consequently, as they grow in size, users are faced with difficulties in identifying and finding the data they need.
In order to obtain data descriptions independent of specific purposes, this paper proposes an object-oriented data design, which distinguishes between data conceptually obtainable and data actually stored in a database, and specifies relationships among classifications and categories independent of particular data files.
This is followed by a discussion of the representation of knowledge about data and classifications on a knowledge base, giving clear definitions of hierarchies and relationships among statistical data concepts.
Finally, a natural language query system using the knowledge base is demonstrated, which proves the advantage of the proposed statistical data concepts.
Preview
Unable to display preview. Download preview PDF.
References
ANSI/X3/SPARC, "Study Group on Data Base Management Systems: Interim Report," FDT (Bulletin of ACM-SIGMOD), 7(2), 1975.
R.J.Brackman, "What IS-A is and isn't: An Analysis of Taxonomic Links in Semantic Networks," IEEE Computer, Oct. 1983, pp.30–36.
P.Chan and A.Shoshani, "SUBJECT: A Directory Driven System for Organizing and Accessing Large Statistical Databases," VLDB, 1981, pp.553–563.
R.E.Cubitt, "Meta Data: An Experience of its Uses and Management," SSDBM, 1983, pp.167–169.
E.Malmborg, "On the Semantics of Aggregated Data," SSDBM, 1986, pp.152–158.
National Land Agency, Knowledge Management of Land Information, (in Japanese), Publication Bureau of the Ministry of Finance, Japan, 1986.
Z.M.Ozsoyoglu and G.Ozsoyoglu, "An Extension of Relational Algebra for Summary Tables," SSDBM, 1983, pp.202–211.
R.Reiter, "On Closed World Data Bases," in H.Gallaire and J.Minker (eds.), Logic and Data Bases, Plenum Press, 1978, pp.55–76.
H.Sato, T.Nakano, Y.Fukasawa and R.Hotaka, "Conceptual Schema for a Wide-Scope Statistical Database and Its Applications," SSDBM, 1986, pp.165–172.
H.Sato, Design and Development of Statistical Databases: An Application of Data Model and Knowledge Base, (in Japanese), Ohm Co., Japan, 1988, 246 pages.
A.Shoshani, "Statistical Databases: Characteristics, Problems and some Solutions," VLDB, 1982, pp.208–222.
J.M. Smith and D.C.P. Smith, "Database Abstractions: Aggregation and Generalization," TODS, 2(2), June 1977, pp.105–133.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1989 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Sato, H. (1989). A data model, knowledge base, and natural language processing for sharing a large statistical database. In: Rafanelli, M., Klensin, J.C., Svensson, P. (eds) Statistical and Scientific Database Management. SSDBM 1988. Lecture Notes in Computer Science, vol 339. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0027515
Download citation
DOI: https://doi.org/10.1007/BFb0027515
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-50575-4
Online ISBN: 978-3-540-46045-9
eBook Packages: Springer Book Archive