Abstract
In this paper, we present a data and an execution model that allow for efficient storage and retrieval of XML documents in a relational database. The data model is strictly based on the notion of binary associations: by decomposing XML documents into small, flexible and semantically homogeneous units we are able to exploit the performance potential of vertical fragmentation. Moreover, our approach provides clear and intuitive semantics, which facilitates the definition of a declarative query algebra. Our experimental results with large collections of XML documents demonstrate the effectiveness of the techniques proposed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
S. Abiteboul, D. Quass, J. McHugh, J. Widom, and J. L. Wiener. The Lorel Query Language for Semistructured Data. International Journal on Digital Libraries, 1(1):68–88, 1997.
C. Beeri and Y. Tzaban. SAL: An Algebra for Semistructured Data and XML. In International Workshop on the Web and Databases, pages 37–42, Pennsylvania, USA, 1999.
P. A. Boncz and M. L. Kersten. MIL Primitives for Querying a Fragmented World. The VLDB Journal, 8(2):101–119, 1999.
A. Bonifati and S. Ceri. Comparative Analysis of Five XML Query Languages. ACM SIGMOD Record, 1(29):68–79, 2000.
J. Bosak. Sample XML documents. shakespeare.1.01.xml.zip, available at ftp://sunsite.unc.edu/pub/sun-info/standards/xml/eg/.
P. Buneman, S. B. Davidson, G. G. Hillebrand, and D. Suciu. A Query Language and Optimization Techniques for Unstructured Data. In Proc. of the ACM SIGMOD Int’l. Conf. on Management of Data, pages 505–516, Montreal, Canada, 1996.
A. Deutsch, M. F. Fernandez, and D. Suciu. Storing Semistructured Data with STORED. In Proc. of the ACM SIGMOD Int’l. Conf. on Management of Data, pages 431–442, Philadephia, PA, USA, 1999.
M. Dyck. The GNU version of The Collaborative International Dictionary of English, presented in the Extensible Markup Language. Available at http://metalab.unc.edu/webster/.
D. Florescu and D. Kossmann. Storing and Querying XML Data Using an RDBMS. Data Engineering Bulletin, 22(3), 1999.
R. Goldman and J. Widom. Dataguides: Enabling Query Formulation and Optimization in Semistructured Databases. In Proc. of the Int’l. Conf. on Very Large Data Bases, pages 436–445, Athens, Greece, 1997.
C. Kanne and G. Moerkotte. Efficient Storage of XML Data. In Proceedings of the 16th International Conference on Data Engineering, page 198, 2000.
M. Ley. DBLP Bibliography. http://www.informatik.uni-trier.de:8000/~ley/db/.
J. McHugh, S. Abiteboul, R. Goldman, D. Quass, and J. Widom. Lore: A Database Management System for Semistructured Data. ACM SIGMOD Record, 3(26), 1997.
J. Shanmugasundaram, K. Tufte, G. He, C. Zhang, D. DeWitt, and J. Naughton. Relational Databases for Querying XML Documents: Limitations and Opportunities. In Proc. of the Int’l. Conf. on Very Large Data Bases, pages 302–314, Edinburgh, UK, 1999.
T. Shimura, M. Yoshikawa, and S. Uemura. Storage and Retrieval of XML Documents Using Object-Relational Databases. In Database and Expert Systems Applications, pages 206–217. Springer, 1999.
Software AG. Tamino-Technical Description. Available at http://www.softwareag.com/tamino/technical/description.htm.
R. van Zwol, P. Apers, and A. Wilschutz. Implementing semi-structured data with MOA. In Workshop on Query Processing for Semistructured data and Non-Standard Data Formats (in conjunction with ICDT), 1999.
W3C. Document Object Model (DOM). Available at http://www.w3.org/DOM/.
W3C. Extensible Markup Language (XML) 1.0. Available at http://www.w3.org/TR/1998/REC-xml-19980210.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Schmidt, A., Kersten, M., Windhouwer, M., Waas, F. (2001). Efficient Relational Storage and Retrieval of XML Documents. In: Goos, G., Hartmanis, J., van Leeuwen, J., Suciu, D., Vossen, G. (eds) The World Wide Web and Databases. WebDB 2000. Lecture Notes in Computer Science, vol 1997. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45271-0_9
Download citation
DOI: https://doi.org/10.1007/3-540-45271-0_9
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41826-9
Online ISBN: 978-3-540-45271-3
eBook Packages: Springer Book Archive