[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Relational support for flexible schema scenarios

Published: 01 August 2008 Publication History

Abstract

Efficient support for applications that deal with data heterogeneity, hierarchies and schema evolution is an important challenge for relational engines. In this paper we show how this flexibility can be handled in Microsoft SQL Server. For this purpose, the engine has been equipped in an integrated package of relational extensions. The package includes sparse storage, column set operations, filtered indices, filtered statistics and hierarchy querying with OrdPath labeling. In addition, economical loading of metadata allow us to answer queries independently of the number of columns in a table and drastically improve scaling capabilities. The design of a prototypical content and collaboration application based on a wide table is described, along with experiments validating its performance.

References

[1]
D. J. Abadi. Column stores for wide and sparse data. In CIDR, Asilomar, CA, USA, 2007.
[2]
R. Agrawal, A. Somani, and Y. Xu. Storage and querying of e-commerce data. In VLDB, pages 149--158, San Francisco, CA, USA, 2001. Morgan Kaufmann Publishers Inc.
[3]
J. L. Beckmann, A. Halverson, R. Krishnamurthy, and J. F. Naughton. Extending RDBMSs to support sparse datasets using an interpreted attribute storage format. In ICDE, page 58, Washington, DC, USA, 2006. IEEE Computer Society.
[4]
N. Bruno and S. Chaudhuri. Exploiting statistics on query expressions for optimization. In SIGMOD '02: Proceedings of the 2002 ACM SIGMOD international conference on Management of data, pages 263--274, New York, NY, USA, 2002. ACM.
[5]
N. Bruno and S. Chaudhuri. Conditional selectivity for statistics on query expressions. In SIGMOD '04: Proceedings of the 2004 ACM SIGMOD international conference on Management of data, pages 311--322, New York, NY, USA, 2004. ACM.
[6]
E. Chu, J. Beckmann, and J. Naughton. The case for a wide-table approach to manage sparse relational data sets. In SIGMOD, pages 821--832, New York, NY, USA, 2007. ACM Press.
[7]
G. G. Conor Cunningham, Cesar A. Galindo-Legaria. Pivot and unpivot: Optimization and execution strategies in an RDBMS. In VLDB, Toronto, Canada, 2004.
[8]
T. C. A. D. J. F. N. Eric Chu, Akanksha Baid. A relational approach to incrementally extracting and querying structure in unstructured data. In VLDB, pages 1045--1056, 2007.
[9]
C. A. Galindo-legaria, M. M. Joshi, F. Waas, and M.-c. Wu. Statistics on views. In VLDB, Berlin, Germany, 2003.
[10]
C. A. Galindo-Legaria, S. Stefani, and F. Waas. Query processing for sql updates. In SIGMOD, pages 844--849, New York, NY, USA, 2004. ACM Press.
[11]
T. Härder, M. Haustein, C. Mathis, and M. Wagner. Node labeling schemes for dynamic xml documents reconsidered. Data Knowl. Eng., 60(1):126--149, 2007.
[12]
A. Kumaran and P. Carlin. Multi-lingual semantic matching with ordpath in relational systems. IEEE Data Eng. Bull., 30(1):44--56, 2007.
[13]
P. O'Neil, E. O'Neil, S. Pal, I. Cseri, G. Schaller, and N. Westbury. Ordpaths: insert-friendly xml node labels. In SIGMOD, pages 903--908, New York, NY, USA, 2004. ACM Press.
[14]
J. Parsons and Y. Wand. Emancipating instances from the tyranny of classes in information modeling. ACM Trans. Database Syst., 25(2):228--268, 2000.
[15]
C. Sartori and M. R. Scalas. Partial indexing for nonuniform data distributions in relational dbms's. IEEE Transactions on Knowledge and Data Engineering, 6(3):420--429, 1994.
[16]
P. Seshadri and A. N. Swami. Generalized partial indexes. In ICDE '95: Proceedings of the Eleventh International Conference on Data Engineering, pages 420--427, Washington, DC, USA, 1995. IEEE Computer Society.
[17]
M. J. B. K. S. S. P. T. Srini Acharya, Cesar Galindo-Legaria. Filtered indices and their use in flexible schema scenarios. In ICDE, pages 903--908, Cancun, Mexico, 2008.
[18]
M. Stonebraker. The case for partial indexes. SIGMOD Rec., 18(4):4--11, 1989.
[19]
M. Stonebraker, R. H. Katz, D. A. Patterson, and J. K. Ousterhout. The design of xprs. In VLDB '88: Proceedings of the 14th International Conference on Very Large Data Bases, pages 318--330, San Francisco, CA, USA, 1988. Morgan Kaufmann Publishers Inc.
[20]
I. Tatarinov, S. D. Viglas, K. Beyer, J. Shanmugasundaram, E. Shekita, and C. Zhang. Storing and querying ordered xml using a relational database system. In SIGMOD, pages 204--215, New York, NY, USA, 2002. ACM Press.

Cited By

View all
  • (2014)Leveraging compression in the tableau data engineProceedings of the 2014 ACM SIGMOD International Conference on Management of Data10.1145/2588555.2595639(563-573)Online publication date: 18-Jun-2014
  • (2014)Mapping XML to a Wide Sparse TableIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2012.22126:6(1400-1414)Online publication date: 1-Jun-2014
  • (2014)Cinderella — Adaptive online partitioning of irregularly structured data2014 IEEE 30th International Conference on Data Engineering Workshops10.1109/ICDEW.2014.6818342(284-291)Online publication date: Mar-2014
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Proceedings of the VLDB Endowment
Proceedings of the VLDB Endowment  Volume 1, Issue 2
August 2008
461 pages

Publisher

VLDB Endowment

Publication History

Published: 01 August 2008
Published in PVLDB Volume 1, Issue 2

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)1
Reflects downloads up to 17 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2014)Leveraging compression in the tableau data engineProceedings of the 2014 ACM SIGMOD International Conference on Management of Data10.1145/2588555.2595639(563-573)Online publication date: 18-Jun-2014
  • (2014)Mapping XML to a Wide Sparse TableIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2012.22126:6(1400-1414)Online publication date: 1-Jun-2014
  • (2014)Cinderella — Adaptive online partitioning of irregularly structured data2014 IEEE 30th International Conference on Data Engineering Workshops10.1109/ICDEW.2014.6818342(284-291)Online publication date: Mar-2014
  • (2014)Flexible Relational Data Model – A Common Ground for Schema-Flexible Database SystemsAdvances in Databases and Information Systems10.1007/978-3-319-10933-6_3(25-38)Online publication date: 2014
  • (2012)Adaptive Index BufferProceedings of the 2012 IEEE 28th International Conference on Data Engineering Workshops10.1109/ICDEW.2012.39(308-314)Online publication date: 1-Apr-2012
  • (2012)Mapping XML to a Wide Sparse TableProceedings of the 2012 IEEE 28th International Conference on Data Engineering10.1109/ICDE.2012.24(630-641)Online publication date: 1-Apr-2012
  • (2009)Filtered statisticsProceedings of the 2009 ACM SIGMOD International Conference on Management of data10.1145/1559845.1559943(897-904)Online publication date: 29-Jun-2009
  • (2009)A comparison of flexible schemas for software as a serviceProceedings of the 2009 ACM SIGMOD International Conference on Management of data10.1145/1559845.1559941(881-888)Online publication date: 29-Jun-2009

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media