Abstract
Database schema upgrades are common in modern information systems, where the provenance of the schema is of much interest, and actually required to explain the provenance of contents generated by the database conversion that is part of such upgrades. Thus, an integrated management for data and metadata is needed, and the Archived Metadata and Provenance Manager (AM&PM) system is the first to address this requirement by building on recent advances in schema mappings and database upgrade automation. Therefore AM&PM (i) extends the Information Schema with the capability of archiving the provenance of the schema and other metadata, (ii) provides a timestamp based representation for the provenance of the actual data, and (iii) supports powerful queries on the provenance of the data and on the history of the metadata. In this paper, we present the design and main features of AM&PM, and the results of various experiments to evaluate its performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
California department of transofrmation-highway conditions, http://www.dot.ca.gov/hq/roadinfo
Wikipedia:database download, http://en.wikipedia.org/wiki/Wikipedia:Database_download
Alexe, B., Chiticariu, L., Tan, W.C.: Spider: a schema mapping debugger. In: VLDB, pp. 1179–1182 (2006)
Bose, R., Frew, J.: Lineage retrieval for scientific data processing: a survey. ACM Comput. Surv. 37(1), 1–28 (2005)
Buneman, P., Khanna, S., Tan, W.-C.: Why and Where: A Characterization of Data Provenance. In: Van den Bussche, J., Vianu, V. (eds.) ICDT 2001. LNCS, vol. 1973, pp. 316–330. Springer, Heidelberg (2000)
Buneman, P., Tan, W.C.: Provenance in databases. In: SIGMOD Conference, pp. 1171–1173 (2007)
Chiticariu, L., Tan, W.C.: Debugging schema mappings with routes. In: VLDB, pp. 79–90 (2006)
Curino, C., Zaniolo, C.: Pantha rhei benchmark datasets, http://yellowstone.cs.ucla.edu/schema-evolution/index.php/Benchmark_home
Curino, C., Moon, H.J., Tanca, L., Zaniolo, C.: Schema evolution in wikipedia - toward a web information system benchmark. In: ICEIS, pp. 323–332 (2008)
Curino, C.A., Moon, H.J., Zaniolo, C.: Graceful database schema evolution: the prism workbench. Proc. VLDB Endow. 1(1), 761–772 (2008)
Curino, C.A., Moon, H.J., Deutsch, A., Zaniolo, C.: Update rewriting and integrity constraint maintenance in a schema evolution support system: Prism++. Proc. VLDB Endow. 4(2), 117–128 (2010)
Eisenberg, A., Melton, J., Kulkarni, K.G., Michels, J.-E., Zemke, F.: Sql: 2003 has been published. SIGMOD Record 33(1), 119–126 (2004)
Green, T.J., Karvounarakis, G., Tannen, V.: Provenance semirings. In: PODS, pp. 31–40 (2007)
Ikeda, R., Salihoglu, S., Widom, J.: Provenance-based refresh in data-oriented workflows. In: CIKM, pp. 1659–1668 (2011)
Simmhan, Y.L., Plale, B., Gannon, D.: A survey of data provenance in e-science. SIGMOD Rec. 34(3), 31–36 (2005)
SQL/XML, http://www.sqlx.org/
Srivastava, D., Velegrakis, Y.: Intensional associations between data and metadata. In: SIGMOD Conference, pp. 401–412 (2007)
Tan, W.C.: Provenance in databases: Past, current, and future. IEEE Data Eng. Bull. 30(4), 3–12 (2007)
Wang, F., Zaniolo, C., Zhou, X.: Archis: an xml-based approach to transaction-time temporal database systems. The VLDB Journal 17(6), 1445–1463 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gao, S., Zaniolo, C. (2012). Supporting Database Provenance under Schema Evolution. In: Castano, S., Vassiliadis, P., Lakshmanan, L.V., Lee, M.L. (eds) Advances in Conceptual Modeling. ER 2012. Lecture Notes in Computer Science, vol 7518. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33999-8_9
Download citation
DOI: https://doi.org/10.1007/978-3-642-33999-8_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33998-1
Online ISBN: 978-3-642-33999-8
eBook Packages: Computer ScienceComputer Science (R0)