Providing Flexible Tradeoff for Provenance Tracking

Liwei Wang²³,
Henning Köehler²⁴,
Ke Deng²⁴,
Xiaofang Zhou²⁴ &
…
Shazia Sadiq²⁴

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6724))

Included in the following conference series:

International Conference on Web Information Systems Engineering

1010 Accesses

Abstract

The description of the origins of a piece of data and the transformations by which it arrived in a database is called data provenance, lineage or pedigree. The two major approaches to represent provenance information use annotations and inversion. Annotations are flexible in representing diverse provenance metadata but the complete provenance data may outsize the data itself. The inversion method is concise by using a single inverse query or function but the provenance needs to be computed on-the-fly which can be expensive. This paper proposes a new approach of provenance storage which combines the two methods and is adaptive to storage constraint.

Supported by Specialized Research Fund for the Doctoral Program of Higher Education of China (No.200804861067) and the Special Fund for Basic Scientific Research of Central Colleges, Wuhan University.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: GBP 19.95; Price includes VAT (United Kingdom)

eBook: GBP 35.99; Price includes VAT (United Kingdom)

Softcover Book: GBP 44.99; Price includes VAT (United Kingdom)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Incremental Inference of Provenance Types

Provenance and Privacy

A systematic review of provenance systems

Article 17 February 2018

References

Bhagwat, D., Chiticariu, L., Tan, W.-C.: An annotation management system for relational databases. In: Proceedings of the Thirtieth International Conference on Very Large Data Bases, pp. 900–911 (August 2004)
Google Scholar
Cui, Y., Widom, J., Wiener, J.L.: Tracing the lineage of view data in a warehousing environment. ACM Transactions on Database Systems 25(2), 179–227 (2000)
Article Google Scholar
Buneman, P., Khanna, S., Tan, W.-C.: On propagation of deletions and annotations through views. In: Proceedings of The Twenty-first ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, pp. 150–158 (June 2002)
Google Scholar
Geerts, F., Kementsietsidis, A., Milano, D.: Mondrian: Annotating and querying databases through colors and blocks. In: Proceedings of the Twenty-second International Conference on Data Engineering, p. 82 (April 2006)
Google Scholar
Srivastava, D., Velegrakis, Y.: Intensional associations between data and metadata. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 401–412 (June 2007)
Google Scholar
Chapman, A.P., Jagadish, H.V., Ramanan, P.: Efficient provenance storage. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 993–1006 (June 2008)
Google Scholar
Foster, I., Vöckler, J., Wilde, M.: Chimera: A virtual data system for representing, querying, and automating data derivation. In: Proceeding of the 14th Conference on Scientific and Statistical Data Management, pp. 37–46 (2002)
Google Scholar
Saad, Y.: Sparskit: a basic tool kit for sparse matrix computations. University of Illinois, Tech. Rep. CSRD TR 1029 (1990)
Google Scholar
Koster, J.: Parallel templates for numerical linear algebra, a high-performance computation library. Master’s thesis (July 2002)
Google Scholar
Hossain, S.: On efficient storage of sparse matrices. In: Computing by the Numbers: Algorithms, Precision, and Complexity Matheon Workshop (2006)
Google Scholar
Isenburg, M., Lindstrom, P., Snoeyink, J.: Lossless compression of predicted floating-point geometry, pp. 869–877 (July 2005)
Google Scholar
Mechelen, V., Bock, H.-H., Boeck, P.D.: Two mode clustering methods: a structured overview. Statistical Methods in Medical Research 13(5), 363–394 (2004)
Article MATH MathSciNet Google Scholar
McCormick, W.T., Schweitzer, P.J., White, T.W.: Problem Decomposition and Data Reorganization by a Clustering Technique. Operations Research 20(5), 993–1009 (1972)
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

Wuhan University, China
Liwei Wang
The University of Queensland, Australia
Henning Köehler, Ke Deng, Xiaofang Zhou & Shazia Sadiq

Authors

Liwei Wang
View author publications
You can also search for this author in PubMed Google Scholar
Henning Köehler
View author publications
You can also search for this author in PubMed Google Scholar
Ke Deng
View author publications
You can also search for this author in PubMed Google Scholar
Xiaofang Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Shazia Sadiq
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dickson Computer Systems, 7A Victory Avenue 4/F Homantin, Kowloon, Hong Kong, China
Dickson K. W. Chiu
Ecole Nationale Supérieure de Mécanique et d’Aréotechnique, Laboratoire d’Informatique Scientifique et Industrielle, Téléport 2 - avenue Clément Ader, 86961, Futuroscope Chasseneuil Cedex, France
Ladjel Bellatreche
Dept. of Computer Science and Engineering, Ritsumeikan University, Wakakusa 6-4-10, 525-0045, Kusatu, Shiga, Japan
Hideyasu Sasaki
Department of Computer Science and Engineering, The Chinese University of Hong Kong, Sha Tin, Hong Kong, China
Ho-fung Leung
Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China
Shing-Chi Cheung
School of Computer Science, Hangshou Dianzi University, Xiasha Higher Education Zone, 310018, Hanshou City, Zhejiang, China
Haiyang Hu
Department of Computer Science and Software Engineering, The University of Melbourne, 3010, Parkville, Victoria, Australia
Jie Shao

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, L., Köehler, H., Deng, K., Zhou, X., Sadiq, S. (2011). Providing Flexible Tradeoff for Provenance Tracking. In: Chiu, D.K.W., et al. Web Information Systems Engineering – WISE 2010 Workshops. WISE 2010. Lecture Notes in Computer Science, vol 6724. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24396-7_18

Download citation

DOI: https://doi.org/10.1007/978-3-642-24396-7_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24395-0
Online ISBN: 978-3-642-24396-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics