DOI: 10.1145/2463676.2465269
demonstration

Packing experiments for sharing and publication

Published: 22 June 2013

Abstract

Reproducibility is a core component of the scientific process. Revisiting and reusing past results allow science to move forward by "standing on the shoulders of giants", as Newton once said. An impediment to the adoption of computational reproducibility is that authors find it difficult to generate a compendium that includes every component required to reproduce their experiments correctly. Even when a compendium is available, reviewers and readers may have difficulty verifying the results on platforms other than those on which the experiments were originally run. As a step towards simplifying the process of creating reproducible experiments, we have developed ReproZip, a tool that automatically captures the provenance of experiments and packs all the necessary files, library dependencies, and variables to reproduce the results. Reviewers can then unpack and run the experiments without having to install any additional software. We will demonstrate real use cases for ReproZip, how packages are created, and how reviewers can validate and explore experiments.
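
The pack-and-unpack idea in the abstract can be illustrated with a minimal Python sketch. This is not ReproZip's implementation or interface: the abstract says ReproZip captures provenance automatically, whereas this sketch assumes the caller already knows which files and environment variables matter. The function names (`pack_experiment`, `unpack_and_verify`), the `manifest.json` layout, and the `files/` prefix are all hypothetical, chosen only for illustration.

```python
import hashlib
import io
import json
import os
import tarfile
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Return the SHA-256 hex digest of a file's contents."""
    return hashlib.sha256(path.read_bytes()).hexdigest()


def pack_experiment(files, env_names, archive_path):
    """Write the given files plus a JSON manifest into a gzip-compressed tar.

    The manifest (a hypothetical format) records each file's checksum and
    the current values of the requested environment variables. Files are
    stored by base name, so two inputs with the same name would collide.
    """
    files = [Path(f) for f in files]
    manifest = {
        "files": {f.name: sha256_of(f) for f in files},
        "env": {name: os.environ.get(name, "") for name in env_names},
    }
    payload = json.dumps(manifest, indent=2, sort_keys=True).encode("utf-8")
    with tarfile.open(str(archive_path), "w:gz") as tar:
        for f in files:
            tar.add(str(f), arcname=f"files/{f.name}")
        # Add the manifest from memory rather than from a file on disk.
        info = tarfile.TarInfo(name="manifest.json")
        info.size = len(payload)
        tar.addfile(info, io.BytesIO(payload))
    return manifest


def unpack_and_verify(archive_path, dest_dir):
    """Extract the archive and check every file against the manifest."""
    dest = Path(dest_dir)
    with tarfile.open(str(archive_path), "r:gz") as tar:
        tar.extractall(str(dest))
    manifest = json.loads((dest / "manifest.json").read_text(encoding="utf-8"))
    for name, digest in manifest["files"].items():
        if sha256_of(dest / "files" / name) != digest:
            raise ValueError(f"checksum mismatch for {name}")
    return manifest
```

A reviewer-side check would call `unpack_and_verify` on the received archive and compare the recorded environment values with the local ones before rerunning anything. Capturing library dependencies, which the abstract also mentions, is outside the scope of this sketch.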





    Published In

    SIGMOD '13: Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
    June 2013
    1322 pages
    ISBN:9781450320375
    DOI:10.1145/2463676
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]


    Publisher

    Association for Computing Machinery

    New York, NY, United States


    Author Tags

    1. computational reproducibility
    2. provenance
    3. reprozip

    Qualifiers

    • Demonstration

    Conference

    SIGMOD/PODS'13

    Acceptance Rates

    SIGMOD '13 Paper Acceptance Rate 76 of 372 submissions, 20%;
    Overall Acceptance Rate 785 of 4,003 submissions, 20%

    Article Metrics

    • Downloads (last 12 months): 4
    • Downloads (last 6 weeks): 0
    Reflects downloads up to 31 Dec 2024

    Cited By

    • (2023) Towards an IDE for Scientific Computational Experiments. 2023 IEEE Symposium on Visual Languages and Human-Centric Computing (VL/HCC), pages 290-292. DOI: 10.1109/VL-HCC57772.2023.00056. Online publication date: 3-Oct-2023
    • (2023) Can Image Data Facilitate Reproducibility of Graphics and Visualizations? Toward a Trusted Scientific Practice. IEEE Computer Graphics and Applications, 43:2, pages 89-100. DOI: 10.1109/MCG.2023.3241819. Online publication date: 1-Mar-2023
    • (2019) Computational reproducibility of scientific workflows at extreme scales. International Journal of High Performance Computing Applications, 33:5, pages 763-776. DOI: 10.1177/1094342019839124. Online publication date: 1-Sep-2019
    • (2018) Provenance in Workflows. Encyclopedia of Database Systems, pages 2912-2916. DOI: 10.1007/978-1-4614-8265-9_80745. Online publication date: 7-Dec-2018
    • (2017) Managing Provenance of Implicit Data Flows in Scientific Experiments. ACM Transactions on Internet Technology, 17:4, pages 1-22. DOI: 10.1145/3053372. Online publication date: 18-Aug-2017
    • (2017) Clouds and Reproducibility: A Way to Go to Scientific Experiments? Cloud Computing, pages 127-151. DOI: 10.1007/978-3-319-54645-2_5. Online publication date: 3-Jun-2017
    • (2016) ReproZip. Proceedings of the 2016 International Conference on Management of Data, pages 2085-2088. DOI: 10.1145/2882903.2899401. Online publication date: 26-Jun-2016
    • (2015) Using a suite of ontologies for preserving workflow-centric research objects. Web Semantics: Science, Services and Agents on the World Wide Web, 32:C, pages 16-42. DOI: 10.1016/j.websem.2015.01.003. Online publication date: 1-May-2015
    • (n.d.) Using a Suite of Ontologies for Preserving Workflow-Centric Research Objects. SSRN Electronic Journal. DOI: 10.2139/ssrn.3199184
