DOI: 10.1145/872757.872800
Article

Scientific data repositories: designing for a moving target

Published: 09 June 2003

Abstract

Managing scientific data warehouses requires constant adaptation to cope with changes in processing algorithms, computing environments, database schemas, and usage patterns. We have faced this challenge in the RHESSI Experimental Data Center (HEDC), a data center for NASA's RHESSI spacecraft. In this paper we describe our experience in developing HEDC and discuss in detail the design choices we made. To accommodate the adaptations typically encountered in scientific data management systems, HEDC (i) clearly separates generic from domain-specific code in all tiers, (ii) uses a file system for the actual data in combination with a DBMS to manage the corresponding metadata, and (iii) revolves around a middle tier designed to scale when more browsing or processing power is required. These design choices are valuable contributions because they address concerns common to a wide range of scientific data management systems.
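Design choice (ii) can be illustrated with a minimal sketch. It is not HEDC's implementation: the `Repository` class, the `items` table, its columns, and the file names are hypothetical, and SQLite merely stands in for whatever DBMS a real system would use. The sketch only shows the general split, with bulk data written to plain files and a database table holding the searchable metadata plus a pointer to each file.

```python
import hashlib
import sqlite3
import tempfile
from pathlib import Path


class Repository:
    """Bulk data lives in files; a DBMS table holds only the metadata."""

    def __init__(self, data_dir: Path, db_path: str = ":memory:") -> None:
        self.data_dir = data_dir
        self.data_dir.mkdir(parents=True, exist_ok=True)
        self.db = sqlite3.connect(db_path)
        # Hypothetical metadata schema: one row per stored file.
        self.db.execute(
            "CREATE TABLE IF NOT EXISTS items ("
            " id INTEGER PRIMARY KEY,"
            " kind TEXT NOT NULL,"
            " path TEXT NOT NULL,"
            " size_bytes INTEGER NOT NULL,"
            " sha256 TEXT NOT NULL)"
        )

    def store(self, name: str, kind: str, payload: bytes) -> int:
        """Write the payload to the file system and record its metadata."""
        path = self.data_dir / name
        path.write_bytes(payload)
        cur = self.db.execute(
            "INSERT INTO items (kind, path, size_bytes, sha256)"
            " VALUES (?, ?, ?, ?)",
            (kind, str(path), len(payload),
             hashlib.sha256(payload).hexdigest()),
        )
        self.db.commit()
        return cur.lastrowid

    def find(self, kind: str) -> list[tuple[int, str, int]]:
        """Answer a browsing query from metadata alone; no file is opened."""
        rows = self.db.execute(
            "SELECT id, path, size_bytes FROM items"
            " WHERE kind = ? ORDER BY id",
            (kind,),
        )
        return rows.fetchall()

    def load(self, item_id: int) -> bytes:
        """Fetch the actual data by following the path stored in the DBMS."""
        row = self.db.execute(
            "SELECT path FROM items WHERE id = ?", (item_id,)
        ).fetchone()
        if row is None:
            raise KeyError(item_id)
        return Path(row[0]).read_bytes()


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        repo = Repository(Path(tmp) / "data")
        item_id = repo.store("obs_0001.bin", "raw", b"\x00\x01\x02")
        print(repo.find("raw"))
        print(repo.load(item_id))
```

In a layout like this, a schema change touches only the metadata table, while the files themselves stay where they are, which is one plausible reading of why such a split helps a repository cope with change.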


Published In

SIGMOD '03: Proceedings of the 2003 ACM SIGMOD international conference on Management of data
June 2003
702 pages
ISBN: 158113634X
DOI: 10.1145/872757
Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

  • Article

Conference

SIGMOD/PODS03
Acceptance Rates

SIGMOD '03 Paper Acceptance Rate 53 of 342 submissions, 15%;
Overall Acceptance Rate 785 of 4,003 submissions, 20%

Cited By

  • (2022) Maintaining Repositories, Databases, and Digital Collections in Memory Institutions: An Integrative Review. Proceedings of the Association for Information Science and Technology 59:1 (310-323). DOI: 10.1002/pra2.755. Online publication date: 14-Oct-2022
  • (2015) The DBMS - your big data sommelier. 2015 IEEE 31st International Conference on Data Engineering (1119-1130). DOI: 10.1109/ICDE.2015.7113361. Online publication date: Apr-2015
  • (2014) Pydron. Proceedings of the 11th USENIX conference on Operating Systems Design and Implementation (645-659). DOI: 10.5555/2685048.2685100. Online publication date: 6-Oct-2014
  • (2013) Turning scientists into data explorers. Proceedings of the 2013 SIGMOD/PODS Ph.D. symposium (25-30). DOI: 10.1145/2483574.2483580. Online publication date: 23-Jun-2013
  • (2013) Management and storage of in situ oceanographic data. Information Systems 38:3 (351-368). DOI: 10.1016/j.is.2012.10.004. Online publication date: 1-May-2013
  • (2013) Bibliography. Computation and Storage in the Cloud (109-113). DOI: 10.1016/B978-0-12-407767-6.00021-4. Online publication date: 2013
  • (2012) Data vaults. Proceedings of the 24th international conference on Scientific and Statistical Database Management (485-494). DOI: 10.1007/978-3-642-31235-9_32. Online publication date: 25-Jun-2012
  • (2011) A call to arms. ACM SIGMOD Record 40:3 (61-69). DOI: 10.1145/2070736.2070750. Online publication date: 17-Nov-2011
  • (2010) Epidemic marketplace. Proceedings of the First international conference on Information technology in bio- and medical informatics (31-44). DOI: 10.5555/1885247.1885251. Online publication date: 1-Sep-2010
  • (2010) ROARS. Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing (766-775). DOI: 10.1145/1851476.1851587. Online publication date: 21-Jun-2010
