[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/956863.956872acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

Multi-resolution modeling of large scale scientific simulation data

Published: 03 November 2003 Publication History

Abstract

To provide scientists and engineers with the ability to explore and analyze tera-scale size data-sets we are using a twofold approach. First, we model the data with the objective of creating a compressed yet manageable representation. Second, with that compressed representation, we provide the ability to query the resulting approximation in order to obtain approximate yet sufficient answers; a process called ad-hoc querying. This paper is concerned with a wavelet modeling technique that seeks to capture the important physical characteristics of the target scientific data. Our approach is driven by the compression, which is necessary for viable throughput, along with the end user requirements from the discovery process. Our work contrasts existing research which applies wavelets to range querying, change detection, and clustering problems by working directly with the wavelet decomposition of the data. The difference in this procedure is due primarily to the nature of the data and the requirements of the scientists and engineers. Our approach directly uses the wavelet coefficients of the data to compress as well as query. We describe how the wavelet decomposition is used to facilitate data compression and how queries are posed on the resulting compressed model. Results of this process will be shown for several problems of interest.

References

[1]
G. Abdulla, C. Baldwin, T. Critchlow, R. Kamimura, I. Lozares, R. Music, N. A. Tang, B. S. Lee, and R. Snapp. Approxima te ad-hoc query engine for simulation data. In Joint Conference on Digital Libraries JCDL-01, pages 255--256, June 2001.
[2]
K. Chakrabarti, M. Garofalakis, R. Rastogi, and K. Shim. Approximate query answering using wavelets. VLDB Journal,3, 2001.
[3]
I. Daubechies. Ten Lectures on Wavelets. SIAM,1992.
[4]
R. DeVore, B. Jawerth, and B. Lucier. Image compression through wavelet transform coding. IEEE Trans. Image Processing, 38:719--746, 1992.
[5]
R. DeVore, B. Jawerth, and V. Popov. Compression of wavelet decompositions. American Journal of Mathematics, 114:737-- 785, 1992.
[6]
U. M. Fayyad, D. Haussler, and P. E. Stolorz. KDD for science data analysis: Issues and examples. In Knowledge Discovery and Data Mining, pages 50--56, 1996.
[7]
M. L. Hilton, B. D. Jawerth, and A. Sengupta. Compressing still and moving images with wavelets. Multimedia Systems, 2(5):218--227, 1994.
[8]
B. Jawerth and W. Sweldens. An overview of wavelet based multiresolution analyses.SIAM Rev., 36(3):377--412,1994.
[9]
Jpeg home page. http://www.jpeg.org/JPEG2000.htm,1999.
[10]
E. Keogh, K. Chakrabarti, S. Mehrotra, and M. Pazzani. Locally adaptive dimensionality reduction for indexing large time series databases. In 2001 ACM SIGMOD Conference on Management of Data, My 2001.
[11]
S. Mallat. A Wavelet Tour of Signal Processing. Academic Press,1998.
[12]
F. Murtagh, J.-L. Starck, and M. W. Berry. Overcoming the curse of dimensionality in clustering by means of the wavelet transform. Computer Journal, 43(2):107--120,2000.
[13]
R. T. Ogden. Essential Wavelets for Statistical Applications and Data Analysis. Springer Verlag, 1996.
[14]
N. Ramakrishnan and A. Gram. Mining scientific data. In Advances in Computers, pages 119--169. Academic Press,2001.
[15]
R. R. Schmidt and C. Shahabi. Polap: A fast wavelet-based technique for progressive evaluation of olap queries. Technical Report 01-744, Department of Computer Science, University of Southern California, 2001.
[16]
C. Shahabi, S. Chung, and M. Safar. A wavelet-based approach to improve the efficiency of multi-level surprise mining. In PAKDD International Workshop on Mining Spatial and Temporal Data 2001,2001.
[17]
G. Sheikholeslami, S. Chatterjee, and A. Zhang. WaveCluster: A multi-resolution clustering approach for very large spatial databases. In Proceedings of the 24th Internation Conference Very Large Data Bases, VLDB, pages 428--439, 24--27 1998.
[18]
J.-L. Starck, F. Murtagh, and A. Bijaoui. Image Processing and Data Analysis: The Multiscale Approach. Cambridge University Press,1998.
[19]
G. Strang and T. Nguyen. Wavelets and Filter Banks. Wellesley Cambridge, 1996.
[20]
D. S. Taubman and M. W. Marcellin. JPEG2000: Image Compression Fundamentals, Standards, and Practice. Kluwer, 2001.
[21]
A. Uhl. Wavelets and digital image compression I. Technical Report RIST++04/93, University of Salzburg,1993.
[22]
B. Vidakovic. Statistical Modeling by Wavelets. Wiley, 1999.
[23]
D. Yu, S. Chatterjee, G. Sheikholeslami, and Z. Aidong. Efficiently detecting arbitrary shaped clusters in very large datasets with high dimensions. Technical Report 98-08, Department of Computer Science and Engineering, SUNY Buffalo, 1998.

Cited By

View all
  • (2019)Spatially-aware Parallel I/O for Particle DataProceedings of the 48th International Conference on Parallel Processing10.1145/3337821.3337875(1-10)Online publication date: 5-Aug-2019
  • (2017)Reducing Network Congestion and Synchronization Overhead During Aggregation of Hierarchical Data2017 IEEE 24th International Conference on High Performance Computing (HiPC)10.1109/HiPC.2017.00034(223-232)Online publication date: Dec-2017
  • (2014)A framework for automating the configuration of OpenCLEnvironmental Modelling & Software10.5555/2772077.277222153:C(81-86)Online publication date: 1-Mar-2014
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '03: Proceedings of the twelfth international conference on Information and knowledge management
November 2003
592 pages
ISBN:1581137230
DOI:10.1145/956863
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 November 2003

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. compression
  2. data modeling
  3. scientific data processing
  4. wavelets

Qualifiers

  • Article

Conference

CIKM03

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 12 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2019)Spatially-aware Parallel I/O for Particle DataProceedings of the 48th International Conference on Parallel Processing10.1145/3337821.3337875(1-10)Online publication date: 5-Aug-2019
  • (2017)Reducing Network Congestion and Synchronization Overhead During Aggregation of Hierarchical Data2017 IEEE 24th International Conference on High Performance Computing (HiPC)10.1109/HiPC.2017.00034(223-232)Online publication date: Dec-2017
  • (2014)A framework for automating the configuration of OpenCLEnvironmental Modelling & Software10.5555/2772077.277222153:C(81-86)Online publication date: 1-Mar-2014
  • (2012)Fast and effective lossy compression algorithms for scientific datasetsProceedings of the 18th international conference on Parallel Processing10.1007/978-3-642-32820-6_83(843-856)Online publication date: 27-Aug-2012
  • (2005)A hybrid approach for multiresolution modeling of large-scale scientific dataProceedings of the 2005 ACM symposium on Applied computing10.1145/1066677.1066793(511-518)Online publication date: 13-Mar-2005
  • (2004)Simulation data as data streamsACM SIGMOD Record10.1145/974121.97413733:1(89-94)Online publication date: 1-Mar-2004

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media