[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
article
Free access

On the performance of object clustering techniques

Published: 01 June 1992 Publication History

Abstract

We investigate the performance of some of the best-known object clustering algorithms on four different workloads based upon the tektronix benchmark. For all four workloads, stochastic clustering gave the best performance for a variety of performance metrics. Since stochastic clustering is computationally expensive, it is interesting that for every workload there was at least one cheaper clustering algorithm that matched or almost matched stochastic clustering. Unfortunately, for each workload, the algorithm that approximated stochastic clustering was different. Our experiments also demonstrated that even when the workload and object graph are fixed, the choice of the clustering algorithm depends upon the goals of the system. For example, if the goal is to perform well on traversals of small portions of the database starting with a cold cache, the important metric is the per-traversal expansion factor, and a well-chosen placement tree will be nearly optimal; if the goal is to achieve a high steady-state performance with a reasonably large cache, the appropriate metric is the number of pages to which the clustering algorithm maps the active portion of the database. For this metric, the PRP clustering algorithm, which only uses access probabilities achieves nearly optimal performance.

References

[1]
T. Lousenia Anderson et al. The Tektronix HyperModel Benchmark. EDBT, 1990.
[2]
P. Bailey. Performance evaluation in a persistent object system. In Proceedings of the Third International Workshop on Persistent Object Systems, Newcastle, Australia, September 1989.
[3]
Veronique Benzaken and Claude Delobel. Enhancing performance in a persistent object store: Clustering strategies in 02. Technical Report 50-90, Altair, August 1990.
[4]
P. J. Denning. The working set model of program behavior. Comm. A CM, 11(5):323-333, May 1968.
[5]
O. Deux et al. The 02 system. A CM Communications, 34(10):34-49, 1991.
[6]
Pamela Drew and Roger King. The performance and utility of the CACTIS implementation algorithms. In Proceedings of the 16.th VLDB Conference, pages 135-147, Brisbane, Australia, 1990.
[7]
Gilbert Harrus, Veronique Benzaken, and Claude Delobel. Measuring performance of clustering strategies: the CluB-0 benchmark. Technical Report 66-91, Altair, January 1991.
[8]
Scott E. Hudson and Roger King. Cactis: A selfadaptive, concurrent implementation of an objectoriented database management system. A CM Transactions on Data Base Systems, 14(3):291- 321, September 1989.
[9]
B.W. Kernighan and S. Lin. An efficient heuristic procedure for partitioning graphs. Bell System Technical Journal, 49(2):291-307, February 1970.
[10]
Christos H. Papadimitriou and Kenneth Steiglitz. Combinatorial Optimization: Algorithms and Complexity. Prentice-Hall Inc, 1982.
[11]
Mark Palmer and Stanley B. Zdonik. FIDO: a cache that learns to fetch. In Proceedings of the 17-th VLDB Conference, Barcelona Spain, 1991.
[12]
J. Richardson and M. Carey. Persistence in the E Language: Issues and Implementation. Software Practice and Experience, 19, 12 1989.
[13]
Karen Shannon and Richard Snodgrass. Semantic clustering. In Proceedings of the $-th Int'l Workshop in Persistent Object Systems, pages 361-374, Martha's Vineyard, MA, September 1990.
[14]
James W. Stamos. Static grouping of small objects to enhance performance of a paged virtual memory. A CM Transactions on Computer Systems, 2(2):155-180, May 1984.
[15]
Manohs M. Tsangaris and Jeffrey F. Naughton. A stochastic approach for clustering in object stores. In Proceedings o/ the SIGMOD International Conference on Management of Data, pages 12-21, Denver, Colorado, May 1991.
[16]
Manolis M. Tsangaris and Jeffrey F. Naughton. A stochastic approach for clustering in object stores. Technical Report 1017, University of Wisconsin, Computer Sciences Dept., April 1991.
[17]
Manolis M. Tsangaris and Jeffrey F. Naughton. On the performance of object clustering techniques. Technical report, University of Wisconsin, Computer Sciences Dept., March 1992.
[18]
C. T. Yu, Cheing-Mei Suen, K. Lam, and M. K. Siu. Adaptive record clustering. ACM Transactions on Data Base Systems, 10(2):180-204, June 1985.
[19]
P.C. Yue and C.K. Wong. On the optimality of the probability ranking scheme in storage applications. JACM, 20(4):624-633, October 1973.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM SIGMOD Record
ACM SIGMOD Record  Volume 21, Issue 2
June 1, 1992
415 pages
ISSN:0163-5808
DOI:10.1145/141484
Issue’s Table of Contents
  • cover image ACM Conferences
    SIGMOD '92: Proceedings of the 1992 ACM SIGMOD international conference on Management of data
    June 1992
    416 pages
    ISBN:0897915216
    DOI:10.1145/130283
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 June 1992
Published in SIGMOD Volume 21, Issue 2

Check for updates

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)136
  • Downloads (Last 6 weeks)30
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2007)Opportunistic prioritised clustering framework for improving OODBMS performanceJournal of Systems and Software10.1016/j.jss.2006.04.01780:3(371-387)Online publication date: 1-Mar-2007
  • (2006)An XML data allocation method on disksJournal of Systems Architecture: the EUROMICRO Journal10.1016/j.sysarc.2006.06.00352:10(578-588)Online publication date: 1-Oct-2006
  • (2005)The automatic improvement of locality in storage systemsACM Transactions on Computer Systems10.1145/1113574.111357723:4(424-473)Online publication date: 1-Nov-2005
  • (2005)Adaptive parameter passingObject Technologies for Advanced Software10.1007/3-540-60954-7_47(118-136)Online publication date: 7-Jun-2005
  • (2001)An object-oriented database design for improved performanceData & Knowledge Engineering10.1016/S0169-023X(01)00004-037:2(117-138)Online publication date: 1-May-2001
  • (1995)Graph-based optimizations for parameter passing in remote invocationsProceedings of International Workshop on Object Orientation in Operating Systems10.1109/IWOOS.1995.470558(179-182)Online publication date: 1995
  • (1993)MILORD: Advanced database technology for implementing a multimedia medical workstationProceedings. The Third International Conference on Image Management and Communication in Patient Care10.1109/IMAC.1993.665430(46-54)Online publication date: 1993
  • (1993)Data organization and storage hierarchies in a multimedia serverDigest of Papers. Compcon Spring10.1109/CMPCON.1993.289740(596-604)Online publication date: 1993
  • (2019)Approximate Data Reuse-based Accelerator Design for Embedded ProcessorACM Transactions on Design Automation of Electronic Systems10.1145/334209824:5(1-25)Online publication date: 21-Aug-2019
  • (2019)Investigating the Impact of Image Content on the Energy Efficiency of Hardware-accelerated Digital Spatial FiltersACM Transactions on Design Automation of Electronic Systems10.1145/334181924:5(1-34)Online publication date: 17-Oct-2019
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media