[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
article
Free access

On a partitioning problem

Published: 01 September 1978 Publication History

Abstract

This paper investigates the problem of locating a set of “boundary points” of a large number of records. Conceptually, the boundary points partition the records into subsets of roughly the same number of elements, such that the key values of the records in one subset are all smaller or all larger than those of the records in another subset. We guess the locations of the boundary points by linear interpolation and check their accuracy by reading the key values of the records on one pass. This process is repeated until all boundary points are determined. Clearly, this problem can also be solved by performing an external tape sort. Both analytical and empirical results indicate that the number of passes required is small in comparison with that in an external tape sort. This kind of record partitioning may be of interest in setting up a statistical database system.

References

[1]
FELLER, W. An Introduction to Probability Theory and Its Applications, Vol. I. Wiley, New York, 1968.
[2]
KNUTH, D.E. The Art of Computer Programming, Vol. III: Sorting and Searching. Addison- Wesley, Reading, Mass., 1973.
[3]
Yu, C.T., AND CHIN, F.Y. A study on the protection of statistical data bases. Proc. ACM SIGMOD 1977, pp. 169-181.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Database Systems
ACM Transactions on Database Systems  Volume 3, Issue 3
Sept. 1978
119 pages
ISSN:0362-5915
EISSN:1557-4644
DOI:10.1145/320263
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 September 1978
Published in TODS Volume 3, Issue 3

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. external sort
  2. key value
  3. partition
  4. passes
  5. tape probability

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 277
    Total Downloads
  • Downloads (Last 12 months)15
  • Downloads (Last 6 weeks)2
Reflects downloads up to 13 Dec 2024

Other Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Full Access

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media