Abstract
The explosive growth in data collection in business organizations introduces the problem of turning these rapidly expanding data stores into nuggets of actionable knowledge. The state-of-the-art data mining tools available for this integrate loosely with data stored in DMBSSs, typically through a cursor interface. In this paper, we consider several formulations of association rule mining (a typical data mining problem) using SQL-92 queries and study the performance of different join orders and join methods for executing them. We analyze the cost of the different execution plans which provides a basis to incorporate the semantics of association rule mining into future query optimizers. Based on them we identify certain optimizations and develop the Set-oriented Apriori approach. This work is an initial step towards developing “SQL-aware” mining algorithms and exploring the enhancements to current relational DBMSs to make them “mining-aware” thereby bridging the gap between the two.
This work was supported in part by the Office of Naval Research and the Spawar System Center — San Diego, by the Rome Laboratory, DARPA, and the NSF Grant IRI-9528390
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
R. Agrawal, T. Imielinski, and A. Swami. Mining association rules between sets of items in large databases. In Proc. of the ACM SIGMOD Conference, pages 207–216, Washington, D.C., May 1993.
R. Agrawal and K. Shim. Developing tightly-coupled data mining applications on a RDBMS. In Proc. of the KDD Conference, Portland, Oregon, August 1996.
R. Agrawal and R. Srikant. Fast Algorithms for Mining Association Rules. In Proc. of the VLDB Conference, Santiago, Chile, September 1994.
J. Han, Y. Fu, K. Koperski, W. Wang, and 0. Zaiane. DMQL: A data mining query language for relational datbases. In Proc. of the 1996 SIGMOD DMKD workshop, Montreal, Canada, May 1996.
M. Houtsma and A. Swami. Set-oriented mining of association rules. In Int’l Conference on Data Engineering, Taipei, Taiwan, March 1995.
T. Imielinski, A. Virmani, and A. Abdulghani. Discovery Board Application Programming Interface and Query Language for Database Mining. In Proc. of the KDD Conference, Portland, Oregon, August 1996.
R. Meo, G. Psaila, and S. Ceri. A new SQL like operator for mining association rules. In Proc. of the VLDB Conference, Bombay, India, Sep 1996.
PostgreSQL Organization. PostgreSQL 6.3 User Manual, February 1998. http://ww.postgresql.org.
S. Sarawagi, S. Thomas, and R. Agrawal. Integrating Association Rule Mining with Relational Database Systems: Alternatives and Implications. In Proc. of the ACM SIGMOD Confewace, Seattle, Washington, June 1998.
S. TSUT, et al. Query Flocks: A Generalization of Association Rule Mining. In Proc. of the ACM SIGMOD Conference, Seattle, Washington, June 1998.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Thomas, S., Chakravarthy, S. (1999). Performance Evaluation and Optimization of Join Queries for Association Rule Mining. In: Mohania, M., Tjoa, A.M. (eds) DataWarehousing and Knowledge Discovery. DaWaK 1999. Lecture Notes in Computer Science, vol 1676. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48298-9_26
Download citation
DOI: https://doi.org/10.1007/3-540-48298-9_26
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66458-1
Online ISBN: 978-3-540-48298-7
eBook Packages: Springer Book Archive