[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
article

CBAR: an efficient method for mining association rules

Published: 01 April 2005 Publication History

Abstract

The discovery of association rules is an important data-mining task for which many algorithms have been proposed. However, the efficiency of these algorithms needs to be improved to handle real-world large datasets. In this paper, we present an efficient algorithm named cluster-based association rule (CBAR). The CBAR method is to create cluster tables by scanning the database once, and then clustering the transaction records to the k-th cluster table, where the length of a record is k. Moreover, the large itemsets are generated by contrasts with the partial cluster tables. This not only prunes considerable amounts of data reducing the time needed to perform data scans and requiring less contrast, but also ensures the correctness of the mined results. Experiments with the FoodMart transaction database provided by Microsoft SQL Server show that CBAR outperforms Apriori, a well-known and widely used association rule.

References

[1]
Agrawal, R., Imilienski, T. and Swami, A., Mining association rules between sets of items in large databases. 1993. Proceedings of the ACM SIGMOD International Conference on Management of Data, Washington, DC, 1993.
[2]
Agrawal, R. and Srikant, R., Fast algorithm for mining association rules in large databases. 1994. Proceedings of 1994 International Conference on VLDB.
[3]
Agrawal, R. and Srikant, R., Mining sequential patterns. 1995. Proceedings of the 11th International Conference on Data Engineering (ICDE), 1995.
[4]
Berzal, F., Cubero, J.C., Marin, N. and Serrano, J.M., TBAR: an efficient method for association rule mining in relational databases. Elserier Data and Knowledge, Engineering. v37. 47-64.
[5]
Brin, S., Motwani, R. and Silverstein, C., Beyond market baskets: generalizing association rules to correlations. 1997. ACM SIGMOD Conference on Management of Data, Tuscon, Arizona, 1997.
[6]
Brin, S., Motwani, R., Ullman, J.D. and Tsur, S., Dynamic itemset counting and implication rules for market basket data. 1997. Proceedings of the ACM SIGMOD International Conference on Management of Data, 1997.
[7]
Cabena, P., Hadjinian, P., Stadler, R., Verhees, J. and Zanasi, A., Discovering DATA MINING FROM CONCEPT TO IMPLEMEntation. 1997. Prentice-Hall, Englewood Cliffs.
[8]
Chen, M.S., Han, J. and Yu, P.S., Data mining: an overview from a database perspective. IEEE Transactions on Knowledge and Data Engineering. v8 i6. 866-883.
[9]
Cheung, D.W., Han, J., Ng, V.T., Fu, A.W. and Fu, Y., A fast distributed algorithm for mining association rules. 1996. Proceedings of International Conference on PDIS'96, Miami Beach, Florida, USA, 1996.
[10]
Dunkel, B. and Soparkar, N., . 1999. Data Organization and Access for Efficient Data Mining, 1999.ICDE, Australia.
[11]
Han, J. and Kamber, M., . 2000. Data Mining: Concepts and Techniques, 2000.Morgan Kaufmann, Los Altos.
[12]
Han, J. and Fu, Y., Mining multiple-level association rules in large databases. IEEE Transaction on Knowledge and Data Engineering. v11 i5. 798-805.
[13]
Data warehouse marketing trends/opportunities: an in-depth analysis of key market trends. META Group.
[14]
Pork, J.S., Chen, M.S. and Yu, P.S., An effective hash based algorithm for mining association rules. ACM SIGMOD. 175-186.
[15]
Srikant, R. and Agrawal, R., Mining generalized association rules. 1995. Proceedings of the 21st International Conference on VLDB, Zurich, Switzerland, 1995.
[16]
Savasere, A., Omiecinski, E. and Navathe, S., An efficient algorithm for mining association rules in large databases. 1995. Proceedings of 21th VLDB Conference, Zurich, Switzerland, 1995.
[17]
Toivonen, H., Sampling large databases for association rules. 1996. Proceedings of 22nd VLDB Conference, Mumbai, India, 1996.

Cited By

View all
  1. CBAR: an efficient method for mining association rules

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image Knowledge-Based Systems
    Knowledge-Based Systems  Volume 18, Issue 2-3
    April, 2005
    57 pages

    Publisher

    Elsevier Science Publishers B. V.

    Netherlands

    Publication History

    Published: 01 April 2005

    Author Tags

    1. Association rule
    2. Cluster
    3. Data mining

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 02 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2020)Big data analysis of e-commerce loan risk of college students in the context of network financeInformation Systems and e-Business Management10.1007/s10257-019-00424-918:3(439-454)Online publication date: 1-Sep-2020
    • (2018)TFI-AprioriIntelligent Data Analysis10.3233/IDA-17347322:4(807-827)Online publication date: 1-Jan-2018
    • (2018)Identifying and structuring service functions of mobile applications in Google's Android MarketInformation Systems and e-Business Management10.1007/s10257-017-0366-716:2(383-406)Online publication date: 1-May-2018
    • (2017)Using association rules to assess purchase probability in online storesInformation Systems and e-Business Management10.1007/s10257-016-0329-415:3(751-780)Online publication date: 1-Aug-2017
    • (2016)Using minimal generators for composite isolated point extraction and conceptual binary relation coverageInformation Sciences: an International Journal10.1016/j.ins.2015.12.013336:C(129-144)Online publication date: 1-Apr-2016
    • (2016)An efficient algorithm for mining top-rank-k frequent patternsApplied Intelligence10.1007/s10489-015-0748-945:1(96-111)Online publication date: 1-Jul-2016
    • (2015)A two-stage constructive method for the unweighted minimum string cover problemKnowledge-Based Systems10.1016/j.knosys.2015.01.00377:C(103-113)Online publication date: 1-Mar-2015
    • (2013)MOWSProceedings of the 3rd International Conference on Web Intelligence, Mining and Semantics10.1145/2479787.2479790(1-10)Online publication date: 12-Jun-2013
    • (2012)Improving the performance of association classifiers by rule prioritizationKnowledge-Based Systems10.1016/j.knosys.2012.06.00436(59-67)Online publication date: 1-Dec-2012
    • (2011)Advanced Matrix Algorithm (AMA): reducing number of scans for association rule generationInternational Journal of Business Intelligence and Data Mining10.1504/IJBIDM.2011.0394126:2(202-214)Online publication date: 1-Apr-2011
    • Show More Cited By

    View Options

    View options

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media