[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Efficient Mining of Association Rules in Distributed Databases

Published: 01 December 1996 Publication History

Abstract

Many sequential algorithms have been proposed for mining of association rules. However, very little work has been done in mining association rules in distributed databases. A direct application of sequential algorithms to distributed databases is not effective, because it requires a large amount of communication overhead. In this study, an efficient algorithm, DMA, is proposed. It generates a small number of candidate sets and requires only O(n) messages for support count exchange for each candidate set, where n is the number of sites in a distributed database. The algorithm has been implemented on an experimental test bed and its performance is studied. The results show that DMA has superior performance when comparing with the direct application of a popular sequential algorithm in distributed databases.

References

[1]
R. Agrawal C. Faloutsos and A. Swami, "Efficient Similarity Search in Sequence Databases," Proc. Fourth Int'l Conf. Foundations of Data Organization and Algorithms, Oct. 1993.
[2]
R. Agrawal S. Ghosh T. Imielinski B. Iyer and A. Swami, "An Interval Classifier for Database Mining Application," Proc. 18th Int'l Conf. Very Large Data Bases, pp. 560-573, Vancouver, Canada, Aug. 1992.
[3]
R. Agrawal T. Imielinski and A. Swami, "Database Mining: A Performance Perspective," IEEE Trans. Knowledge and Data Engineering, vol. 5, pp. 914-925, 1993.
[4]
R. Agrawal T. Imielinski and A. Swami, "Mining Assocation Rules Between Sets of Items in Large Databases," Proc. ACM-SIGMOD Int'l Conf. Management of Data, pp. 207-216, May 1993.
[5]
R. Agrawal and J.C. Shafer, "Parallel Mining of Association Rules: Design, Implementation, and Experience," IBM Research Report RJ1004, 1996.
[6]
R. Agrawal and R. Srikant, "Fast Algorithms for Mining Association Rules," Proc. Int'l Conf. Very Large Data Bases, pp. 487-499, Santiago, Chile, Sept. 1994.
[7]
D.W. Cheung A. W.-C. Fu and J. Han, "Knowledge Discovery in Databases: A Rule-Based Attribute-Oriented Approach," Proc. Int'l Symp. Methodologies for Intelligent Systems, pp. 164-173, Charlotte, N.C., Oct. 1994.
[8]
D.W. Cheung J. Han V.T. Ng and C.Y. Wong, "Maintenance of Discovered Association Rules in Large Databases: An Incremental Updating Technique," Proc. IEEE Int'l Conf. Data Engineering, New Orleans, La., Feb. 1996.
[9]
U.M. Fayyad G. Piatetsky-Shapiro P. Smyth and R. Uthurusamy, Advances in Knowledge Discovery and Data Mining. AAAI/MIT Press, 1995.
[10]
W.J. Frawley G. Piatetsky-Shapiro and C.J. Matheus, "Knowledge Discovery in Databases: An Overview," Knowledge Discovery in Databases, G. Piatetsky-Shapiro and W.J. Frawley, eds., pp. 1-27. AAAI/MIT Press, 1991.
[11]
A. Geist A. Beguelin J. Dongarra W. Jian R. Manchek and V. Sunderam, PVM: Parallel Virtual Machine. A Users' Guide and Tutorial for Networked Parallel Computing. MIT Press, 1994.
[12]
J. Han Y. Cai and N. Cercone, "Data-Driven Discovery of Quantitative Rules in Relational Databases," IEEE Trans Knowledge and Data Engineering, vol. 5, pp. 29-40, 1993.
[13]
J. Han and Y. Fu, "Discovery of Multiple-Level Association Rules from Large Databases," Proc. Int'l Conf. Very Large Data Bases, Zurich, pp. 420-431, Sept. 1995.
[14]
M. Klemettinen H. Mannila P. Ronkainen H. Toivonen and A.I. Verkamo, "Finding Interesting Rules from Large Sets of Discovered Association Rules," Proc. Third Int'l Conf. Information and Knowledge Management, pp. 401-408, Gaithersburg, Md., Nov. 1994.
[15]
R. Ng and J. Han, "Efficient and Effective Clustering Method for Spatial Data Mining," Proc Int'l Conf. Very Large Data Bases, pp. 144-155, Santiago, Chile, Sept. 1994
[16]
J.S. Park M.S. Chen and P.S. Yu, "An Effective Hash-Based Algorithm for Mining Association Rules," Proc. ACM-SIGMOD Int'l Conf. Management of Data, pp. 175-186, San Jose, Calif., May 1995.
[17]
J.S. Park M.S. Chen and P.S. Yu, "Efficient Parallel Data Mining for Association Rules," Proc. Int'l Conf. Information and Knowledge Management, Baltimore, Md., Nov. 1995.
[18]
G. Piatetsky-Shapiro and W.J. Frawley, Knowledge Discovery in Databases. AAAI/MIT Press, 1991.
[19]
A. Savasere E. Omiecinski and S. Navathe, "An Efficient Algorithm for Mining Assocation Rules in Large Databases," Proc. Int'l Conf. Very Large Data Bases, pp. 432-444, Zurich, Sept. 1995.
[20]
A. Silberschatz M. Stonebraker and J. Ullman, "Database Research: Achievements and Opportunities into the 21st Century," Report NSF Workshop Future of Databases Systems Research, May 1995.
[21]
R. Srikant and R. Agrawal, "Mining Generalized Assocation Rules," Proc. Int'l Conf. Very Large Data Bases, pp. 407-419, Zurich, Sept. 1995.
[22]
J.D. Ullman, Principles of Database and Knowledge-Base Systems, vols. 1 /2. Computer Science Press, 1989.

Cited By

View all
  • (2023)Efficient mining of association rules by reducing the number of passes over the databaseJournal of Computer Science and Technology10.1007/BF0295042316:2(182-188)Online publication date: 22-Mar-2023
  • (2023)Generalized multidimensional association rulesJournal of Computer Science and Technology10.1007/BF0294887615:4(388-392)Online publication date: 22-Mar-2023
  • (2022)A new method for privacy preserving association rule mining using homomorphic encryption with a secure communication protocolWireless Networks10.1007/s11276-022-03185-529:3(1197-1212)Online publication date: 21-Nov-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image IEEE Transactions on Knowledge and Data Engineering
IEEE Transactions on Knowledge and Data Engineering  Volume 8, Issue 6
December 1996
159 pages

Publisher

IEEE Educational Activities Department

United States

Publication History

Published: 01 December 1996

Author Tags

  1. Data mining
  2. association rule
  3. distributed algorithm
  4. distributed data mining
  5. distributed database
  6. knowledge discovery
  7. partitioned database.

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 18 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Efficient mining of association rules by reducing the number of passes over the databaseJournal of Computer Science and Technology10.1007/BF0295042316:2(182-188)Online publication date: 22-Mar-2023
  • (2023)Generalized multidimensional association rulesJournal of Computer Science and Technology10.1007/BF0294887615:4(388-392)Online publication date: 22-Mar-2023
  • (2022)A new method for privacy preserving association rule mining using homomorphic encryption with a secure communication protocolWireless Networks10.1007/s11276-022-03185-529:3(1197-1212)Online publication date: 21-Nov-2022
  • (2021)Exploring Decomposition for Solving Pattern Mining ProblemsACM Transactions on Management Information Systems10.1145/343977112:2(1-36)Online publication date: 11-Feb-2021
  • (2020)Pattern Sampling in Distributed DatabasesAdvances in Databases and Information Systems10.1007/978-3-030-54832-2_7(60-74)Online publication date: 25-Aug-2020
  • (2019)Evaluating Distributed Transactional Database SystemProceedings of the 11th International Conference on Computer Modeling and Simulation10.1145/3307363.3307364(203-207)Online publication date: 16-Jan-2019
  • (2018)Methodology Preview on Predicting Students Professional Identity Using Data Mining TechniquesProceedings of the Fourth International Conference on Engineering & MIS 201810.1145/3234698.3234754(1-6)Online publication date: 19-Jun-2018
  • (2017)Information fusion from multiple databases using meta-association rulesInternational Journal of Approximate Reasoning10.1016/j.ijar.2016.09.00680:C(185-198)Online publication date: 1-Jan-2017
  • (2016)Pharmacy robotic dispensing and planogram analysis using association rule mining with prescription dataExpert Systems with Applications: An International Journal10.1016/j.eswa.2016.02.04557:C(296-310)Online publication date: 15-Sep-2016
  • (2016)Meta-association rules for mining interesting associations in multiple datasetsApplied Soft Computing10.1016/j.asoc.2016.08.01449:C(212-223)Online publication date: 1-Dec-2016
  • Show More Cited By

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media