[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3478905.3478972acmotherconferencesArticle/Chapter ViewAbstractPublication PagesdsitConference Proceedingsconference-collections
research-article
Open access

MaxRI: A method for discovering maximal rare itemsets

Published: 28 September 2021 Publication History

Abstract

Rare itemset mining got extensive attention due to its high importance in real-life applications. Rare itemset mining methods aim at discovering the whole set of rare itemsets in a dataset. Although current algorithms perform reasonably well in finding interesting rare itemsets, they also reveal a large number of rare itemsets, including redundant ones. As a result, skimming through these massive amounts of (partly redundant) itemsets is a big overhead in many applications. On the other hand, generating a massive number of rare itemsets also compromises the performance of algorithms in terms of time and memory. To address these limitations, we propose an efficient algorithm called maximal rare itemset (MaxRI) to discover maximal rare patterns (long rare itemset). Then, we propose another method RRI (Recover Rare Itemsets from maximal rare itemsets) to retrieve the interesting subset of rare itemsets of a user-given length, k, from the set of maximal rare itemsets. To the best of our knowledge, this is the first paper proposed for rare itemset mining by considering the representative rare patterns without redundant ones. Our experimental results indicate that our proposed methods’ performance is better than the up-to-date algorithms in terms of time and memory consumption.

References

[1]
[1] Agrawal, R., Srikant, R., Fast algorithms for mining association rules in large databases, Proceedings of the 20th International Conference on Very Large Data Bases, VLDB, pp. 487–499,1994.
[2]
[2] Chan, P. K., Stolfo, S. J., toward scalable learning with non-uniform class and cost distributions: a case study in credit card fraud detection. In Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining, pages 164-168, 2001.
[3]
[3] Ji, Y., Ying, H., Tran, J., Dews, P., Mansour, A., and Massanari, R. M., A method for mining infrequent causal associations and its application in finding adverse drug reaction signal pairs, IEEE transactions on Knowledge and Data Engineering, pp.721-733, 2012.
[4]
[4] Weiss, G. M., and Hirsh, H., Learning to predict rare events in event sequences, In Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining, pp. 359-363, 1998.
[5]
[5] Koh, Y. S., and Rountree, N., Finding sporadic rules using apriori-inverse, In PacificAsia Conference on Knowledge Discovery and Data Mining (PAKDD), Springer, pp. 97-106, 2005.
[6]
[6] Szathmary, L., Napoli, A., and Valtchev, P., Towards rare itemset mining, In IEEE International Conference on Tools with Artificial Intelligence (ICTAI) 2007, IEEE, pp. 305–312, 2007.
[7]
[7] Weiss G.M., Mining with rarity: a unifying framework, ACM SIGKDD Explorations Newsletter, pp. 7–19, 2004.
[8]
[8] Kataria, M., Oswald, C., and Sivaselvan, B., A Novel Rare Itemset Mining Algorithm Based on Recursive Elimination, Advances in Intelligent Systems and Computing, In Software Engineering, pp. 221-233, 2019.
[9]
[9] Tsang, S., Koh, Y. S., and Dobbie, G., RP-Tree: Rare Pattern Tree Mining, In Data Warehousing and Knowledge Discovery (DaWaK), Lecture Notes in Computer Science, Springer, pp. 277-288, 2011.
[10]
[10] Darrab, S., Broneske, D., and Saake, G., RPP algorithm: a method for discovering interesting rare itemsets, in The the International Conference on Data Mining and Big Data, Springer-Nature in Communications in Computer and Information Science (CCIS), Springer, 2020.
[11]
[11] Darrab, S.and ERGENÇ, B., Frequent pattern mining under multiple support thresholds, the International Conference on Applied Computer Science (ACS), Wseas Transactions on Computer Research, pp. 1-10, 2016.
[12]
[12] Adda, M., Wu, L.,and Feng, Y., Rare itemset mining, In Sixth International Conference on Machine Learning and Applications (ICMLA), IEEE, pp. 73-80, 2007.
[13]
[13] Troiano, L., Scibelli, G., and Birtolo, C., A fast algorithm for mining rare itemsets, In Proceedings of the International Conference on Intelligent Systems Design and Applications, IEEE Computer Society Press, pp. 1149-1155, 2009.
[14]
[14] Liu, B., Hsu, W., and Ma, Y., Mining association rules with multiple minimum supports, In Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 337-341, 1999.
[15]
[15] Kiran, R. U., and Re, P. K, An improved multiple minimum support based approach to mine rare association rules, In Computational Intelligence and Data Mining (CIDM), IEEE, pp. 340-347, 2009.
[16]
[16] Han, J., Pei, J., and Yin, Y., Mining frequent patterns without candidate generation, In Proceedings of the International Conference on Management of Data (ACM SIGMOD), PP. 1-12, 2000.
[17]
[17] Darrab, S., and Ergenc, B., Vertical pattern mining algorithm for multiple support thresholds, International Conference on Knowledge Based and Intelligent Information and Engineering (KES), Procedia Computer Science 112, pp. 417-426, 2017.
[18]
[18] Hu, Y. H., and Chen, Y. L., Mining association rules with multiple minimum supports: a new mining algorithm and a support tuning mechanism, Decision Support Systems, pp. 1-24, 2006.
[19]
[19] Kiran, R. U., and Reddy, P. K., Novel techniques to reduce search space in multiple minimum supports-based frequent pattern mining algorithms, In Proceedings of the international conference on extending database technology (EDBT), pp. 11-20, 2011.
[20]
[20] Vo, B., Pham, S., Le, T., and Deng, Z. H., A novel approach for mining maximal frequent patterns, Expert Systems with Applications, pp. 178-186, 2017
[21]
[21] Lazaar, N., Lebbah, Y., Loudni, S., Maamar, M., Lemière, V., Bessière, C.,and Boizumault, P., A global constraint for closed frequent pattern mining, In International Conference on Principles and Practice of Constraint Programming, Springer, pp. 333-349, 2016.
[22]
[22] Darrab S., Broneske D., and Saake G., Modern Application and Challenges for Rare Itemset Mining, in the International Conference on Knowledge Discovery, International Journal of Machine Learning and Computing (IJMLC), 2019.
[23]
[23] Borah, A., and Nath, B., Rare pattern mining: challenges and future perspectives, Complex and Intelligent Systems, pp. 1-23, 2019.
[24]
[24] Frequent Itemset Mining Dataset Repository, http://fimi.uantwerpen.be/data/.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
DSIT 2021: 2021 4th International Conference on Data Science and Information Technology
July 2021
481 pages
ISBN:9781450390248
DOI:10.1145/3478905
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 28 September 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. maximal rare itemsets
  2. rare itemsets
  3. representative rare itemsets

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

DSIT 2021

Acceptance Rates

Overall Acceptance Rate 114 of 277 submissions, 41%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 339
    Total Downloads
  • Downloads (Last 12 months)99
  • Downloads (Last 6 weeks)13
Reflects downloads up to 01 Jan 2025

Other Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media