[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to main content

Three-way Indexing ZDDs for Large-Scale Sparse Datasets

  • Conference paper
  • First Online:
Trends and Applications in Knowledge Discovery and Data Mining (PAKDD 2014)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8643))

Included in the following conference series:

  • 2215 Accesses

Abstract

Zero-suppressed decision diagrams (ZDDs) are a data structure for representing combinations over item sets. They have been applied to many areas such as data mining. When ZDDs represent large-scale sparse datasets, they tend to obtain an unbalanced form, which results performance degradation. In this paper, we propose a new data structure three-way indexing ZDD, as a variant of ZDDs. We furthermore present algorithms to convert between three-way indexing ZDDs and ordinary ZDDs. Experimental results show the effectiveness of our data structure and algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
£29.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
GBP 19.95
Price includes VAT (United Kingdom)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
GBP 71.50
Price includes VAT (United Kingdom)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
GBP 89.99
Price includes VAT (United Kingdom)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    A frequent itemset with support threshold \(n\) is an itemset \(X\) such that the number of rows containing all items in \(X\) is more than or equal to \(n\). A frequent itemset is maximal if it is not contained in any other frequent itemset with the same support threshold.

References

  1. Goethals, B.: Survey on frequent pattern mining. Technical report (2002)

    Google Scholar 

  2. Minato, S.: Zero-suppressed BDDs for set manipulation in combinatorial problems. In: DAC ’93: Proceedings of the 30th International Design Automation Conference, pp. 272–277. ACM, New York (1993)

    Google Scholar 

  3. Minato, S., Uno, T., Arimura, H.: LCM over ZBDDs: fast generation of very large-scale frequent itemsets using a compact graph-based representation. In: Washio, T., Suzuki, E., Ting, K.M., Inokuchi, A. (eds.) PAKDD 2008. LNCS (LNAI), vol. 5012, pp. 234–246. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  4. Minato, S.: A fast algorithm for cofactor implication checking and its application for knowledge discovery. In: Proceedings of IEEE 8th International Conference on Computer and Information Technology, July 2008, pp. 53–58 (2008)

    Google Scholar 

  5. Minato, S.: Finding simple disjoint decompositions in frequent itemset data using zero-suppressed BDDs. In: Proceedings of IEEE ICDM 2005 Workshop on Computational Intelligence in Data Mining, November 2005, pp. 3–11 (2005)

    Google Scholar 

  6. Minato, S., Ito, K.: Symmetric item set mining method using zero-suppressed BDDs and application to biological data. Trans. Jpn. Soc. Artif. Intell. 22(2), 156–164 (2007)

    Article  Google Scholar 

  7. Coudert, O.: Solving graph optimization problems with ZBDDs. In: Proceedings of the 1997 European Conference on Design and Test, March 1997, pp. 224–228 (1997)

    Google Scholar 

  8. Loekito, E., Bailey, J.: Fast mining of high dimensional expressive contrast patterns using zero-suppressed binary decision diagrams. In: Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 307–316. ACM (2006)

    Google Scholar 

  9. Minato, S.: Techniques of BDD/ZDD: brief history and recent activity. IEICE Trans. Inf. Syst. E96-D(7), 1419–1429 (2013)

    Google Scholar 

  10. Minato, S.-I.: Z-Skip-Links for fast traversal of ZDDs representing large-scale sparse datasets. In: Bodlaender, H.L., Italiano, G.F. (eds.) ESA 2013. LNCS, vol. 8125, pp. 731–742. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  11. Bentley, J.L., Sedgewick, R.: Fast algorithms for sorting and searching strings. In: Proceedings of 8th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA ’97), January 1997, pp. 360–369 (1997)

    Google Scholar 

  12. Knuth, D.E.: The Art of Computer Programming, vol. 4a. Addison-Wesley Professional, New Jersey (2011)

    Google Scholar 

  13. Bentley, J.L., Saxe, J.B.: Algorithms on vector sets. SIGACT News 11(2), 36–39 (1979)

    Article  Google Scholar 

  14. Toda, T.: Fast compression of large-scale hypergraphs for solving combinatorial problems. In: Fürnkranz, J., Hüllermeier, E., Higuchi, T. (eds.) DS 2013. LNCS, vol. 8140, pp. 281–293. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hiroshi Aoki .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Aoki, H., Toda, T., Minato, Si. (2014). Three-way Indexing ZDDs for Large-Scale Sparse Datasets. In: Peng, WC., et al. Trends and Applications in Knowledge Discovery and Data Mining. PAKDD 2014. Lecture Notes in Computer Science(), vol 8643. Springer, Cham. https://doi.org/10.1007/978-3-319-13186-3_41

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-13186-3_41

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-13185-6

  • Online ISBN: 978-3-319-13186-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics