[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2339530.2339636acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

USpan: an efficient algorithm for mining high utility sequential patterns

Published: 12 August 2012 Publication History

Abstract

Sequential pattern mining plays an important role in many applications, such as bioinformatics and consumer behavior analysis. However, the classic frequency-based framework often leads to many patterns being identified, most of which are not informative enough for business decision-making. In frequent pattern mining, a recent effort has been to incorporate utility into the pattern selection framework, so that high utility (frequent or infrequent) patterns are mined which address typical business concerns such as dollar value associated with each pattern. In this paper, we incorporate utility into sequential pattern mining, and a generic framework for high utility sequence mining is defined. An efficient algorithm, USpan, is presented to mine for high utility sequential patterns. In USpan, we introduce the lexicographic quantitative sequence tree to extract the complete set of high utility sequences and design concatenation mechanisms for calculating the utility of a node and its children with two effective pruning strategies. Substantial experiments on both synthetic and real datasets show that USpan efficiently identifies high utility sequences from large scale data with very low minimum utility.

Supplementary Material

JPG File (307_t_talk_4.jpg)
MP4 File (307_t_talk_4.mp4)

References

[1]
R. Agrawal and R. Srikant, Mining sequential patterns, ICDE 1995, pp. 3--14.
[2]
C. F. Ahmed, S. K. Tanbeer and B. Jeong, Mining High Utility Web Access Sequences in Dynamic Web Log Data, SNPD 2010, pp. 76--81.
[3]
C. F. Ahmed, S. K. Tanbeer, J. Byeong-Soo and L. Young-Koo, Efficient tree structures for high utility pattern mining in incremental databases, TKDE 2009, vol. 21, pp. 1708--1721.
[4]
C. F. Ahmed, S. K. Tanbeer and B. Jeong, A Novel Approach for Mining High-Utility Sequential Patterns in Sequence Databases, ETRI Journal, 2010, vol. 32, no. 5, pp. 676--686.
[5]
J. Ayres, J. Flannick, J. Gehrke and T. Yiu, Sequential PAttern mining using a bitmap representation, ICDM 2002, pp. 429--435.
[6]
L. Cao, P. Yu, C. Zhang and Y Zhao. Domain Driven Data Mining. Springer, 2010.
[7]
Y. Li, J. Yeh and C. Chang, Isolated items discarding strategy for discovering high utility itemsets, Data and Knowledge Engineering, Vol. 64, Issue 1, pp. 198--217, Jan., 2008.
[8]
Y. Liu, W. Liao and A. Choudhary, A two-phase algorithm for fast discovery of high utility itemsets, PAKDD 2005, vol. 3518, pp. 689--695.
[9]
N. R. Mabroukeh and C. I. Ezeife, A taxonomy of sequential pattern mining algorithms, ACM Comput. Surv., 2010, vol. 43, pp. 1--41.
[10]
J. Pei, J. Han, B. Mortazavi-Asl, J. Wang, H. Pinto, Q. Chen, U. Dayal and M. C. Hsu., PrefixSpan: mining sequential patterns efficiently by prefix-projected pattern growth, ICDE 2001, pp. 215--224.
[11]
B. Shie, H. Hsiao, V. S. Tseng and P. S. Yu, Mining high utility mobile sequential patterns in mobile commerce environments, DASFAA 2011, pp. 224--238.
[12]
V. S. Tseng, C.-W. Wu, B.-E. Shie and P. S. Yu, UP-Growth: an efficient algorithm for high utility itemset mining, KDD 2010, pp. 253--262.
[13]
H. Yao, H. J. Hamilton and C. J. Butz, A foundational approach to mining itemset utilities from databases, ICDM 2004, pp. 482--486.
[14]
M. J. Zaki, SPADE: An Efficient Algorithm for Mining Frequent Sequences, Machine Learning, 2001, vol. 42, pp. 31--60.

Cited By

View all
  • (2025)HUPSP-LAL: Efficiently mining utility-driven sequential patterns in uncertain sequencesExpert Systems with Applications10.1016/j.eswa.2025.126536270(126536)Online publication date: Apr-2025
  • (2025)Enabling knowledge discovery through low utility itemset miningExpert Systems with Applications10.1016/j.eswa.2024.125955265(125955)Online publication date: Mar-2025
  • (2024)Totally-ordered Sequential Rules for Utility MaximizationACM Transactions on Knowledge Discovery from Data10.1145/362845018:4(1-23)Online publication date: 12-Feb-2024
  • Show More Cited By

Index Terms

  1. USpan: an efficient algorithm for mining high utility sequential patterns

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '12: Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
    August 2012
    1616 pages
    ISBN:9781450314626
    DOI:10.1145/2339530
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 12 August 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. high utility sequential pattern mining
    2. sequential pattern mining

    Qualifiers

    • Research-article

    Conference

    KDD '12
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Upcoming Conference

    KDD '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)47
    • Downloads (Last 6 weeks)6
    Reflects downloads up to 05 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2025)HUPSP-LAL: Efficiently mining utility-driven sequential patterns in uncertain sequencesExpert Systems with Applications10.1016/j.eswa.2025.126536270(126536)Online publication date: Apr-2025
    • (2025)Enabling knowledge discovery through low utility itemset miningExpert Systems with Applications10.1016/j.eswa.2024.125955265(125955)Online publication date: Mar-2025
    • (2024)Totally-ordered Sequential Rules for Utility MaximizationACM Transactions on Knowledge Discovery from Data10.1145/362845018:4(1-23)Online publication date: 12-Feb-2024
    • (2024)Toward Correlated Sequential RulesIEEE Transactions on Artificial Intelligence10.1109/TAI.2024.34293065:10(5340-5351)Online publication date: Oct-2024
    • (2024)Interpretable Classifier Models for Decision Support Using High Utility Gain PatternsIEEE Access10.1109/ACCESS.2024.345556312(126088-126107)Online publication date: 2024
    • (2024)Towards utility-driven contiguous sequential patterns in uncertain multi-sequencesKnowledge-Based Systems10.1016/j.knosys.2023.111314284(111314)Online publication date: Jan-2024
    • (2024)Mining frequent temporal duration-based patterns on time interval sequential databaseInformation Sciences10.1016/j.ins.2024.120421(120421)Online publication date: Mar-2024
    • (2024)Targeted mining of contiguous sequential patternsInformation Sciences10.1016/j.ins.2023.119791653(119791)Online publication date: Jan-2024
    • (2024)Sequential pattern mining algorithms and their applications: a technical reviewInternational Journal of Data Science and Analytics10.1007/s41060-024-00659-xOnline publication date: 5-Oct-2024
    • (2024)High-utility sequential pattern mining in incremental databaseThe Journal of Supercomputing10.1007/s11227-024-06568-x81:1Online publication date: 25-Oct-2024
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media