[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Randomnet: clustering time series using untrained deep neural networks

Published: 22 June 2024 Publication History

Abstract

Neural networks are widely used in machine learning and data mining. Typically, these networks need to be trained, implying the adjustment of weights (parameters) within the network based on the input data. In this work, we propose a novel approach, RandomNet, that employs untrained deep neural networks to cluster time series. RandomNet uses different sets of random weights to extract diverse representations of time series and then ensembles the clustering relationships derived from these different representations to build the final clustering results. By extracting diverse representations, our model can effectively handle time series with different characteristics. Since all parameters are randomly generated, no training is required during the process. We provide a theoretical analysis of the effectiveness of the method. To validate its performance, we conduct extensive experiments on all of the 128 datasets in the well-known UCR time series archive and perform statistical analysis of the results. These datasets have different sizes, sequence lengths, and they are from diverse fields. The experimental results show that the proposed method is competitive compared with existing state-of-the-art methods.

References

[1]
Berndt DJ, Clifford J (1994) Using dynamic time warping to find patterns in time series. In: KDD Workshop, vol. 10, pp 359–370. Seattle, WA
[2]
Chitta R, Jin R, Jain AK (2012) Efficient kernel clustering using random fourier features. In: 2012 IEEE 12th international conference on data mining, pp 161–170. IEEE
[3]
Dau HA, Bagnall A, Kamgar K, Yeh C-CM, Zhu Y, Gharghabi S, Ratanamahatana CA, and Keogh E The ucr time series archive IEEE/CAA J Automatica Sinica 2019 6 6 1293-1305
[4]
Dempster A, Petitjean F, and Webb GI Rocket: exceptionally fast and accurate time series classification using random convolutional kernels Data Min Knowl Disc 2020 34 5 1454-1495
[5]
Dempster A, Schmidt DF, Webb GI (2021) Minirocket: A very fast (almost) deterministic transform for time series classification. In: Proceedings of the 27th ACM SIGKDD, pp 248–257
[6]
Demšar J Statistical comparisons of classifiers over multiple data sets J Mach Learn Res 2006 7 Jan 1-30
[7]
Farahmand A-m, Pourazarm S, Nikovski D (2017) Random projection filter bank for time series data. In: NIPS, pp 6562–6572
[8]
Fern XZ, Brodley CE (2004) Solving cluster ensemble problems by bipartite graph partitioning. In: Proceedings of the twenty-first international conference on machine learning, p 36. ACM
[9]
Fujita A, Severino P, Kojima K, Sato JR, Patriota AG, and Miyano S Functional clustering of time series gene expression data by granger causality BMC Syst Biol 2012 6 1 137
[10]
Guo X, Gao L, Liu X, Yin J (2017) Improved deep embedded clustering with local structure preservation. In: IJCAI, pp 1753–1759
[11]
He Q, Jin X, Du C, Zhuang F, and Shi Z Clustering in extreme learning machine feature space Neurocomputing 2014 128 88-95
[12]
Hoeffding W Probability inequalities for sums of bounded random variables collected Works Wassily Hoeffding 1994 58 409-426
[13]
Karypis G and Kumar V A fast and high quality multilevel scheme for partitioning irregular graphs SIAM J Sci Comput 1998 20 1 359-392
[14]
Kumar M, Patel NR, Woo J (2002) Clustering seasonality patterns in the presence of errors. In: Proceedings of the Eighth ACM SIGKDD, pp 557–563. ACM
[15]
Lei Q, Yi J, Vaculin R, Wu L, Dhillon IS (2019) Similarity preserving representation learning for time series clustering. In: Proceedings of the 28th international joint conference on artificial intelligence, pp 2845–2851. AAAI Press
[16]
Li X, Lin J, Zhao L (2019) Linear time complexity time series clustering with symbolic pattern forest. In: Proceedings of the 28th international joint conference on artificial intelligence, pp 2930–2936. AAAI Press
[17]
Lin J, Keogh E, Wei L, and Lonardi S Experiencing sax: a novel symbolic representation of time series Data Min Knowl Disc 2007 15 2 107-144
[18]
Ma Q, Zheng J, Li S, and Cottrell GW Learning representations for time series clustering Adv Neural Inf Process Syst 2019 32 3776-3786
[19]
Ma Q, Chen C, Li S, and Cottrell GW Learning representations for incomplete time series clustering Proc AAAI Conf Artif Intell 2021 35 10 8837-8846
[20]
Maaten L and Hinton G Visualizing data using t-sne J Mach Learn Res 2008 9 11 2579-2605
[21]
MacQueen J (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the fifth berkeley symposium on mathematical statistics and probability, vol. 1, pp 281–297. Oakland, CA, USA
[22]
Madiraju NS, Sadat SM, Fisher D, Karimabadi H (2018) Deep temporal clustering: fully unsupervised learning of time-domain features. arXiv preprint arXiv:1802.01059
[23]
Paparrizos J, Gravano L (2015) k-shape: efficient and accurate clustering of time series. In: Proceedings of the 2015 ACM SIGMOD international conference on management of data, pp 1855–1870. ACM
[24]
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, and Dubourg V Scikit-learn: machine learning in python J Mach Learn Res 2011 12 2825-2830
[25]
Peng Y, Zheng W-L, and Lu B-L An unsupervised discriminative extreme learning machine and its applications to data clustering Neurocomputing 2016 174 250-264
[26]
Petitjean F, Ketterlin A, and Gançarski P A global averaging method for dynamic time warping, with applications to clustering Pattern Recogn 2011 44 3 678-693
[27]
Rahimi A, Recht B (2007) Random features for large-scale kernel machines. In: NIPS, vol. 3, p. 5. Citeseer
[28]
Rand WM Objective criteria for the evaluation of clustering methods J Am Stat Assoc 1971 66 336 846-850
[29]
Révész P The Laws of Large Numbers 2014 Cambridge Academic Press
[30]
Saito N, Coifman RR (1994) Local feature extraction and its applications using a library of bases. PhD thesis, Yale University
[31]
Steinbach M, Tan P-N, Kumar V, Klooster S, Potter C (2003) Discovery of climate indices using clustering. In: Proceedings of the Ninth ACM SIGKDD, pp 446–455. ACM
[32]
Subhani N, Rueda L, Ngom A, and Burden CJ Multiple gene expression profile alignment for microarray time-series data clustering Bioinformatics 2010 26 18 2281-2288
[33]
Tan CW, Dempster A, Bergmeir C, and Webb GI Multirocket: multiple pooling operators and transformations for fast and effective time series classification Data Min Knowl Disc 2022 36 5 1623-1646
[34]
Wismüller A, Lange O, Dersch DR, Leinsinger GL, Hahn K, Pütz B, and Auer D Cluster analysis of biomedical image time-series Int J Comput Vision 2002 46 2 103-128
[35]
Wu L, Chen P-Y, Yen IE-H, Xu F, Xia Y, Aggarwal C (2018) Scalable spectral clustering using random binning features. In: Proceedings of the 24th ACM SIGKDD, pp 2506–2515
[36]
Yang J, Leskovec J (2011) Patterns of temporal variation in online media. In: Proceedings of the Fourth ACM international conference on web search and data mining, pp 177–186
[37]
Zakaria J, Mueen A, Keogh E (2012) Clustering time series using unsupervised-shapelets. In: 2012 IEEE 12th international conference on data mining, pp 785–794. IEEE
[38]
Zhang Q, Wu J, Yang H, Tian Y, Zhang C (2016) Unsupervised feature learning from time series. In: IJCAI, pp 2322–2328

Index Terms

  1. Randomnet: clustering time series using untrained deep neural networks
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Please enable JavaScript to view thecomments powered by Disqus.

        Information & Contributors

        Information

        Published In

        cover image Data Mining and Knowledge Discovery
        Data Mining and Knowledge Discovery  Volume 38, Issue 6
        Nov 2024
        893 pages

        Publisher

        Kluwer Academic Publishers

        United States

        Publication History

        Published: 22 June 2024
        Accepted: 01 June 2024
        Received: 24 February 2024

        Author Tags

        1. Time series clustering
        2. Scalable
        3. Random kernels
        4. Deep neural network

        Qualifiers

        • Research-article

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • 0
          Total Citations
        • 0
          Total Downloads
        • Downloads (Last 12 months)0
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 11 Dec 2024

        Other Metrics

        Citations

        View Options

        View options

        Login options

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media