Abstract
Considering the literature and the importance of using of the metaheuristic techniques in time series data mining tasks, especially time series clustering (TsC), it seems lack of a comparative study of such techniques in terms of efficiency for TsC problem. Hence, we try to offer the possibility of theoretical and empirical evaluation of metaheuristic techniques in TsC problem. In fact, we follow two main goals by performing this theoretical and empirical evaluation. These goals include: at the first, we would like to prove the effective role of metaheuristic techniques to enhance the efficiency of TsC algorithms, eliminate some challenges of such algorithms (e.g., sensitivity to initialization of some primary parameters), and indicates the popularity of their use in TsC problems during the last years because of their characteristics. Second, we would like to offer the possibility of a comparison between some metaheuristic techniques to analyze the percentage of their effectiveness for enhancing the accuracy of clustering results in TsC problems. The comparative analysis of results of the empirical evaluation for ten standard time series data sets collected from the UCR time series data sets repository show three keys conclusions: (1) generally, metaheuristic techniques have provided reasonable results and significant improvement in terms of efficiency in TsC problem in all experiments due to the main characteristic of metaheuristics that is finding an approximate solution more quickly. But, it is concluded that fuzzy metaheuristic techniques based on population solution class (e.g., FATPSO and FPSO) to provide better results versus single solution class in TsC algorithms. (2) It is deducted that consuming maximum computational time presented by fuzzy metaheuristic techniques in comparison to other optimization techniques in the performed experiments. (3) The paper has proposed that a hybrid of fuzzy metaheuristics (e.g., FATPSO) and the base TsC algorithms can be considered as a more adequate choice to present better results in terms of accuracy in different application areas.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Gogna A, Tayal A (2013) Metaheuristics: review and application. J Exp Theor Artif Intell 25(4):503–526
Tsai CW, Rodrigues J (2014) Metaheuristic scheduling for cloud: a survey. IEEE Syst J 8(1):279–9
Talbi El-Ghazali (2009) Metaheuristics: from design to implementation, 1st edn. Wiley, Chichester
Abualigah L (2020) Multi-verse optimizer algorithm: a comprehensive survey of its results, variants, and applications. Neural Comput Appl 32:12381–12401. https://doi.org/10.1007/s00521-020-04839-1
Abualigah L, Diabat A, Geem ZW (2020) A comprehensive survey of the harmony search algorithm in clustering applications. Appl Sci 10(11):3827
Abualigah L, Shehab M, Alshinwan M et al (2020) Ant lion optimizer: a comprehensive survey of its variants and applications. Arch Computat Methods Eng. https://doi.org/10.1007/s11831-020-09420-6
Fu TC (2011) A review on time series data mining. Eng Appl Artif Intell 24(1):164–181
Izakian Z, Mesgari M (2015) Fuzzy clustering of time series data: a particle swarm optimization approach. J AI Data Min 3(1):39–46
Pérez-Ortiz M, Durán-Rosal AM, Gutiérrez PA, Sánchez-Monedero J, Nikolaou A, Fernández-Navarro F, Hervás-Martínez C (2019) On the use of evolutionary time series analysis for segmenting paleoclimate data. Neurocomputing 326:3–14
Mehrmolaei S, Keyvanpour MR (2017) Fuzzy-clustering time series: population-based an enhanced technique. In: Proceeding of IEEE conference on artificial intelligence and robotics (IRANOPEN), pp 84–90
Deb S, Tian Z, Fong S, Wong R, Millham R, Wong KK (2018) Elephant search algorithm applied to data clustering. Soft Comput 22(18):6035–6046
Chou CH, Hsieh SC, Qiu CJ (2017) Hybrid genetic algorithm and fuzzy clustering for bankruptcy prediction. Appl Soft Comput 56:298–316
Islam MZ, Estivill-Castro V, Rahman MA, Bossomaier T (2018) Combining K-Means and a genetic algorithm through a novel arrangement of genetic operators for high quality clustering. Expert Syst Appl 91:402–417
Abubakar AM, Man-Yahya S (2019) CAT swarm optimization based clustering algorithm for fuzzy time series forecasting. Arid Zone J Eng Technol Environ 15(1):40–54
Alonso AM, Peña D (2019) Clustering time series by linear dependency. Stat Comput 29(4):655–676
Silva Filho TM, Pimentel BA, Souza RM, Oliveira AL (2015) Hybrid methods for fuzzy clustering based on fuzzy c-means and improved particle swarm optimization. Expert Syst Appl 42(17–18):6315–6328
Navajas-Guerrero A, Manjarres D, Portillo E, Landa-Torres I (2019, May) A novel heuristic approach for the simultaneous selection of the optimal clustering method and its internal parameters for time series data. In: International workshop on soft computing models in industrial and environmental applications, pp 179–189. Springer, Cham
Ramadas M, Abraham A (2019) Metaheuristics and data clustering. In: Metaheuristics for data clustering and image segmentation, pp 7–55. Springer, Cham
Menéndez HD, Otero FE, Camacho D (2016) Medoid-based clustering using ant colony optimization. Swarm Intell 10(2):123–145
Oduntan OI, Thulasiraman P (2018, November) Hybrid metaheuristic algorithm for clustering. In: 2018 IEEE symposium series on computational intelligence (SSCI), pp 1–9. IEEE
Kushwaha N, Pant M (2020) Fuzzy particle swarm page rank clustering algorithm. In: Soft computing: theories and applications, pp 895–904. Springer, Singapore
Panapakidis IP, Christoforidis GC (2019) Exploring the potential of metaheuristics-based clustering in load profiling applications. In: 2019 1st international conference on energy transition in the mediterranean area (SyNERGY MED), pp 1–6. IEEE
Anderson OD (1977) The Box-Jenkins approach to time series analysis. RAIRO Oper Res 11(1):3–29
Mehrmolaei S, Keyvanpour MR (2018) TsP-SA: usage of time series techniques on healthcare data. Int J Electron Healthc 10(3):190–230
Keogh E (2008) Indexing and mining time series data. Encyclopedia of GIS, pp 493–497
Aghabozorgi S, Shirkhorshidi AS, Wah TY (2015) Time-series clustering: a decade review. Inform Syst 53:16–38
Guijo-Rubio D, Durán-Rosal AM, Gutiérrez PA, Troncoso A, Hervás-Martínez C (2020) Time-series clustering based on the characterization of segment typologies. In: IEEE transactions on cybernetics
Li H (2019) Multivariate time series clustering based on common principal component analysis. Neurocomputing 349:239–247
Sheikholeslami G, Chatterjee S, Zhang A (1998) Wavecluster: a multi-resolution clustering approach for very large spatial databases. VLDB 98:428–439
Corduas M, Piccolo D (2008) Time series clustering and classification by the autoregressive metric. Comput Stat Data Anal 52(4):1860–1872
Jahanbakhsh A, Seyedzadegan M (2016) An improved method of fuzzy c-means clustering by using feature selection and weighting. Int J Comput Sci Netw Secur 16(10):64–69
Wang X, Wang Y, Wang L (2004) Improving fuzzy c-means clustering based on feature-weight learning”. Pattern Recogn Lett 25(10):1123–1132
Lee CH, Su, YY, Lin YC, Lee SJ (2017) Time series forecasting based on weighted clustering. In: 2017 2nd IEEE international conference on computational intelligence and applications (ICCIA), pp 421–425. IEEE
Hathaway RJ, Bezdek JC (2003) Visual cluster validity for prototype generator clustering models. Pattern Recogn Lett 24(9–10):1563–1569
Izakian H, Pedrycz W, Jamal I (2015) Fuzzy clustering of time series data using dynamic time warping distance. Eng Appl Artif Intell 39:235–244
Halkidi M, Batistakis Y, Vazirgiannis M (2003) Clustering validity checking methods: part ii
Kotsakos D, Trajcevski G, Gunopulos D, Aggarwal CC (2013) time-series data clustering, books. google.com. Chapter 15:357–380
Mori U, Mendiburu A, Lozano JA (2015) Similarity measure selection for clustering time series databases. IEEE Trans Knowl Data Eng 28(1):181–195
Tan PN (2018) Introduction to data mining. Pearson Education India, Chennai
Moosavi SR, Qajar J, Riazi M (2018) A comparison of methods for denoising of well test pressure data. J Petrol Explor Prod Technol 8(4):1519–1534
Ali M, Jones MW, Xie X, Williams M (2019) TimeCluster: dimension reduction applied to temporal data for visual analytics. Vis Comput 35(6–8):1013–1026
Van Der Maaten L, Postma E, Van den Herik J (2009) Dimensionality reduction: a comparative. J Mach Learn Res 10(66–71):13
Yu P, Li Y, Lin H, Wu N (2016) Removal of random noise in seismic data by time-varying window-length time-frequency peak filtering. Acta Geophys 64(5):1703–1714
Kane A, Shiri N (2017) Multivariate time series representation and similarity search using pca. Ind Conf Data Min. Springer, Cham, pp 122–136
Luchian H, Breaban ME, Bautu A (2015) On meta-heuristics in optimization and data analysis. Application to geosciences. In: Artificial intelligent approaches in petroleum geosciences, pp 53–100. Springer, Cham
Asih AMS, Sopha BM, Kriptaniadewa G (2017) Comparison study of metaheuristics: empirical application of delivery problems. Int J Eng Bus Manag 9:1–12. https://doi.org/10.1177/1847979017743603
Tsai CW, Chiang MC, Ksentini A, Chen M (2016) Metaheuristic algorithms for healthcare: open issues and challenges. Comput Electr Eng 53:421–434
Razavi SH, Ebadati EOM, Asadi S, Kaur H (2015) An efficient grouping genetic algorithm for data clustering and big data analysis. Comput Intell Big Data Anal. Springer, Cham, pp 119–142
Mehrmolaei S, Keyvanpour MR (2019) an enhanced hybrid model for event prediction in healthcare time series. Int J Knowl Based Intell Eng Syst 23:131–147
Niknam T, Amiri B (2010) An efficient hybrid approach based on PSO, ACO and k-means for cluster analysis. Appl Soft Comput 10(1):183–197
Khosravanian R, Mansouri V, Wood DA, Alipour MR (2018) A comparative study of several metaheuristic algorithms for optimizing complex 3-D well-path designs. J Petrol Explor Prod Technol 8(4):1487–1503
Ilango SS, Vimal S, Kaliappan M, Subbulakshmi P (2018) Optimization using artificial bee colony based clustering approach for big data. Cluster Comput, pp 1–9
Antosiewicz M, Koloch G, Kamiński B (2013) Choice of best possible metaheuristic algorithm for the travelling salesman problem with limited computational time: quality, uncertainty and speed. J Theor Appl Comput Sci 7(1):46–55
Liu H, Abraham A (2005) Fuzzy adaptive turbulent particle swarm optimization. In: Fifth international conference on hybrid intelligent systems (HIS’05), p 6. IEEE
Livada I, Synnefa A, Haddad S, Paolini R, Garshasbi S, Ulpiani G, Fiorito F, Vassilakopoulou K, Osmond P, Santamouris M (2019) Time series analysis of ambient air-temperature during the period 1970–2016 over Sydney, Australia. Sci Total Environ 648:1627–1638
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Mehrmolaei, S., Keyvanpour, M.R. & Savargiv, M. Metaheuristics on time series clustering problem: theoretical and empirical evaluation. Evol. Intel. 15, 329–348 (2022). https://doi.org/10.1007/s12065-020-00511-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12065-020-00511-8