A Clustering Algorithm Based on Fitness Probability Scores for Cluster Centers Optimization

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 12953))

Included in the following conference series:

International Conference on Computational Science and Its Applications

1511 Accesses

Abstract

In the present paper, we propose an iterative clustering approach that sequentially applies five processes, namely: the assign, delete, split, delete and optimization. It is based on the fitness probability scores of the cluster centers to identify the least fitted centers to undergo an optimization process, aiming to improve the centers from one iteration to another. Moreover, the parameters of the algorithm for the delete, split and optimization processes are dynamically tuned as problem dependent functions. The presented clustering algorithm is evaluated using four data sets, two randomly generated and two well-known sets. The obtained clustering algorithm is compared with other clustering algorithms through the visualization of the clustering, the value of a validity measure and the value of the objective function of the optimization process. The comparison of results shows that the proposed clustering algorithm is effective and robust.

This work has been supported by FCT – Fundação para a Ciência e Tecnologia within the R&D Units Project Scope: UIDB/00013/2020 and UIDP/00013/2020 of CMAT-UM.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: GBP 19.95; Price includes VAT (United Kingdom)

eBook: GBP 67.99; Price includes VAT (United Kingdom)

Softcover Book: GBP 84.99; Price includes VAT (United Kingdom)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Improving the Clustering Algorithms Automatic Generation Process with Cluster Quality Indexes

Multi-start local search algorithm based on a novel objective function for clustering analysis

Article 06 April 2023

OPE-HCA: an optimal probabilistic estimation approach for hierarchical clustering algorithm

Article 05 August 2015

Notes

1.
https://www.mathworks.com/matlabcentral/fileexchange/52905-dbscan-clustering-algorithm.

References

MacQueen, J.B.: Some methods for classification and analysis of multivariate observations. In: Le Cam, L.M., Neyman, J. (eds.) Proceedings of the fifth Berkeley Symposium on Mathematical Statistics and Probability, University of California Press, vol. 1, pp. 281–297 (1967)
Google Scholar
Ester, M., Kriegel, H.-P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: KKD-1996 Proceedings, pp. 226–231. AAAI Press (1996)
Google Scholar
Mohammed, J.Z., Meira, Jr., W.: Data Mining and Machine Learning: Fundamental Concepts and Algorithms, 2nd (edn.). Cambridge University Press, Cambridge (2020)
Google Scholar
Mirkin, B.: Clustering For Data Mining: A Data Recovery Approach. Computer Science and Data Analysis Series. Chapman & Hall/CRC, Boca Raton (2005)
Google Scholar
Higham, D.J., Kalna, G., Kibble, M.: Spectral clustering and its use in bioinformatics. J. Comput. Appl. Math. 204, 25–37 (2007)
Article MathSciNet Google Scholar
Haraty, R.A., Dimishkich, M., Masud, M.: An enhanced k-means clustering algorithm for pattern discovery in healthcare data. Int. J. Distrib. Sens. Netw. 2015, Article ID 615740, p. 11 (2015)
Google Scholar
Sarkar, M., Yegnanarayana, B., Khemani, D.: A clustering algorithm using evolutionary programming-based approach. Pattern Recogn. Lett. 18, 975–986 (1997)
Article Google Scholar
Das, S., Abraham, A., Konar, A.: Automatic clustering using an improved differential evaluation algorithm. IEEE Trans. Syst. Man Cyber. Syst. 38, 218–237 (2008)
Article Google Scholar
Kwedlo, W.: A clustering method combining differential evolution with K-means algorithm. Pattern Recogn. Lett. 32, 1613–1621 (2011)
Article Google Scholar
Cura, T.: A particle swarm optimization approach to clustering. Expert Syst. Appl. 39, 1582–1588 (2012)
Article Google Scholar
Patel, K.G.K., Dabhi, V.K., Prajapati, H.B.: Clustering using a combination of particle swarm optimization and K-means. J. Intell. Syst. 26(3), 457–469 (2017)
Article Google Scholar
El-Shorbagy, M.A., Ayoub, A.Y., Mousa, A.A., El-Desoky, I.M.: An enhanced genetic algorithm with new mutation for cluster analysis. Comput. Stat. 34, 1355–1392 (2019)
Article MathSciNet Google Scholar
Ezugwu, A.E.-S., Agbaje, M.B., Aljojo, N., Els, R., Chiroma, H., Elaziz, M.A.: A comparative performance of hybrid firefly algorithms for automatic data clustering. IEEE Access 8, 121089–121118 (2020)
Article Google Scholar
He, Z., Yu, C.: Clustering stability-based evolutionary K-means. Soft. Comput. 23, 305–321 (2019)
Google Scholar
Memarsadeghi, N., Mount, D.M., Netanyahu, N.S., Le Moigne, J.: A fast implementation of the ISODATA clustering algorithm. Int. J. Computat. Geom. Appl. 17(1), 71–103 (2007)
Article MathSciNet Google Scholar
Prabha, K.A., Visalakshi, N.K.: Improved particle swarm optimization based K-means clustering. In: International Conference on Intelligent Computing Applications, pp. 59–63. IEEE Publisher CPS (2014). https://doi.org/10.1109/ICICA.2014.21
Heris, M.K.: Evolutionary Data Clustering in MATLAB. https://yarpiz.com/64/ypml101-evolutionary-clustering Yarpiz (2015)
Asvadi, A.: K-means Clustering Code. Department of ECE, SPR Lab., Babol (Noshirvani) University of Technology (2013). http://www.a-asvadi.ir/
Davies, D.L., Bouldin, D.W.: A cluster separation measure. IEEE Trans. Pattern Anal. Mach. Intell. PAMI-1(2), 224–227 (1979)
Google Scholar
Dua, D., Graff, C.: UCI machine learning repository http://archive.ics.uci.edu/ml. University of California, School of Information and Computer Science, Irvine (2019)

Download references

Acknowledgments

The authors wish to thank two anonymous referees for their comments and suggestions to improve the paper.

Author information

Authors and Affiliations

Centre of Mathematics, University of Minho, Campus de Gualtar, 4710-057, Braga, Portugal
M. Fernanda P. Costa
ALGORITMI Center, University of Minho, Campus de Gualtar, 4710-057, Braga, Portugal
Ana Maria A. C. Rocha & Edite M. G. P. Fernandes

Authors

M. Fernanda P. Costa
View author publications
You can also search for this author in PubMed Google Scholar
Ana Maria A. C. Rocha
View author publications
You can also search for this author in PubMed Google Scholar
Edite M. G. P. Fernandes
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to M. Fernanda P. Costa .

Editor information

Editors and Affiliations

University of Perugia, Perugia, Italy
Osvaldo Gervasi
University of Basilicata, Potenza, Potenza, Italy
Beniamino Murgante
Covenant University, Ota, Nigeria
Sanjay Misra
University of Cagliari, Cagliari, Italy
Chiara Garau
University of Cagliari, Cagliari, Italy
Ivan Blečić
Monash University, Clayton, VIC, Australia
David Taniar
Kyushu Sangyo University, Fukuoka, Japan
Bernady O. Apduhan
University of Minho, Braga, Portugal
Ana Maria A. C. Rocha
Polytechnic University of Bari, Bari, Italy
Eufemia Tarantino
Polytechnic University of Bari, Bari, Italy
Carmelo Maria Torre

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Costa, M.F.P., Rocha, A.M.A.C., Fernandes, E.M.G.P. (2021). A Clustering Algorithm Based on Fitness Probability Scores for Cluster Centers Optimization. In: Gervasi, O., et al. Computational Science and Its Applications – ICCSA 2021. ICCSA 2021. Lecture Notes in Computer Science(), vol 12953. Springer, Cham. https://doi.org/10.1007/978-3-030-86976-2_26

Download citation

DOI: https://doi.org/10.1007/978-3-030-86976-2_26
Published: 12 September 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-86975-5
Online ISBN: 978-3-030-86976-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

A Clustering Algorithm Based on Fitness Probability Scores for Cluster Centers Optimization

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Improving the Clustering Algorithms Automatic Generation Process with Cluster Quality Indexes

Multi-start local search algorithm based on a novel objective function for clustering analysis

OPE-HCA: an optimal probabilistic estimation approach for hierarchical clustering algorithm

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

A Clustering Algorithm Based on Fitness Probability Scores for Cluster Centers Optimization

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Improving the Clustering Algorithms Automatic Generation Process with Cluster Quality Indexes

Multi-start local search algorithm based on a novel objective function for clustering analysis

OPE-HCA: an optimal probabilistic estimation approach for hierarchical clustering algorithm

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation