Abstract
Clustering has been used extensively for exploratory data analysis. GK clustering algorithm can provide a data partition that is more meaningful than the standard fuzzy c-means and its variants. In this paper we propose a novel approach towards fuzzy clustering which reduces the processing time significantly while keeping the results highly accurate. It is a matrix based approach using the concept of equivalent samples and the weighting samples. Equivalence is measured in terms of proximity of the samples and then weighted samples are used as an input to the modified GK clustering algorithm. Objective function and validation index estimates are used to assess the goodness of partition. Experimental results are shown to emphasize the benefits of the proposed technique in domains like Telecom where we have massive data sets to be processed for real time clustering and recommendation engines.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Döring, C., Lesot, M.-J., Kruse, R.: Data Analysis with fuzzy clustering methods. Computational Statistics and Data Analysis, 192–214 (2006)
Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press (1981)
Zadeh, I.A.: Fuzzy Sets. Information Control 8, 338–353
Jain, A.K., Murty, M.N., Flynn, P.J.: Data Clustering: A Review. ACM Computing Surveys 31(3) (1999)
Gustafson, D.E., Kessel, W.C.: Fuzzy Clustering with fuzzy covariance matrix. In: Proc IEEE CDC, San Diego,CA, USA, pp. 761–766 (1979)
Dunn, J.C.: A Fuzzy relative of the ISODATA process and its use in detecting compact well separated cluster. Journal of Cybernetics 3, 32–57
Borgelt, C.: Accelerating Fuzzy Clustering. Information Sciences 179(23), 3985–3997 (2009)
Havens, T.C., Chitta, R., Jain, A.K., Jin, R.: Speedup of fuzzy and possibilistic kernel c- means for large-scale clustering. In: International Conference on Fuzzy systems. IEEE (2011)
Arfaoui, O., Sassi, M.: Fuzzy Clustering of large scale data sets using Principal Component Analysis. In: International Conference on Fuzzy Systems. IEEE (2011)
Kannan, S.R., Ramathilagam, S., Chung, P.C.: Effective fuzzy c-means clustering algorithms for data clustering problems. Expert Systems with Applications (2011)
Gu, Y., Hall, L.O., Goldgof, D.B.: Evaluating Scalable fuzzy clustering. In: International Conference on Systems man and Cybernetics. IEEE (2010)
Yue, S., Wei, M., Wang, J.-S., Wang, H.: A general grid- clustering approach. Pattern Recognition Letters 29 (2008)
Lei, G., Yu, X., Yang, X., Chen, S.: An Incremental Clustering Algorithm Based on Grid. In: Eighth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD). IEEE (2011)
Rickard, J.T., Yager, R.R., Miller, W.: Mountain Clustering on Nonuniform Grids. In: Proceedings of the 33rd Applied Imagery Pattern Recognition Workshop (AIPR 2004). IEEE (2004)
Yager, R.R., Filev, D.P.: Approximate Clustering Via the Mountain Method. IEEE Transactions on Systems, Man and Cybernetics 24(8) (1994)
Velthuizen, R.P., Hall, L.O., Clarke, L.P., Silbiger, M.L.: An Investigation of Mountain Method Clustering for Large data sets. Pattern Recognition (1997)
Abonyi, J., Balasko, B., Feil, B.: Fuzzy Clustering and Data Analysis Toolbox, http://www.fmt.vein.hu/softcomp/fclusttoolboc/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Aggarwal, G., Bhatia, M.P.S. (2012). A Novel Faster Approximate Fuzzy Clustering Approach with Highly Accurate Results. In: Parashar, M., Kaushik, D., Rana, O.F., Samtaney, R., Yang, Y., Zomaya, A. (eds) Contemporary Computing. IC3 2012. Communications in Computer and Information Science, vol 306. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32129-0_25
Download citation
DOI: https://doi.org/10.1007/978-3-642-32129-0_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32128-3
Online ISBN: 978-3-642-32129-0
eBook Packages: Computer ScienceComputer Science (R0)