Cited By
Saeed M., Al Aghbari Z., Alsharidah M. (2020). Big data clustering techniques based on Spark: a literature review. PeerJ Computer Science, 6, e321. doi:10.7717/peerj-cs.321. Online publication date: 30-Nov-2020.
Clustering is a popular unsupervised data mining technique that has been applied in various data mining and big data applications. Efficient clustering algorithms and implementation techniques are key to coping with the scalability and performance ...
To improve the efficiency of a personnel matching system, this paper proposes a k-means clustering algorithm based on the Spark platform to build the personnel matching model; the Spark platform completes the clustering iterative operation in the ...
In the field of data mining, clustering is one of the important methods. K-Means is a typical distance-based clustering algorithm; 2-tier clustering should implement scalable clustering by means of dividing, sampling and knowledge integrating. Among ...
Association for Computing Machinery
New York, NY, United States