Settling Time vs. Accuracy Tradeoffs for Clustering Big Data
Recommendations
A framework for statistical clustering with constant time approximation algorithms for K-median and K-means clustering
We consider a framework of sample-based clustering. In this setting, the input to a clustering algorithm is a sample generated i.i.d. by some unknown arbitrary distribution. Based on such a sample, the algorithm has to output a clustering of the full ...
On the k-means/median cost function
Highlights
- We try to understand how the optimal k-means cost behaves as a function of k (the number of centers/clusters).
- We show that D² sampling is a useful method for designing pseudo-approximation algorithms and movement-based coresets for ...
Abstract
In this work, we study the k-means cost function. Given a dataset $X \subseteq \mathbb{R}^d$ and an integer $k$, the goal of the Euclidean k-means problem is to find a set of $k$ centers $C \subseteq \mathbb{R}^d$ such that $\Phi(C, X) \equiv \sum_{x \in X} \min_{c \in C} \|x - c\|^2$ is minimized. Let ...
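As an illustration of the cost function Φ(C, X) in the abstract above, the following is a minimal pure-Python sketch, not code from the paper. The function names `kmeans_cost` and `d2_sample` and the toy data are assumptions made for this example. The second function shows one common reading of D² sampling: drawing a point with probability proportional to its squared distance to the nearest current center.

```python
import random


def kmeans_cost(centers, points):
    """Return Phi(C, X): the sum over all points of the squared
    Euclidean distance to the nearest center."""
    total = 0.0
    for x in points:
        total += min(
            sum((xi - ci) ** 2 for xi, ci in zip(x, c)) for c in centers
        )
    return total


def d2_sample(centers, points, rng):
    """Draw one point with probability proportional to its squared
    distance to the nearest center (hypothetical helper).
    Raises ValueError if every point coincides with a center,
    because all weights are then zero."""
    weights = [kmeans_cost(centers, [x]) for x in points]
    return rng.choices(points, weights=weights, k=1)[0]


# Toy data (assumed for illustration): two tight pairs of points in R^2.
X = [(0.0, 0.0), (0.0, 1.0), (10.0, 0.0), (10.0, 1.0)]
C = [(0.0, 0.5), (10.0, 0.5)]

cost = kmeans_cost(C, X)  # each point is 0.5 away from its nearest center
next_center = d2_sample([(0.0, 0.0)], X, random.Random(0))
```

With a single center at the origin, the far pair of points carries almost all of the sampling weight, which is why this kind of sampling tends to spread centers across clusters.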
RK-Means Clustering: K-Means with Reliability
This paper presents an RK-means clustering algorithm which is developed for reliable data grouping by introducing a new reliability evaluation to the K-means clustering algorithm. The conventional K-means clustering algorithm has two shortfalls: 1) the ...
Information
Publisher
Association for Computing Machinery
New York, NY, United States
Qualifiers
- Research-article
Funding Sources
- European Union's Horizon 2020 research and innovation programme