Computer Science > Machine Learning

arXiv:2302.11296 (cs)

[Submitted on 22 Feb 2023]

Title:Refining a $k$-nearest neighbor graph for a computationally efficient spectral clustering

Authors:Mashaan Alshammari, John Stavrakakis, Masahiro Takatsuka

View PDF

Abstract:Spectral clustering became a popular choice for data clustering for its ability of uncovering clusters of different shapes. However, it is not always preferable over other clustering methods due to its computational demands. One of the effective ways to bypass these computational demands is to perform spectral clustering on a subset of points (data representatives) then generalize the clustering outcome, this is known as approximate spectral clustering (ASC). ASC uses sampling or quantization to select data representatives. This makes it vulnerable to 1) performance inconsistency (since these methods have a random step either in initialization or training), 2) local statistics loss (because the pairwise similarities are extracted from data representatives instead of data points). We proposed a refined version of $k$-nearest neighbor graph, in which we keep data points and aggressively reduce number of edges for computational efficiency. Local statistics were exploited to keep the edges that do not violate the intra-cluster distances and nullify all other edges in the $k$-nearest neighbor graph. We also introduced an optional step to automatically select the number of clusters $C$. The proposed method was tested on synthetic and real datasets. Compared to ASC methods, the proposed method delivered a consistent performance despite significant reduction of edges.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:2302.11296 [cs.LG]
	(or arXiv:2302.11296v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2302.11296
Journal reference:	Pattern Recognition, Volume 114, 2021
Related DOI:	https://doi.org/10.1016/j.patcog.2021.107869

Submission history

From: Mashaan Alshammari Dr. [view email]
[v1] Wed, 22 Feb 2023 11:31:32 UTC (2,491 KB)

Computer Science > Machine Learning

Title:Refining a $k$-nearest neighbor graph for a computationally efficient spectral clustering

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Refining a $k$-nearest neighbor graph for a computationally efficient spectral clustering

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators