Header menu link for other important links
X
EPTAS for k-means clustering of affine subspaces
E. Eiben, F.V. Fomin, P.A. Golovach, W. Lochet, , K. Simonov
Published in Association for Computing Machinery
2021
Pages: 2649 - 2659
Abstract
We consider a generalization of the fundamental k-means clustering for data with incomplete or corrupted entries. When data objects are represented by points in Rd, a data point is said to be incomplete when some of its entries are missing or unspecified. An incomplete data point with at most ∆ unspecified entries corresponds to an axis-parallel affine subspace of dimension at most ∆, called a ∆-point. Thus we seek a partition of n input ∆-points into k clusters minimizing the k-means objective. For ∆ = 0, when all coordinates of each point are specified, this is the usual k-means clustering. We give an algorithm that finds an (1 + ε)-approximate solution in time f(k,ε,∆) · n2 · d for some function f of k,ε, and ∆ only. © 2021 by SIAM
About the journal
JournalData powered by TypesetProceedings of the Annual ACM-SIAM Symposium on Discrete Algorithms
PublisherData powered by TypesetAssociation for Computing Machinery