This thesis introduces optimized algorithms for k-means++ initialization and Lloyd's algorithm, leveraging sorted data, prefix sums, and binary search for improved computational performance.
The main contributions are: (1) an optimized k-cluster algorithm achieving O(l * k^2 * log n) complexity for greedy k-means++ initialization and O(i * k * log n) for Lloyd's algorithm, and (2) a binary search-based two-cluster algorithm, achieving O(log n) runtime with deterministic convergence to a Lloyd's algorithm local minimum.
Benchmarks demonstrate over 4500x speedup compared to scikit-learn for large datasets while maintaining clustering quality measured by within-cluster sum of squares (WCSS).
The algorithms also achieve a 300x speedup in an LLM quantization task, highlighting their utility in emerging applications.