ICML2023

Approximation Algorithms for Fair Range Clustering

Sèdjro Salomon Hotegni, Sepideh Mahabadi, Ali Vakilian

25 citations

Abstract

This paper studies the fair range clustering problem in which the data points are from different demographic groups and the goal is to pick kk centers with the minimum clustering cost such that each group is at least minimally represented in the centers set and no group dominates the centers set. More precisely, given a set of nn points in a metric space (P,d)(P,d) where each point belongs to one of the \ell different demographics (i.e., P=P1P2PP = P_1 \uplus P_2 \uplus \cdots \uplus P_\ell) and a set of \ell intervals [α1,β1],,[α,β][\alpha_1, \beta_1], \cdots, [\alpha_\ell, \beta_\ell] on desired number of centers from each group, the goal is to pick a set of kk centers CC with minimum p\ell_p-clustering cost (i.e., (vPd(v,C)p)1/p(\sum_{v\in P} d(v,C)^p)^{1/p}) such that for each group ii\in \ell, CPi[αi,βi]|C\cap P_i| \in [\alpha_i, \beta_i]. In particular, the fair range p\ell_p-clustering captures fair range kk-center, kk-median and kk-means as its special cases. In this work, we provide efficient constant factor approximation algorithms for fair range p\ell_p-clustering for all values of p[1,)p\in [1,\infty).