NeurIPS2022
Linear Label Ranking with Bounded Noise
Dimitris Fotakis, Alkis Kalavasis, Vasilis Kontonis, Christos Tzamos
3 citations
Abstract
Label Ranking (LR) is the supervised task of learning a sorting function that maps feature vectors ๐ฅ โ R ๐ to rankings ๐ ( ๐ฅ ) โ S ๐ over a finite set of ๐ labels. We focus on the fundamental case of learning linear sorting functions (LSFs) under Gaussian marginals: ๐ฅ is sampled from the ๐ -dimensional standard normal and the ground truth ranking ๐ โ ( ๐ฅ ) is the ordering induced by sorting the coordinates of the vector ๐ โ ๐ฅ , where ๐ โ โ R ๐ ร ๐ is unknown. We consider learning LSFs in the presence of bounded noise: assuming that a noiseless example is of the form ( ๐ฅ , ๐ โ ( ๐ฅ )) , we observe ( ๐ฅ , ๐ ) , where for any pair of elements ๐ ฬธ = ๐ , the probability that the order of ๐, ๐ is different in ๐ than in ๐ โ ( ๐ฅ ) is at most ๐ < 1 / 2 . We design efficient non-proper and proper learning algorithms that learn hypotheses within normalized Kendallโs Tau distance ๐ from the ground truth with ๐ = ฬ๏ธ ๐ ( ๐ log( ๐ ) /๐ ) labeled examples and runtime poly ( ๐, ๐ ) . For the more challenging top-๐ disagreement loss, we give an efficient proper learning algorithm that achieves ๐ top-๐ disagreement with the ground truth with ๐ = ฬ๏ธ ๐ ( ๐๐๐/๐ ) samples and poly ( ๐ ) runtime.