NeurIPS2022

Linear Label Ranking with Bounded Noise

Dimitris Fotakis, Alkis Kalavasis, Vasilis Kontonis, Christos Tzamos

3 citations

Abstract

Label Ranking (LR) is the supervised task of learning a sorting function that maps feature vectors ๐‘ฅ โˆˆ R ๐‘‘ to rankings ๐œŽ ( ๐‘ฅ ) โˆˆ S ๐‘˜ over a finite set of ๐‘˜ labels. We focus on the fundamental case of learning linear sorting functions (LSFs) under Gaussian marginals: ๐‘ฅ is sampled from the ๐‘‘ -dimensional standard normal and the ground truth ranking ๐œŽ โ‹† ( ๐‘ฅ ) is the ordering induced by sorting the coordinates of the vector ๐‘Š โ‹† ๐‘ฅ , where ๐‘Š โ‹† โˆˆ R ๐‘˜ ร— ๐‘‘ is unknown. We consider learning LSFs in the presence of bounded noise: assuming that a noiseless example is of the form ( ๐‘ฅ , ๐œŽ โ‹† ( ๐‘ฅ )) , we observe ( ๐‘ฅ , ๐œ‹ ) , where for any pair of elements ๐‘– ฬธ = ๐‘— , the probability that the order of ๐‘–, ๐‘— is different in ๐œ‹ than in ๐œŽ โ‹† ( ๐‘ฅ ) is at most ๐œ‚ < 1 / 2 . We design efficient non-proper and proper learning algorithms that learn hypotheses within normalized Kendallโ€™s Tau distance ๐œ– from the ground truth with ๐‘ = ฬƒ๏ธ€ ๐‘‚ ( ๐‘‘ log( ๐‘˜ ) /๐œ– ) labeled examples and runtime poly ( ๐‘, ๐‘˜ ) . For the more challenging top-๐‘Ÿ disagreement loss, we give an efficient proper learning algorithm that achieves ๐œ– top-๐‘Ÿ disagreement with the ground truth with ๐‘ = ฬƒ๏ธ€ ๐‘‚ ( ๐‘‘๐‘˜๐‘Ÿ/๐œ– ) samples and poly ( ๐‘ ) runtime.