NeurIPS2025
Optimal and Provable Calibration in High-Dimensional Binary Classification: Angular Calibration and Platt Scaling
Yufan Li, Pragya Sur
2 citations
Abstract
We study the fundamental problem of calibrating a linear binary classifier of the form , where the feature vector is Gaussian, is a link function, and is an estimator of the true linear weight . By interpolating with a noninformative , we construct a well-calibrated predictor whose interpolation weight depends on the angle between the estimator and the true linear weight . We establish that this angular calibration approach is provably well-calibrated in a high-dimensional regime where the number of samples and features both diverge, at a comparable rate. The angle can be consistently estimated. Furthermore, the resulting predictor is uniquely , minimizing the Bregman divergence to the true label distribution within a suitable class of calibrated predictors. Our work is the first to provide a calibration strategy that satisfies both calibration and optimality properties provably in high dimensions. Additionally, we identify conditions under which a classical Platt-scaling predictor converges to our Bregman-optimal calibrated solution. Thus, Platt-scaling also inherits these desirable properties provably in high dimensions.