NeurIPS2022

JAWS: Auditing Predictive Uncertainty Under Covariate Shift

Drew Prinster, Anqi Liu, Suchi Saria

17 citations

Abstract

We propose JAWS, a series of wrapper methods for distribution-free uncertainty quantification tasks under covariate shift, centered on the core method JAW, the JAckknife+ Weighted with data-dependent likelihood-ratio weights. JAWS also includes computationally efficient Approximations of JAW using higher-order influence functions: JAWA. Theoretically, we show that JAW relaxes the jackknife+'s assumption of data exchangeability to achieve the same finite-sample coverage guarantee even under covariate shift. JAWA further approaches the JAW guarantee in the limit of the sample size or the influence function order under common regularity assumptions. Moreover, we propose a general approach to repurposing predictive interval-generating methods and their guarantees to the reverse task: estimating the probability that a prediction is erroneous, based on user-specified error criteria such as a safe or acceptable tolerance threshold around the true label. We then propose JAW-E and JAWA-E as the repurposed proposed methods for this Error assessment task. Practically, JAWS outperform state-of-the-art predictive inference baselines in a variety of biased real world data sets for interval-generation and error-assessment predictive uncertainty auditing tasks. Guarantee (under covariate shift) Method Task Finite sample Asymptotic Avoids retraining JAW Interval generation JAWA Interval generation JAW-E Error assessment JAWA-E Error assessment