ICLR2026

ProReGen: Progressive Residual Generation under Attribute Correlations

Ruby Shrestha, Ajay Gopi, Casey Meisenzahl, Bipin Lekhak, Linwei Wang

Abstract

Attribute correlations in the training data will compromise the ability of a deep generative model (DGM) to synthesize images with under-represented attribute combinations (i.e.,\textit{i.e.,} minority samples). Existing approaches mitigate this by data re-sampling to remove attribute correlations seen by the DGM, using a classifier to provide pseudo-supervision\textit{pseudo-supervision} on generated counterfactual samples, or incorporating inductive bias to explicitly decompose the generation into independent sub-mechanisms. We present ProReGen, a progressive residual generation\textit{progressive residual generation} approach inspired by the classical Robinson's transformation, to partial out from an image attribute x2\mathbf{x}_2 its component m(x1)m(\mathbf{x}_1) that is predictable by other image attributes x1\mathbf{x}_1, and the residual γ=x2m(x1)\gamma = \mathbf{x}_2 - m(\mathbf{x}_1) that is not. This simplifies the problem of learning a DGM g(x1,x2)g(\mathbf{x}_1, \mathbf{x}_2) conditioned on correlated inputs, to learning g~(x1,γ)\tilde{g}(\mathbf{x}_1, \gamma) conditioned on orthogonal inputs. It further allows us to progressively learn g~\tilde{g} by first shifting the burden to abundant majority samples to learn g~(x1,γ=0)\tilde{g}(\mathbf{x}_1, \gamma = 0), and then expanding it with additional layers g_resg\_{\text{res}} to resolve its difference to g~(x1,γ)\tilde{g}(\mathbf{x}_1, \gamma) using residual attribute γ\gamma on limited minority samples. On three benchmark datasets with curated varying strengths of attribute correlation and one dataset with natural attribute correlation, we demonstrate that ProReGen---with input orthogonalization and progressive residual learning---improved the correctness of minority generations compared to existing strategies.