ICML2023

Learning Mixtures of Gaussians with Censored Data

Wai Ming Tai, Bryon Aragam

1 citation

Abstract

We study the problem of learning mixtures of Gaussians with censored data. Statistical learning with censored data is a classical problem, with numerous practical applications, however, finite-sample guarantees for even simple latent variable models such as Gaussian mixtures are missing. Formally, we are given censored data from a mixture of univariate Gaussians i=1kwiN(μi,σ2),\sum_{i=1}^k w_i \mathcal{N}(\mu_i,\sigma^2), i.e. the sample is observed only if it lies inside a set SS. The goal is to learn the weights wiw_i and the means μi\mu_i. We propose an algorithm that takes only 1εO(k)\frac{1}{\varepsilon^{O(k)}} samples to estimate the weights wiw_i and the means μi\mu_i within ε\varepsilon error.