WWW2026
Generative Regression Based Watch Time Prediction for Short-Video Recommendation
Hongxu Ma, Kai Tian, Tao Zhang, Xuefeng Zhang, Han Zhou, Chenghou Jin, Chunjie Chen, Han Li, Jihong Guan, Shuigeng Zhou
被引用 6 次
摘要
Watch time prediction (WTP) has emerged as a pivotal task in short video recommendation systems, designed to quantify user engagement through continuous interaction modeling. Predicting users' watch times on videos often encounters fundamental challenges, including wide value ranges and imbalanced data distributions, which can lead to significant estimation bias when directly applying regression techniques. Recent studies have attempted to address these issues by converting the continuous watch time estimation into an ordinal regression task. While these methods demonstrate partial effectiveness, they exhibit notable limitations: (1) The discretization process frequently relies on bucket partitioning, inherently reducing prediction flexibility and accuracy. (2) The interdependencies among different partition intervals remain underutilized, missing opportunities for effective error correction. Inspired by language modeling paradigms, we propose a novel Generative Regression (GR) framework that reformulates WTP as a sequence generation task. Our approach employs structural discretization to enable nearly lossless value reconstruction while maintaining prediction flexibility. Through carefully designed vocabulary construction and label encoding schemes, each watch time is bijectively mapped to a token sequence. To mitigate the training-inference discrepancy caused by teacher-forcing, we introduce a curriculum learning with embedding mixup strategy that gradually transitions from guided to free-generation modes. We test our models extensively on two public datasets, a large-scale offline industrial dataset, and an online A/B test on Kuaishou App with over 400 million daily active users (DAU) and GR consistently outperforms existing state-of-the-art approaches significantly. Our code is available at https://github.com/snailma0229/GR.git.