WWW2026

A Frequency-Aware Mixture of Heterogeneous Experts Framework for Knowledge Tracing

Youheng Bai, Mingliang Hou, Teng Guo, Zitao Liu, Weiqi Luo

Abstract

Knowledge tracing (KT) aims to personalize online education on large-scale web-based platforms by modeling students' evolving knowledge states from their interaction sequences. However, most KT models rely on a single encoder architecture (e.g., self-attention or RNN), with fixed inductive biases that fails to capture the diversity of learning behaviors. Specifically, student learning unfolds across multiple timescales, and interaction sequences contain diverse frequency components ranging from short-term variations to long-term trends. Our data-driven analysis reveals that existing encoders exhibit characteristic frequency biases (e.g., self-attention tends to emphasize low-frequency patterns), highlighting the limitations of any single architecture. To address this problem, we propose FA-KT, a frequency-aware mixture of heterogeneous experts framework. FA-KT combines self-attention, Mamba, CNN, and LSTM experts, each with complementary frequency biases. A frequency-aware router analyzes each sequence's frequency characteristics and adaptively combines experts to create dynamic, personalized encoders for individual students. Across five benchmark datasets, FA-KT consistently outperforms 20 strong KT baselines in predicting future performance. Code is available at https://pykt.org/.