ICML2025
Maximum Total Correlation Reinforcement Learning
Bang You, Puze Liu, Huaping Liu, Jan Peters, Oleg Arenz
摘要
RPC LZ-SAC MTC (ours) SAC Action State Figure 1. Maximizing the total correlation within trajectories results in more consistent behavior. As shown in our experiments, this consistency increases robustness to noise and dynamics changes.