ICML2025

Maximum Total Correlation Reinforcement Learning

Bang You, Puze Liu, Huaping Liu, Jan Peters, Oleg Arenz

摘要

RPC LZ-SAC MTC (ours) SAC Action State Figure 1. Maximizing the total correlation within trajectories results in more consistent behavior. As shown in our experiments, this consistency increases robustness to noise and dynamics changes.