ICML 2025
Action-Dependent Optimality-Preserving Reward Shaping
Grant C. Forbes, Jianxun Wang, Leonardo Villalobos-Arias, Arnav Jhala, David L. Roberts
Abstract
Recent RL research has utilized reward shaping, particularly complex shaping rewards such as intrinsic motivation (IM), to encourage agent exploration in sparse-reward environments. While