ICML2023

Robust Situational Reinforcement Learning in Face of Context Disturbances

Jinpeng Zhang, Yufeng Zheng, Chuheng Zhang, Li Zhao, Lei Song, Yuan Zhou, Jiang Bian

被引用 5 次

摘要

Motivation • Context variable: the dynamic and uncontrollable environmental factor in many real-world tasks • E.g., Inventory Control and Adaptive Cruise Control (ACC): Context variables are the customer demand and speed of lead car, respectively, which are independent of agent's action, and have large uncertainty Sudden brake of the lead car