ICML2025

Tracking Most Significant Shifts in Infinite-Armed Bandits

Joe Suk, Jung-hun Kim

摘要

We study an infinite-armed bandit problem where actions' mean rewards are initially sampled from a reservoir distribution. Most prior works in this setting focused on stationary rewards (