Tang, Jingwen; Shi, Cong; Duenyas, Izak - 2022
demand distributions are unknown, based on the structure of optimal policies, we propose an online learning algorithm, termed … $O(\sqrt{T\log T})$, which matches the lower bound for any learning algorithms, up to a logarithmic factor. The novelties …