Selected Publications
.
2023.
Offline Planning and Online Learning under Recovering Rewards. Management Science.
.
2023. Stochastic Multi-armed Bandits: Optimal Trade-off among Optimality, Consistency, and Tail Risk. NeurIPS 2023 Spotlight (top 3%).
.
2023. A Simple and Optimal Policy Design with Safety against Heavy-tailed Risk for Multi-armed Bandits. To appear in NeurIPS 2022.
.
2022. Dynamic Planning and Learning under Recovering Rewards. To appear in ICML 2021.
.
2021.