Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning.
CoRR, 2024
Learning in Safety-critical, Lifelong, and Multi-agent Systems: Bandits and RL Approaches
PhD thesis, 2023
Scaling Distributed Multi-task Reinforcement Learning with Experience Sharing.
CoRR, 2023
Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost.
Proceedings of the International Conference on Machine Learning, 2023
Provably Efficient Lifelong Reinforcement Learning with Linear Representation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Provably Efficient Lifelong Reinforcement Learning with Linear Function Approximation.
CoRR, 2022
Doubly Pessimistic Algorithms for Strictly Safe Off-Policy Optimization.
Proceedings of the 56th Annual Conference on Information Sciences and Systems, 2022
Safe Linear Thompson Sampling With Side Information.
IEEE Trans. Signal Process., 2021
UCB-based Algorithms for Multinomial Logistic Regression Bandits.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Regret Bounds for Safe Gaussian Process Bandit Optimization.
Proceedings of the IEEE International Symposium on Information Theory, 2021
Safe Reinforcement Learning with Linear Function Approximation.
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 55th Annual Conference on Information Sciences and Systems, 2021
Decentralized Multi-Agent Linear Bandits with Safety Constraints.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
Regret Bound for Safe Gaussian Process Bandit Optimization.
Proceedings of the 2nd Annual Conference on Learning for Dynamics and Control, 2020
Generalized Linear Bandits with Safety Constraints.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Safe Linear Thompson Sampling.
CoRR, 2019
Linear Stochastic Bandits Under Safety Constraints.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019