PhD thesis, 2023

Scaling Distributed Multi-task Reinforcement Learning with Experience Sharing.

[DOI]

CoRR, 2023

Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost.

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Provably Efficient Lifelong Reinforcement Learning with Linear Representation.

[DOI]

Lin Yang

Ching-An Cheng

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

Provably Efficient Lifelong Reinforcement Learning with Linear Function Approximation.

[DOI]

Lin F. Yang

Ching-An Cheng

CoRR, 2022

Doubly Pessimistic Algorithms for Strictly Safe Off-Policy Optimization.

[DOI]

Lin F. Yang

Proceedings of the 56th Annual Conference on Information Sciences and Systems, 2022

2021

Safe Linear Thompson Sampling With Side Information.

[DOI]

Ahmadreza Moradipari

IEEE Trans. Signal Process., 2021

UCB-based Algorithms for Multinomial Logistic Regression Bandits.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Regret Bounds for Safe Gaussian Process Bandit Optimization.

[DOI]

Proceedings of the IEEE International Symposium on Information Theory, 2021

Safe Reinforcement Learning with Linear Function Approximation.

[DOI]

Lin Yang

Proceedings of the 38th International Conference on Machine Learning, 2021

Safe Linear Bandits.

[DOI]

Ahmadreza Moradipari

Proceedings of the 55th Annual Conference on Information Sciences and Systems, 2021

Decentralized Multi-Agent Linear Bandits with Safety Constraints.

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Regret Bound for Safe Gaussian Process Bandit Optimization.

[DOI]

Proceedings of the 2nd Annual Conference on Learning for Dynamics and Control, 2020

Generalized Linear Bandits with Safety Constraints.

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Safe Linear Thompson Sampling.

[DOI]

Ahmadreza Moradipari

CoRR, 2019

Linear Stochastic Bandits Under Safety Constraints.

[DOI]