Siwei Wang

Zhixuan Fang

Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

2023

Taming the Exponential Action Set: Sublinear Regret and Fast Convergence to Nash Equilibrium in Online Congestion Games.

CoRR, 2023

Contextual Combinatorial Bandits with Probabilistically Triggered Arms.

Mohammad H. Hajiesmaili

Adam Wierman

CoRR, 2023

Contextual Combinatorial Bandits with Probabilistically Triggered Arms.

Proceedings of the International Conference on Machine Learning, 2023

Provably Efficient Risk-Sensitive Reinforcement Learning: Iterated CVaR and Worst Path.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

The pure exploration problem with general reward functions depending on full distributions.

Mach. Learn., 2022

Regret Analysis for Hierarchical Experts Bandit Problem.

Qihan Guo

Jun Zhu

CoRR, 2022

Risk-Sensitive Reinforcement Learning: Iterated CVaR and the Worst Path.

CoRR, 2022

Matching in Multi-arm Bandit with Collision.

Yirui Zhang

Zhixuan Fang

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Batch-Size Independent Regret Bounds for Combinatorial Semi-Bandits with Probabilistically Triggered Arms or Independent Arms.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Combinatorial Bandits with Linear Constraints: Beyond Knapsacks and Fairness.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Thompson Sampling for (Combinatorial) Pure Exploration.

Jun Zhu

Proceedings of the International Conference on Machine Learning, 2022

2021

Pure Exploration Bandit Problem with General Reward Functions Depending on Full Distributions.

CoRR, 2021

Continuous Mean-Covariance Bandits.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Adaptive Algorithms for Multi-armed Bandit with Composite and Anonymous Feedback.

Haoyun Wang

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

A One-Size-Fits-All Solution to Conservative Bandit Problems.

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Restless-UCB, an Efficient and Low-complexity Algorithm for Online Restless Bandits.

John C. S. Lui

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Dueling Bandits: From Two-dueling to Multi-dueling.

Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

2018

Multi-armed Bandits with Compensation.

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Thompson Sampling for Combinatorial Semi-Bandits.
