Xuezhou Zhang

According to our database, Xuezhou Zhang authored at least 43 papers between 2018 and 2024.

Collaborative distances:

Dijkstra number of three.
Erdős number of four.

Bibliography

2024

Scale-free Adversarial Reinforcement Learning.

Mingyu Chen

Xuezhou Zhang

Proceedings of the Thirty Seventh Annual Conference on Learning Theory, June 30, 2024

Exact Policy Recovery in Offline RL with Both Heavy-Tailed Rewards and Data Corruption.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Federated Multi-Level Optimization over Decentralized Networks.

Shuoguang Yang

Xuezhou Zhang

Mengdi Wang

CoRR, 2023

Improved Algorithms for Adversarial Bandits with Unbounded Losses.

Mingyu Chen

Xuezhou Zhang

CoRR, 2023

Learning Adversarial Low-rank Markov Decision Processes with Unknown Transition and Full-information Feedback.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDP.

Proceedings of the International Conference on Machine Learning, 2023

Representation Learning for Low-rank General-sum Markov Games.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Provable Benefits of Representational Transfer in Reinforcement Learning.

Proceedings of the Thirty Sixth Annual Conference on Learning Theory, 2023

Byzantine-Robust Online and Offline Distributed Reinforcement Learning.

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022

Provable Defense against Backdoor Policies in Reinforcement Learning.

CoRR, 2022

Representation Learning for General-sum Low-rank Markov Games.

CoRR, 2022

Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization.

CoRR, 2022

Optimal Estimation of Off-Policy Policy Gradient via Double Fitted Iteration.

CoRR, 2022

Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Decentralized Gossip-Based Stochastic Bilevel Optimization over Communication Networks.

Shuoguang Yang

Xuezhou Zhang

Mengdi Wang

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Provable Defense against Backdoor Policies in Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory.

Proceedings of the International Conference on Machine Learning, 2022

Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning approach.

Proceedings of the International Conference on Machine Learning, 2022

Optimal Estimation of Policy Gradient via Double Fitted Iteration.

Proceedings of the International Conference on Machine Learning, 2022

Representation Learning for Online and Offline RL in Low-rank MDPs.

Masatoshi Uehara

Xuezhou Zhang

Wen Sun

Proceedings of the Tenth International Conference on Learning Representations, 2022

Corruption-robust Offline Reinforcement Learning.

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021

Corruption-Robust Offline Reinforcement Learning.

CoRR, 2021

Reward Poisoning in Reinforcement Learning: Attacks Against Unknown Learners in Unknown Environments.

CoRR, 2021

Controllable and Diverse Text Generation in E-commerce.

Proceedings of the WWW '21: The Web Conference 2021, 2021

Neural Additive Models: Interpretable Machine Learning with Neural Nets.

Benjamin J. Lengerich

Rich Caruana

Geoffrey E. Hinton

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Robust Policy Gradient against Strong Data Corruption.

Proceedings of the 38th International Conference on Machine Learning, 2021

Using Machine Teaching to Investigate Human Assumptions when Teaching Reinforcement Learners.

Proceedings of the 43rd Annual Meeting of the Cognitive Science Society, 2021

The Sample Complexity of Teaching by Reinforcement on Q-Learning.

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Using Machine Teaching to Investigate Human Assumptions when Teaching Reinforcement Learners.

CoRR, 2020

The Teaching Dimension of Q-learning.

CoRR, 2020

Neural Additive Models: Interpretable Machine Learning with Neural Nets.

CoRR, 2020

Task-agnostic Exploration in Reinforcement Learning.

Xuezhou Zhang

Yuzhe Ma

Adish Singla

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Online Data Poisoning Attacks.

Xuezhou Zhang

Xiaojin Zhu

Laurent Lessard

Proceedings of the 2nd Annual Conference on Learning for Dynamics and Control, 2020

Adaptive Reward-Poisoning Attacks against Reinforcement Learning.

Proceedings of the 37th International Conference on Machine Learning, 2020

2019

Policy Poisoning in Batch Reinforcement Learning and Control.

CoRR, 2019

Online Data Poisoning Attack.

Xuezhou Zhang

Xiaojin Zhu

CoRR, 2019

Policy Poisoning in Batch Reinforcement Learning and Control.

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Axiomatic Interpretability for Multiclass Additive Models.

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

An Optimal Control Approach to Sequential Machine Teaching.

Laurent Lessard

Xuezhou Zhang

Xiaojin Zhu

Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018

Interpretability is Harder in the Multiclass Setting: Axiomatic Interpretability for Multiclass Additive Models.

CoRR, 2018

Training Set Camouflage.

Proceedings of the Decision and Game Theory for Security - 9th International Conference, 2018

Teacher Improves Learning by Selecting a Training Subset.

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

Training Set Debugging Using Trusted Items.

Xuezhou Zhang

Xiaojin Zhu

Stephen J. Wright

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
