Xuezhou Zhang

According to our database1, Xuezhou Zhang authored at least 45 papers between 2018 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
State-free Reinforcement Learning.
CoRR, 2024

Efficient Reinforcement Learning in Probabilistic Reward Machines.
CoRR, 2024

Scale-free Adversarial Reinforcement Learning.
Proceedings of the Thirty Seventh Annual Conference on Learning Theory, June 30, 2024

Exact Policy Recovery in Offline RL with Both Heavy-Tailed Rewards and Data Corruption.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Federated Multi-Level Optimization over Decentralized Networks.
CoRR, 2023

Improved Algorithms for Adversarial Bandits with Unbounded Losses.
CoRR, 2023

Learning Adversarial Low-rank Markov Decision Processes with Unknown Transition and Full-information Feedback.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDP.
Proceedings of the International Conference on Machine Learning, 2023

Representation Learning for Low-rank General-sum Markov Games.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Provable Benefits of Representational Transfer in Reinforcement Learning.
Proceedings of the Thirty Sixth Annual Conference on Learning Theory, 2023

Byzantine-Robust Online and Offline Distributed Reinforcement Learning.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022
Provable Defense against Backdoor Policies in Reinforcement Learning.
CoRR, 2022

Representation Learning for General-sum Low-rank Markov Games.
CoRR, 2022

Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization.
CoRR, 2022

Optimal Estimation of Off-Policy Policy Gradient via Double Fitted Iteration.
CoRR, 2022

Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Decentralized Gossip-Based Stochastic Bilevel Optimization over Communication Networks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Provable Defense against Backdoor Policies in Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory.
Proceedings of the International Conference on Machine Learning, 2022

Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning approach.
Proceedings of the International Conference on Machine Learning, 2022

Optimal Estimation of Policy Gradient via Double Fitted Iteration.
Proceedings of the International Conference on Machine Learning, 2022

Representation Learning for Online and Offline RL in Low-rank MDPs.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Corruption-robust Offline Reinforcement Learning.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021
Corruption-Robust Offline Reinforcement Learning.
CoRR, 2021

Reward Poisoning in Reinforcement Learning: Attacks Against Unknown Learners in Unknown Environments.
CoRR, 2021

Controllable and Diverse Text Generation in E-commerce.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Neural Additive Models: Interpretable Machine Learning with Neural Nets.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Robust Policy Gradient against Strong Data Corruption.
Proceedings of the 38th International Conference on Machine Learning, 2021

Using Machine Teaching to Investigate Human Assumptions when Teaching Reinforcement Learners.
Proceedings of the 43rd Annual Meeting of the Cognitive Science Society, 2021

The Sample Complexity of Teaching by Reinforcement on Q-Learning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Using Machine Teaching to Investigate Human Assumptions when Teaching Reinforcement Learners.
CoRR, 2020

The Teaching Dimension of Q-learning.
CoRR, 2020

Neural Additive Models: Interpretable Machine Learning with Neural Nets.
CoRR, 2020

Task-agnostic Exploration in Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Online Data Poisoning Attacks.
Proceedings of the 2nd Annual Conference on Learning for Dynamics and Control, 2020

Adaptive Reward-Poisoning Attacks against Reinforcement Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
Policy Poisoning in Batch Reinforcement Learning and Control.
CoRR, 2019

Online Data Poisoning Attack.
CoRR, 2019

Policy Poisoning in Batch Reinforcement Learning and Control.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Axiomatic Interpretability for Multiclass Additive Models.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

An Optimal Control Approach to Sequential Machine Teaching.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018
Interpretability is Harder in the Multiclass Setting: Axiomatic Interpretability for Multiclass Additive Models.
CoRR, 2018

Training Set Camouflage.
Proceedings of the Decision and Game Theory for Security - 9th International Conference, 2018

Teacher Improves Learning by Selecting a Training Subset.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

Training Set Debugging Using Trusted Items.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018


  Loading...