Yufeng Zhang

Affiliations:
  • Northwestern University, Evanston, IL, USA


According to our database1, Yufeng Zhang authored at least 17 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs.
CoRR, 2024

A Mean-Field Analysis of Neural Gradient Descent-Ascent: Applications to Functional Conditional Moment Equations.
CoRR, 2024

Can Large Language Models Play Games? A Case Study of A Self-Play Approach.
CoRR, 2024

2023
What and How does In-Context Learning Learn? Bayesian Model Averaging, Parameterization, and Generalization.
CoRR, 2023

2022
An Analysis of Attention via the Lens of Exchangeability and Latent Variable Models.
CoRR, 2022

Federated Offline Reinforcement Learning.
CoRR, 2022

Learning from Demonstration: Provably Efficient Adversarial Policy Imitation with Linear Function Approximation.
Proceedings of the International Conference on Machine Learning, 2022

Provably Efficient Offline Reinforcement Learning for Partially Observable Markov Decision Processes.
Proceedings of the International Conference on Machine Learning, 2022

2021
Provably Efficient Generative Adversarial Imitation Learning for Online and Offline Setting with Linear Function Approximation.
CoRR, 2021

Wasserstein Flow Meets Replicator Dynamics: A Mean-Field Analysis of Representation Learning in Actor-Critic.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Offline Constrained Multi-Objective Reinforcement Learning via Pessimistic Dual Value Iteration.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Infinite-Dimensional Optimization for Zero-Sum Games via Variational Transport.
Proceedings of the 38th International Conference on Machine Learning, 2021

Provably Efficient Actor-Critic for Risk-Sensitive and Robust Adversarial RL: A Linear-Quadratic Case.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

2020
Variational Transport: A Convergent Particle-BasedAlgorithm for Distributional Optimization.
CoRR, 2020

Generative Adversarial Imitation Learning with Neural Networks: Global Optimality and Convergence Rate.
CoRR, 2020

Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Generative Adversarial Imitation Learning with Neural Network Parameterization: Global Optimality and Convergence Rate.
Proceedings of the 37th International Conference on Machine Learning, 2020


  Loading...