Yuexiang Zhai

According to our database1, Yuexiang Zhai authored at least 20 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

2019
2020
2021
2022
2023
2024
0
1
2
3
4
5
6
7
2
5
1
1
1
2
2
1
1
1
2
1

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning.
CoRR, 2024

Is Offline Decision Making Possible with Only Few Samples? Reliable Decisions in Data-Starved Bandits via Trust Region Enhancement.
CoRR, 2024

RLIF: Interactive Imitation Learning as Reinforcement Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models.
CoRR, 2023

White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is?
CoRR, 2023

Investigating the Catastrophic Forgetting in Multimodal Large Language Models.
CoRR, 2023

Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning.
CoRR, 2023

Closed-Loop Transcription via Convolutional Sparse Coding.
CoRR, 2023

Understanding the Complexity Gains of Single-Task RL with a Curriculum.
Proceedings of the International Conference on Machine Learning, 2023

2022
Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning.
J. Artif. Intell. Res., 2022

Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Computational Benefits of Intermediate Rewards for Hierarchical Planning.
CoRR, 2021

Convolutional Normalization: Improving Deep Convolutional Network Robustness and Training.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
Complete Dictionary Learning via L4-Norm Maximization over the Orthogonal Group.
J. Mach. Learn. Res., 2020

Understanding l4-based Dictionary Learning: Interpretation, Stability, and Robustness.
Proceedings of the 8th International Conference on Learning Representations, 2020

Geometric Analysis of Nonconvex Optimization Landscapes for Overcomplete Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
Analysis of the Optimization Landscapes for Overcomplete Representation Learning.
CoRR, 2019

Complete Dictionary Learning via 𝓁<sup>4</sup>-Norm Maximization over the Orthogonal Group.
CoRR, 2019

Learning to Reconstruct 3D Manhattan Wireframes From a Single Image.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019


  Loading...