Baihe Huang

According to our database1, Baihe Huang authored at least 20 papers between 2020 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Stochastic Zeroth-Order Optimization under Strongly Convexity and Lipschitz Hessian: Minimax Sample Complexity.
CoRR, 2024

Towards a Theoretical Understanding of the 'Reversal Curse' via Training Dynamics.
CoRR, 2024

Data Acquisition via Experimental Design for Decentralized Data Markets.
CoRR, 2024

On Representation Complexity of Model-based and Model-free Reinforcement Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Enhancing Language Model Alignment: A Confidence-Based Approach to Label Smoothing.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
Policy Mirror Descent for Regularized Reinforcement Learning: A Generalized Framework with Linear Convergence.
SIAM J. Optim., June, 2023

Towards Optimal Statistical Watermarking.
CoRR, 2023

Evaluating and Incentivizing Diverse Data Contributions in Collaborative Learning.
CoRR, 2023

Sample Complexity for Quadratic Bandits: Hessian Dependent Bounds and Optimal Algorithms.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Optimal Sample Complexity Bounds for Non-convex Optimization under Kurdyka-Lojasiewicz Condition.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022
A Dynamic Fast Gaussian Transform.
CoRR, 2022

Towards General Function Approximation in Zero-Sum Markov Games.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Solving SDP Faster: A Robust IPM Framework and Efficient Implementation.
Proceedings of the 63rd IEEE Annual Symposium on Foundations of Computer Science, 2022

Offline Reinforcement Learning with Realizability and Single-policy Concentrability.
Proceedings of the Conference on Learning Theory, 2-5 July 2022, London, UK., 2022

2021
FL-NTK: A Neural Tangent Kernel-based Framework for Federated Learning Convergence Analysis.
CoRR, 2021

Solving Tall Dense SDPs in the Current Matrix Multiplication Time.
CoRR, 2021

Optimal Gradient-based Algorithms for Non-concave Bandit Optimization.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Going Beyond Linear RL: Sample Efficient Neural Function Approximation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

FL-NTK: A Neural Tangent Kernel-based Framework for Federated Learning Analysis.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
InstaHide's Sample Complexity When Mixing Two Private Images.
CoRR, 2020


  Loading...