Banghua Zhu

Orcid: 0000-0002-7320-3533

According to our database1, Banghua Zhu authored at least 43 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Noisy Sorting Capacity.
IEEE Trans. Inf. Theory, September, 2024

Noisy Computing of the OR and MAX Functions.
IEEE J. Sel. Areas Inf. Theory, 2024

How to Evaluate Reward Models for RLHF.
CoRR, 2024

Taming Overconfidence in LLMs: Reward Calibration in RLHF.
CoRR, 2024

From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline.
CoRR, 2024

Noisy Computing of the Threshold Function.
CoRR, 2024

Generative AI Security: Challenges and Countermeasures.
CoRR, 2024

Efficient Prompt Caching via Embedding Similarity.
CoRR, 2024

Fairness in Serving Large Language Models.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

SLoRA: Scalable Serving of Thousands of LoRA Adapters.
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024

Guided Online Distillation: Promoting Safe Reinforcement Learning by Offline Demonstration.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Towards the Fundamental Limits of Knowledge Transfer over Finite Domains.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

The Effective Horizon Explains Deep RL Performance in Stochastic Environments.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Towards Optimal Statistical Watermarking.
CoRR, 2023

S-LoRA: Serving Thousands of Concurrent LoRA Adapters.
CoRR, 2023

QFT: Quantized Full-parameter Tuning of LLMs with Affordable Resources.
CoRR, 2023

Pairwise Proximal Policy Optimization: Harnessing Relative Feedback for LLM Alignment.
CoRR, 2023

Fine-Tuning Language Models with Advantage-Induced Policy Alignment.
CoRR, 2023

On Optimal Caching and Model Multiplexing for Large Model Inference.
CoRR, 2023

Online Learning in a Creator Economy.
CoRR, 2023

The Sample Complexity of Online Contract Design.
Proceedings of the 24th ACM Conference on Economics and Computation, 2023

Doubly-Robust Self-Training.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Towards Optimal Caching and Model Selection for Large Model Inference.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

On the Optimal Bounds for Noisy Computing.
Proceedings of the IEEE International Symposium on Information Theory, 2023

Variable-Length Insertion-Based Noisy Sorting.
Proceedings of the IEEE International Symposium on Information Theory, 2023

Principled Reinforcement Learning with Human Feedback from Pairwise or K-wise Comparisons.
Proceedings of the International Conference on Machine Learning, 2023

Online Learning in Stackelberg Games with an Omniscient Follower.
Proceedings of the International Conference on Machine Learning, 2023

Jump-Start Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023

Byzantine-Robust Federated Learning with Optimal Statistical Rates.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022
Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism.
IEEE Trans. Inf. Theory, 2022

Minimax Off-Policy Evaluation for Multi-Armed Bandits.
IEEE Trans. Inf. Theory, 2022

Byzantine-Robust Federated Learning with Optimal Statistical Rates and Privacy Guarantees.
CoRR, 2022

Robust Estimation for Nonparametric Families via Generative Adversarial Networks.
CoRR, 2022

Robust Estimation for Non-parametric Families via Generative Adversarial Networks.
Proceedings of the IEEE International Symposium on Information Theory, 2022

2021
Linear Representation Meta-Reinforcement Learning for Instant Adaptation.
CoRR, 2021

2020
Deconstructing Generative Adversarial Networks.
IEEE Trans. Inf. Theory, 2020

Robust estimation via generalized quasi-gradients.
CoRR, 2020

When does the Tukey Median work?
Proceedings of the IEEE International Symposium on Information Theory, 2020

2019
Joint Transceiver Optimization for Wireless Communication PHY Using Neural Network.
IEEE J. Sel. Areas Commun., 2019

Generalized Resilience and Robust Statistics.
CoRR, 2019

2017
Sparse Tensor Decomposition for Haplotype Assembly of Diploids and Polyploids.
Proceedings of the 8th ACM International Conference on Bioinformatics, 2017


  Loading...