Alexander Bukharin

Orcid: 0000-0003-2913-5112

According to our database1, Alexander Bukharin authored at least 14 papers between 2021 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
HelpSteer2-Preference: Complementing Ratings with Preferences.
CoRR, 2024

RNR: Teaching Large Language Models to Follow Roles and Rules.
CoRR, 2024

Robust Reinforcement Learning from Corrupted Human Feedback.
CoRR, 2024

Adaptive Preference Scaling for Reinforcement Learning with Human Feedback.
CoRR, 2024

Data Diversity Matters for Robust Instruction Tuning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
Data Diversity Matters for Robust Instruction Tuning.
CoRR, 2023

Deep Reinforcement Learning from Hierarchical Weak Preference Feedback.
CoRR, 2023

Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Stable Algorithms.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Machine Learning Force Fields with Data Cost Aware Training.
Proceedings of the International Conference on Machine Learning, 2023

Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Early Detection of COVID-19 Hotspots Using Spatio-Temporal Data.
IEEE J. Sel. Top. Signal Process., 2022

PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance.
Proceedings of the International Conference on Machine Learning, 2022

2021
High-Resolution Spatio-Temporal Model for County-Level COVID-19 Activity in the U.S.
ACM Trans. Manag. Inf. Syst., 2021

Early Detection of COVID-19 Hotspots Using Spatio-Temporal Data.
CoRR, 2021


  Loading...