Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
A Quadratic Synchronization Rule for Distributed Deep Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Why (and When) does Local SGD Generalize Better than SGD?
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Fast Federated Learning in the Presence of Arbitrary Device Unavailability.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Predicting the Length of Stay of Patients in Hospitals.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2021