Reuse, Don't Retrain: A Recipe for Continued Pretraining of Language Models.
CoRR, 2024
Data, Data Everywhere: A Guide for Pretraining Dataset Construction.
CoRR, 2024
Data, Data Everywhere: A Guide for Pretraining Dataset Construction.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
The Importance of Background Information for Out of Distribution Generalization.
CoRR, 2022
Observational Supervision for Medical Image Classification Using Gaze Data.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021
Biomedical Information Extraction for Disease Gene Prioritization.
CoRR, 2020