WebRED: Effective Pretraining And Finetuning For Relation Extraction On The Web.
CoRR, 2021
Distributed multigrid neural solvers on megavoxel domains.
Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, 2021
Is Batch Norm unique? An empirical investigation and prescription to emulate the best properties of common normalizers without batch dependence.
CoRR, 2020
Deep Generative Models that Solve PDEs: Distributed Computing for Training Large Data-Free Models.
Proceedings of the 6th IEEE/ACM Workshop on Machine Learning in High Performance Computing Environments, 2020
Assessing The Factual Accuracy of Generated Text.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019
A Mean Field Theory of Batch Normalization.
Proceedings of the 7th International Conference on Learning Representations, 2019
Active Learning for Speech Recognition: the Power of Gradients.
CoRR, 2016