Aidan N. Gomez

Affiliations:
  • Cohere


According to our database1, Aidan N. Gomez authored at least 22 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Aya 23: Open Weight Releases to Further Multilingual Progress.
CoRR, 2024

2022
Interlocking Backpropagation: Improving depthwise model-parallelism.
J. Mach. Learn. Res., 2022

Exploring Low Rank Training of Deep Neural Networks.
CoRR, 2022

Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval.
Proceedings of the International Conference on Machine Learning, 2022

Prioritized Training on Points that are Learnable, Worth Learning, and not yet Learnt.
Proceedings of the International Conference on Machine Learning, 2022

2021
Prioritized training on points that are learnable, worth learning, and not yet learned.
CoRR, 2021

Robustness to Pruning Predicts Generalization in Deep Neural Networks.
CoRR, 2021

Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
Interlocking Backpropagation: Improving depthwise model-parallelism.
CoRR, 2020

SliceOut: Training Transformers and CNNs faster while using less memory.
CoRR, 2020

Wat zei je? Detecting Out-of-Distribution Translations with Variational Transformers.
CoRR, 2020

Predicting Twitter Engagement With Deep Language Models.
Proceedings of the RecSys Challenge '20: Proceedings of the Recommender Systems Challenge 2020, 2020

2019
RL: Generic reinforcement learning codebase in TensorFlow.
J. Open Source Softw., 2019

A Systematic Comparison of Bayesian Deep Learning Robustness in Diabetic Retinopathy Tasks.
CoRR, 2019

The Difficulty of Training Sparse Neural Networks.
CoRR, 2019

Learning Sparse Networks Using Targeted Dropout.
CoRR, 2019

2018
Depthwise Separable Convolutions for Neural Machine Translation.
Proceedings of the 6th International Conference on Learning Representations, 2018

Unsupervised Cipher Cracking Using Discrete GANs.
Proceedings of the 6th International Conference on Learning Representations, 2018

Tensor2Tensor for Neural Machine Translation.
Proceedings of the 13th Conference of the Association for Machine Translation in the Americas, 2018

2017
One Model To Learn Them All.
CoRR, 2017

Attention is All you Need.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

The Reversible Residual Network: Backpropagation Without Storing Activations.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017


  Loading...