Aidan N. Gomez

Affiliations:

Cohere

According to our database¹, Aidan N. Gomez authored at least 23 papers between 2017 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2024

Aya Expanse: Combining Research Breakthroughs for a New Multilingual Frontier.

[BibT_eX]

[DOI]

CoRR, 2024

Aya 23: Open Weight Releases to Further Multilingual Progress.

[BibT_eX]

[DOI]

CoRR, 2024

2022

Interlocking Backpropagation: Improving depthwise model-parallelism.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2022

Exploring Low Rank Training of Deep Neural Networks.

[BibT_eX]

[DOI]

Siddhartha Rao Kamalakara

CoRR, 2022

Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval.

[BibT_eX]

[DOI]

Pascal Notin

Mafalda Dias

Jonathan Frazer

Javier Marchena-Hurtado

Aidan N. Gomez

Debora S. Marks

Yarin Gal

Proceedings of the International Conference on Machine Learning, 2022

Prioritized Training on Points that are Learnable, Worth Learning, and not yet Learnt.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

2021

Prioritized training on points that are learnable, worth learning, and not yet learned.

[BibT_eX]

[DOI]

CoRR, 2021

Robustness to Pruning Predicts Generalization in Deep Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2021

Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020

Interlocking Backpropagation: Improving depthwise model-parallelism.

[BibT_eX]

[DOI]

CoRR, 2020

SliceOut: Training Transformers and CNNs faster while using less memory.

[BibT_eX]

[DOI]

CoRR, 2020

Wat zei je? Detecting Out-of-Distribution Translations with Variational Transformers.

[BibT_eX]

[DOI]

Tim Z. Xiao

Aidan N. Gomez

Yarin Gal

CoRR, 2020

Predicting Twitter Engagement With Deep Language Models.

[BibT_eX]

[DOI]

Proceedings of the RecSys Challenge '20: Proceedings of the Recommender Systems Challenge 2020, 2020

2019

RL: Generic reinforcement learning codebase in TensorFlow.

[BibT_eX]

[DOI]

Bryan M. Li

Alexander I. Cowen-Rivers

Piotr Kozakowski

David Tao

Siddhartha Kamalakara

J. Open Source Softw., 2019

A Systematic Comparison of Bayesian Deep Learning Robustness in Diabetic Retinopathy Tasks.

[BibT_eX]

[DOI]

CoRR, 2019

The Difficulty of Training Sparse Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2019

Learning Sparse Networks Using Targeted Dropout.

[BibT_eX]

[DOI]

CoRR, 2019

2018

Depthwise Separable Convolutions for Neural Machine Translation.

[BibT_eX]

[DOI]

Lukasz Kaiser

Aidan N. Gomez

François Chollet

Proceedings of the 6th International Conference on Learning Representations, 2018

Unsupervised Cipher Cracking Using Discrete GANs.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Learning Representations, 2018

Tensor2Tensor for Neural Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the 13th Conference of the Association for Machine Translation in the Americas, 2018

2017

One Model To Learn Them All.

[BibT_eX]

[DOI]

CoRR, 2017

Attention is All you Need.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

The Reversible Residual Network: Backpropagation Without Storing Activations.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Aidan N. Gomez

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...