Alexander A. Alemi

Orcid: 0000-0001-7147-9184

Affiliations:
  • Google Inc, Mountain View, CA, USA
  • Cornell University, Ithaca, NY, USA (PhD 2015)


According to our database1, Alexander A. Alemi authored at least 42 papers between 2015 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models.
Trans. Mach. Learn. Res., 2024

Training LLMs over Neurally Compressed Text.
CoRR, 2024

Scaling Exponents Across Parameterizations and Optimizers.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Small-scale proxies for large-scale Transformer training instabilities.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5?
CoRR, 2023

Speed Limits for Deep Learning.
CoRR, 2023

Variational Prediction.
CoRR, 2023

Weighted Ensemble Self-Supervised Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Bayesian Imitation Learning for End-to-End Mobile Manipulation.
Proceedings of the International Conference on Machine Learning, 2022

PACm-Bayes: Narrowing the Empirical Risk Gap in the Misspecified Bayesian Regime.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021
A Closer Look at the Adversarial Robustness of Information Bottleneck Models.
CoRR, 2021

Does Knowledge Distillation Really Work?
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Density of States Estimation for Out of Distribution Detection.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

2020
CEB Improves Model Robustness.
Entropy, 2020

VIB is Half Bayes.
CoRR, 2020

PAC<sup>m</sup>-Bayes: Narrowing the Empirical Risk Gap in the Misspecified Bayesian Regime.
CoRR, 2020

Neural Tangents: Fast and Easy Infinite Neural Networks in Python.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
Thermodynamic Computing.
CoRR, 2019

On Predictive Information Sub-optimality of RNNs.
CoRR, 2019

Dueling Decoders: Regularizing Variational Autoencoder Latent Spaces.
CoRR, 2019

On the Use of ArXiv as a Dataset.
CoRR, 2019

On Variational Bounds of Mutual Information.
Proceedings of the 36th International Conference on Machine Learning, 2019

Information in Infinite Ensembles of Infinitely-Wide Neural Networks.
Proceedings of the Symposium on Advances in Approximate Bayesian Inference, 2019

Variational Predictive Information Bottleneck.
Proceedings of the Symposium on Advances in Approximate Bayesian Inference, 2019

2018
β-VAEs can retain label information even at high compression.
CoRR, 2018

TherML: Thermodynamics of Machine Learning.
CoRR, 2018

Uncertainty in the Variational Information Bottleneck.
CoRR, 2018

GILBO: One Metric to Measure Them All.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Watch Your Step: Learning Node Embeddings via Graph Attention.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Fixing a Broken ELBO.
Proceedings of the 35th International Conference on Machine Learning, 2018

2017
TensorFlow Distributions.
CoRR, 2017

An Information-Theoretic Analysis of Deep Latent-Variable Models.
CoRR, 2017

Watch Your Step: Learning Graph Embeddings Through Attention.
CoRR, 2017

Jeffrey's prior sampling of deep sigmoidal networks.
CoRR, 2017

Motion Prediction Under Multimodality with Conditional Stochastic Networks.
CoRR, 2017

Deep Variational Information Bottleneck.
Proceedings of the 5th International Conference on Learning Representations, 2017

Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Improved generator objectives for GANs.
CoRR, 2016

DeepMath - Deep Sequence Models for Premise Selection.
CoRR, 2016

DeepMath - Deep Sequence Models for Premise Selection.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

2015
Clustering via Content-Augmented Stochastic Blockmodels.
CoRR, 2015

Text Segmentation based on Semantic Word Embeddings.
CoRR, 2015


  Loading...