Andrew M. Dai

Affiliations:

Google

According to our database¹, Andrew M. Dai authored at least 60 papers between 2011 and 2024.

Collaborative distances:

Dijkstra number² of three.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2024

Best Practices and Lessons Learned on Synthetic Data for Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context.

[BibT_eX]

[DOI]

CoRR, 2024

2023

PaLM: Scaling Language Modeling with Pathways.

[BibT_eX]

[DOI]

Vinodkumar Prabhakaran

Thanumalayan Sankaranarayana Pillai

Kathy Meier-Hellstern

J. Mach. Learn. Res., 2023

Gemini: A Family of Highly Capable Multimodal Models.

[BibT_eX]

[DOI]

CoRR, 2023

Training Socially Aligned Language Models in Simulated Human Society.

[BibT_eX]

[DOI]

CoRR, 2023

PaLM 2 Technical Report.

[BibT_eX]

[DOI]

CoRR, 2023

MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks.

[BibT_eX]

[DOI]

CoRR, 2023

Order Matters in the Presence of Dataset Imbalance for Multilingual Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Brainformers: Trading Simplicity for Efficiency.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Mind's Eye: Grounded Language Model Reasoning through Simulation.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Massively Multilingual Shallow Fusion with Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Scaling Instruction-Finetuned Language Models.

[BibT_eX]

[DOI]

CoRR, 2022

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.

[BibT_eX]

[DOI]

CoRR, 2022

Mixture-of-Experts with Expert Choice Routing.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

GLaM: Efficient Scaling of Language Models with Mixture-of-Experts.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

Finetuned Language Models are Zero-Shot Learners.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

2021

Co-training Transformer with Videos and Images Improves Action Recognition.

[BibT_eX]

[DOI]

CoRR, 2021

BEDS-Bench: Behavior of EHR-models under Distributional Shift-A Benchmark.

[BibT_eX]

[DOI]

Balaji Lakshminarayanan

Andrew M. Dai

CoRR, 2021

Training independent subnetworks for robust prediction.

[BibT_eX]

[DOI]

Balaji Lakshminarayanan

Andrew Mingbo Dai

Dustin Tran

Proceedings of the 9th International Conference on Learning Representations, 2021

MUFASA: Multimodal Fusion Architecture Search for Electronic Health Records.

[BibT_eX]

[DOI]

Zhen Xu

David R. So

Andrew M. Dai

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Learnability and Complexity of Quantum Samples.

[BibT_eX]

[DOI]

CoRR, 2020

Google COVID-19 Search Trends Symptoms Dataset: Anonymization Process Description (version 1.0).

[BibT_eX]

[DOI]

CoRR, 2020

Learning Unstable Dynamical Systems with Time-Weighted Logarithmic Loss.

[BibT_eX]

[DOI]

Kamil Nar

Yuan Xue

Andrew M. Dai

CoRR, 2020

Compositionality and Capacity in Emergent Languages.

[BibT_eX]

[DOI]

Proceedings of the 5th Workshop on Representation Learning for NLP, 2020

Learning to Select Best Forecast Tasks for Clinical Outcome Prediction.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Deep State-Space Generative Model For Correlated Time-to-Event Predictions.

[BibT_eX]

[DOI]

Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Flow Contrastive Estimation of Energy-Based Models.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Explaining an increase in predicted risk for clinical alerts.

[BibT_eX]

[DOI]

Proceedings of the ACM CHIL '20: ACM Conference on Health, 2020

Analyzing the role of model uncertainty for electronic health records.

[BibT_eX]

[DOI]

Michael W. Dusenberry

Proceedings of the ACM CHIL '20: ACM Conference on Health, 2020

Capacity, Bandwidth, and Compositionality in Emergent Language Learning.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

Learning the Graphical Structure of Electronic Health Records with Graph Convolutional Transformer.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Natural Questions: a Benchmark for Question Answering Research.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2019

Deep Physiological State Space Model for Clinical Forecasting.

[BibT_eX]

[DOI]

CoRR, 2019

Modelling EHR timeseries by restricting feature interaction.

[BibT_eX]

[DOI]

CoRR, 2019

Federated and Differentially Private Learning for Electronic Health Records.

[BibT_eX]

[DOI]

Stephen R. Pfohl

Andrew M. Dai

Katherine A. Heller

CoRR, 2019

Learning an Adaptive Learning Rate Schedule.

[BibT_eX]

[DOI]

CoRR, 2019

Improved Patient Classification with Language Model Pretraining Over Clinical Notes.

[BibT_eX]

[DOI]

Jonas Kemp

Alvin Rajkomar

Andrew M. Dai

CoRR, 2019

Graph Convolutional Transformer: Learning the Graphical Structure of Electronic Health Records.

[BibT_eX]

[DOI]

Edward Choi

Zhen Xu

Yujia Li

Michael W. Dusenberry

Gerardo Flores

Yuan Xue

Andrew M. Dai

CoRR, 2019

Gmail Smart Compose: Real-Time Assisted Writing.

[BibT_eX]

[DOI]

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Music Transformer: Generating Music with Long-Term Structure.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

2018

Scalable and accurate deep learning with electronic health records.

[BibT_eX]

[DOI]

npj Digit. Medicine, 2018

Reply: metrics to assess machine learning models.

[BibT_eX]

[DOI]

npj Digit. Medicine, 2018

An Improved Relative Self-Attention Mechanism for Transformer with Application to Music Generation.

[BibT_eX]

[DOI]

CoRR, 2018

Scalable and accurate deep learning for electronic health records.

[BibT_eX]

[DOI]

CoRR, 2018

Embedding Text in Hyperbolic Spaces.

[BibT_eX]

[DOI]

Bhuwan Dhingra

Christopher J. Shallue

Mohammad Norouzi

Andrew M. Dai

George E. Dahl

Proceedings of the Twelfth Workshop on Graph-Based Methods for Natural Language Processing, 2018

Learning Longer-term Dependencies in RNNs with Auxiliary Losses.

[BibT_eX]

[DOI]

Proceedings of the 35th International Conference on Machine Learning, 2018

Learning Longer-term Dependencies in RNNs with Auxiliary Losses.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Learning Representations, 2018

Many Paths to Equilibrium: GANs Do Not Need to Decrease a Divergence At Every Step.

[BibT_eX]

[DOI]

William Fedus

Mihaela Rosca

Balaji Lakshminarayanan

Andrew M. Dai

Shakir Mohamed

Ian J. Goodfellow

Proceedings of the 6th International Conference on Learning Representations, 2018

MaskGAN: Better Text Generation via Filling in the _______.

[BibT_eX]

[DOI]

William Fedus

Ian J. Goodfellow

Andrew M. Dai

Proceedings of the 6th International Conference on Learning Representations, 2018

AirDialogue: An Environment for Goal-Oriented Dialogue Research.

[BibT_eX]

[DOI]

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Who Said What: Modeling Individual Labelers Improves Classification.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Adversarial Training Methods for Semi-Supervised Text Classification.

[BibT_eX]

[DOI]

Takeru Miyato

Andrew M. Dai

Ian J. Goodfellow

Proceedings of the 5th International Conference on Learning Representations, 2017

HyperNetworks.

[BibT_eX]

[DOI]

David Ha

Andrew M. Dai

Quoc V. Le

Proceedings of the 5th International Conference on Learning Representations, 2017

2016

Virtual Adversarial Training for Semi-Supervised Text Classification.

[BibT_eX]

[DOI]

Takeru Miyato

Andrew M. Dai

Ian J. Goodfellow

CoRR, 2016

Generating Sentences from a Continuous Space.

[BibT_eX]

[DOI]

Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning, 2016

2015

The Supervised Hierarchical Dirichlet Process.

[BibT_eX]

[DOI]

Andrew M. Dai

Amos J. Storkey

IEEE Trans. Pattern Anal. Mach. Intell., 2015

Document Embedding with Paragraph Vectors.

[BibT_eX]

[DOI]

Andrew M. Dai

Christopher Olah

Quoc V. Le

CoRR, 2015

Semi-supervised Sequence Learning.

[BibT_eX]

[DOI]

Andrew M. Dai

Quoc V. Le

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

2011

The Grouped Author-Topic Model for Unsupervised Entity Resolution.

[BibT_eX]

[DOI]

Andrew M. Dai

Amos J. Storkey

Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2011, 2011

Language-independent compound splitting with morphological operations.

[BibT_eX]

[DOI]

Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Andrew M. Dai

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...