Andrew M. Dai

Affiliations:
  • Google


According to our database1, Andrew M. Dai authored at least 62 papers between 2011 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Scaling Instruction-Finetuned Language Models.
J. Mach. Learn. Res., 2024

Learning to Select the Best Forecasting Tasks for Clinical Outcome Prediction.
CoRR, 2024

Best Practices and Lessons Learned on Synthetic Data for Language Models.
CoRR, 2024

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context.
CoRR, 2024

2023
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Trans. Mach. Learn. Res., 2023

MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks.
Trans. Mach. Learn. Res., 2023

PaLM: Scaling Language Modeling with Pathways.
J. Mach. Learn. Res., 2023

Gemini: A Family of Highly Capable Multimodal Models.
CoRR, 2023

Training Socially Aligned Language Models in Simulated Human Society.
CoRR, 2023

PaLM 2 Technical Report.
CoRR, 2023

Order Matters in the Presence of Dataset Imbalance for Multilingual Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Brainformers: Trading Simplicity for Efficiency.
Proceedings of the International Conference on Machine Learning, 2023

Mind's Eye: Grounded Language Model Reasoning through Simulation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Massively Multilingual Shallow Fusion with Large Language Models.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Scaling Instruction-Finetuned Language Models.
CoRR, 2022

Mixture-of-Experts with Expert Choice Routing.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022


Finetuned Language Models are Zero-Shot Learners.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Co-training Transformer with Videos and Images Improves Action Recognition.
CoRR, 2021

BEDS-Bench: Behavior of EHR-models under Distributional Shift-A Benchmark.
CoRR, 2021

Training independent subnetworks for robust prediction.
Proceedings of the 9th International Conference on Learning Representations, 2021

MUFASA: Multimodal Fusion Architecture Search for Electronic Health Records.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Learnability and Complexity of Quantum Samples.
CoRR, 2020

Google COVID-19 Search Trends Symptoms Dataset: Anonymization Process Description (version 1.0).
CoRR, 2020

Learning Unstable Dynamical Systems with Time-Weighted Logarithmic Loss.
CoRR, 2020

Compositionality and Capacity in Emergent Languages.
Proceedings of the 5th Workshop on Representation Learning for NLP, 2020

Learning to Select Best Forecast Tasks for Clinical Outcome Prediction.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Deep State-Space Generative Model For Correlated Time-to-Event Predictions.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Flow Contrastive Estimation of Energy-Based Models.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Explaining an increase in predicted risk for clinical alerts.
Proceedings of the ACM CHIL '20: ACM Conference on Health, 2020

Analyzing the role of model uncertainty for electronic health records.
Proceedings of the ACM CHIL '20: ACM Conference on Health, 2020

Capacity, Bandwidth, and Compositionality in Emergent Language Learning.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

Learning the Graphical Structure of Electronic Health Records with Graph Convolutional Transformer.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Natural Questions: a Benchmark for Question Answering Research.
Trans. Assoc. Comput. Linguistics, 2019

Deep Physiological State Space Model for Clinical Forecasting.
CoRR, 2019

Modelling EHR timeseries by restricting feature interaction.
CoRR, 2019

Federated and Differentially Private Learning for Electronic Health Records.
CoRR, 2019

Learning an Adaptive Learning Rate Schedule.
CoRR, 2019

Improved Patient Classification with Language Model Pretraining Over Clinical Notes.
CoRR, 2019

Graph Convolutional Transformer: Learning the Graphical Structure of Electronic Health Records.
CoRR, 2019

Gmail Smart Compose: Real-Time Assisted Writing.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Music Transformer: Generating Music with Long-Term Structure.
Proceedings of the 7th International Conference on Learning Representations, 2019

2018
Scalable and accurate deep learning with electronic health records.
npj Digit. Medicine, 2018

Reply: metrics to assess machine learning models.
npj Digit. Medicine, 2018

An Improved Relative Self-Attention Mechanism for Transformer with Application to Music Generation.
CoRR, 2018

Scalable and accurate deep learning for electronic health records.
CoRR, 2018

Embedding Text in Hyperbolic Spaces.
Proceedings of the Twelfth Workshop on Graph-Based Methods for Natural Language Processing, 2018

Learning Longer-term Dependencies in RNNs with Auxiliary Losses.
Proceedings of the 35th International Conference on Machine Learning, 2018

Learning Longer-term Dependencies in RNNs with Auxiliary Losses.
Proceedings of the 6th International Conference on Learning Representations, 2018

Many Paths to Equilibrium: GANs Do Not Need to Decrease a Divergence At Every Step.
Proceedings of the 6th International Conference on Learning Representations, 2018

MaskGAN: Better Text Generation via Filling in the _______.
Proceedings of the 6th International Conference on Learning Representations, 2018

AirDialogue: An Environment for Goal-Oriented Dialogue Research.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Who Said What: Modeling Individual Labelers Improves Classification.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Adversarial Training Methods for Semi-Supervised Text Classification.
Proceedings of the 5th International Conference on Learning Representations, 2017

HyperNetworks.
Proceedings of the 5th International Conference on Learning Representations, 2017

2016
Virtual Adversarial Training for Semi-Supervised Text Classification.
CoRR, 2016

Generating Sentences from a Continuous Space.
Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning, 2016

2015
The Supervised Hierarchical Dirichlet Process.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Document Embedding with Paragraph Vectors.
CoRR, 2015

Semi-supervised Sequence Learning.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

2011
The Grouped Author-Topic Model for Unsupervised Entity Resolution.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2011, 2011

Language-independent compound splitting with morphological operations.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011


  Loading...