Nan Du

Affiliations:
  • Google
  • Georgia Institute of Technology, GA, USA (former)


According to our database1, Nan Du authored at least 43 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Learning to Select the Best Forecasting Tasks for Clinical Outcome Prediction.
CoRR, 2024

Knowledge Graph Reasoning with Self-supervised Reinforcement Learning.
CoRR, 2024

Mixture-of-Experts Meets Instruction Tuning: A Winning Combination for Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
PaLM: Scaling Language Modeling with Pathways.
J. Mach. Learn. Res., 2023

Learning to Skip for Language Modeling.
CoRR, 2023

Flan-MoE: Scaling Instruction-Finetuned Language Models with Sparse Mixture of Experts.
CoRR, 2023

PaLM 2 Technical Report.
CoRR, 2023

DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Brainformers: Trading Simplicity for Efficiency.
Proceedings of the International Conference on Machine Learning, 2023

Lifelong Language Pretraining with Distribution-Specialized Experts.
Proceedings of the International Conference on Machine Learning, 2023

ReAct: Synergizing Reasoning and Acting in Language Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Massively Multilingual Shallow Fusion with Large Language Models.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Designing Effective Sparse Expert Models.
CoRR, 2022

Mixture-of-Experts with Expert Choice Routing.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022


Finetuned Language Models are Zero-Shot Learners.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
R2D2: Relational Text Decoding with Transformers.
CoRR, 2021

2020
Learning to Select Best Forecast Tasks for Clinical Outcome Prediction.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

The Medical Scribe: Corpus Development and Model Performance Analyses.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Deep State-Space Generative Model For Correlated Time-to-Event Predictions.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

2019
Deep Physiological State Space Model for Clinical Forecasting.
CoRR, 2019

Learning to Infer Entities, Properties and their Relations from Clinical Conversations.
CoRR, 2019

Learning to Infer Entities, Properties and their Relations from Clinical Conversations.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Extracting Symptoms and their Status from Clinical Conversations.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Learning Temporal Point Processes via Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

2017
Scalable Influence Maximization for Multiple Products in Continuous-Time Diffusion Networks.
J. Mach. Learn. Res., 2017

2016
Modeling, learning, and inference of high-dimensional asynchronous event data.
PhD thesis, 2016

Influence Estimation and Maximization in Continuous-Time Diffusion Networks.
ACM Trans. Inf. Syst., 2016

Coevolutionary Latent Feature Processes for Continuous-Time User-Item Interactions.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Recurrent Marked Temporal Point Processes: Embedding Event History to Vector.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

Isotonic Hawkes Processes.
Proceedings of the 33nd International Conference on Machine Learning, 2016

2015
Time-Sensitive Recommendation From Recurrent User Activities.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Dirichlet-Hawkes Processes with Applications to Clustering Continuous-Time Document Streams.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

Constructing Disease Network and Temporal Progression Model via Context-Sensitive Hawkes Process.
Proceedings of the 2015 IEEE International Conference on Data Mining, 2015

Back to the Past: Source Identification in Diffusion Networks from Partially Observed Cascades.
Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015

2014
Shaping Social Activity by Incentivizing Users.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Learning Time-Varying Coverage Functions.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Influence Function Learning in Information Diffusion Networks.
Proceedings of the 31th International Conference on Machine Learning, 2014

2013
Continuous-Time Influence Maximization for Multiple Items.
CoRR, 2013

Scalable Influence Estimation in Continuous-Time Diffusion Networks.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Uncover Topic-Sensitive Information Diffusion Networks.
Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics, 2013

2012
Learning Networks of Heterogeneous Influence.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012


  Loading...