Bo Dai

Affiliations:
  • Google Brain, USA
  • Georgia Institute of Technology, Atlanta, GA, USA (PhD)
  • Chinese Academy of Science, Institute of Automation, NLPR/LIAMA, Beijing, China (former)


According to our database1, Bo Dai authored at least 137 papers between 2010 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Matryoshka: Learning to Drive Black-Box LLMs with LLMs.
CoRR, 2024

Faster WIND: Accelerating Iterative Best-of-<i>N</i> Distillation for LLM Alignment.
CoRR, 2024

Primal-Dual Spectral Representation for Off-policy Evaluation.
CoRR, 2024

Scalable spectral representations for network multiagent control.
CoRR, 2024

VideoAgent: Self-Improving Video Generation.
CoRR, 2024

EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing.
CoRR, 2024

Spectral Representation for Causal Estimation with Hidden Confounders.
CoRR, 2024

UQE: A Query Engine for Unstructured Databases.
CoRR, 2024

Diffusion Spectral Representation for Reinforcement Learning.
CoRR, 2024

HYDRA: Model Factorization Framework for Black-Box LLM Personalization.
CoRR, 2024

Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF.
CoRR, 2024

Efficient Duple Perturbation Robustness in Low-rank MDPs.
CoRR, 2024

Skill Transfer and Discovery for Sim-to-Real Learning: A Representation-Based Viewpoint.
CoRR, 2024

Beyond Expectations: Learning with Stochastic Dominance Made Practical.
CoRR, 2024

Provable Representation with Efficient Planning for Partially Observable Reinforcement Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

BBox-Adapter: Lightweight Adapting for Black-Box Large Language Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Probabilistic Adaptation of Black-Box Text-to-Video Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
DF2: Distribution-Free Decision-Focused Learning.
CoRR, 2023

Probabilistic Adaptation of Text-to-Video Models.
CoRR, 2023

Energy-based Predictive Representations for Partially Observed Reinforcement Learning.
Proceedings of the Uncertainty in Artificial Intelligence, 2023

AdaPlanner: Adaptive Planning from Feedback with Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Ordering-based Conditions for Global Convergence of Policy Gradient Methods.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning Universal Policies via Text-Guided Video Generation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Stochastic Gradient Succeeds for Bandits.
Proceedings of the International Conference on Machine Learning, 2023

Score-based Continuous-time Discrete Diffusion Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Any-scale Balanced Samplers for Discrete Space.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Spectral Decomposition Representation for Reinforcement Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Latent Variable Representation for Reinforcement Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

On Task-personalized Multimodal Few-shot Learning for Visually-rich Document Entity Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Stochastic Nonlinear Control via Finite-dimensional Spectral Dynamic Embedding.
Proceedings of the 62nd IEEE Conference on Decision and Control, 2023

Discrete Langevin Samplers via Wasserstein Gradient Flow.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

Learning to Optimize with Stochastic Dominance Constraints.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022
Learning to Optimize with Stochastic Dominance Constraints.
CoRR, 2022

Discrete Langevin Sampler via Wasserstein Gradient Flow.
CoRR, 2022

SAFER: Data-Efficient and Safe Reinforcement Learning via Skill Acquisition.
CoRR, 2022

On the Effect of Log-Barrier Regularization in Decentralized Softmax Gradient Play in Multiagent Systems.
CoRR, 2022

Can Small Heads Help? Understanding and Improving Multi-Task Generalization.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

A free lunch from the noise: Provable and practical exploration for representation learning.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

On the Global Convergence Rates of Decentralized Softmax Gradient Play in Markov Potential Games.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

The Role of Baselines in Policy Gradient Optimization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Oracle Inequalities for Model Selection in Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Making Linear MDPs Practical via Contrastive Representation Learning.
Proceedings of the International Conference on Machine Learning, 2022

Marginal Distribution Adaptation for Discrete Sets via Module-Oriented Divergence Minimization.
Proceedings of the International Conference on Machine Learning, 2022

Model Selection in Batch Policy Optimization.
Proceedings of the International Conference on Machine Learning, 2022

Understanding and Leveraging Overparameterization in Recursive Value Estimation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Neural Stochastic Dual Dynamic Programming.
Proceedings of the Tenth International Conference on Learning Representations, 2022

SMARTAVE: Structured Multimodal Transformer for Product Attribute Value Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Offline Policy Selection under Uncertainty.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

The Curse of Passive Data Collection in Batch Reinforcement Learning.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

Self-Adaptive Imitation Learning: Learning Tasks with Delayed Rewards from Sub-optimal Demonstrations.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
On the Sample Complexity of Batch Reinforcement Learning with Policy-Induced Data.
CoRR, 2021

Towards understanding retrosynthesis by energy-based models.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Nearly Horizon-Free Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Combiner: Full Attention Transformer with Sparse Computation Cost.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Understanding the Effect of Stochasticity in Policy Optimization.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

On the Optimality of Batch Policy Optimization Algorithms.
Proceedings of the 38th International Conference on Machine Learning, 2021

LEGO: Latent Execution-Guided Reasoning for Multi-Hop Question Answering on Knowledge Graphs.
Proceedings of the 38th International Conference on Machine Learning, 2021

Leveraging Non-uniformity in First-order Non-convex Optimization.
Proceedings of the 38th International Conference on Machine Learning, 2021

Overcoming Catastrophic Forgetting by Bayesian Generative Regularization.
Proceedings of the 38th International Conference on Machine Learning, 2021

Towards Automatic Evaluation of Dialog Systems: A Model-Free Off-Policy Evaluation Approach.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Learning to Defend by Learning to Attack.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

2020
Small Towers Make Big Differences.
CoRR, 2020

Energy-based View of Retrosynthesis.
CoRR, 2020

Provably Efficient Neural Estimation of Structural Equation Model: An Adversarial Approach.
CoRR, 2020

Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations.
CoRR, 2020

Differentiable Top-k Operator with Optimal Transport.
CoRR, 2020

Reinforcement Learning via Fenchel-Rockafellar Duality.
CoRR, 2020

Off-Policy Imitation Learning from Observations.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Off-Policy Evaluation via the Regularized Lagrangian.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Differentiable Top-k with Optimal Transport.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Escaping the Gravitational Pull of Softmax.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Provably Efficient Neural Estimation of Structural Equation Models: An Adversarial Approach.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Learning Discrete Energy-based Models via Auxiliary-variable Local Exploration.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

CoinDICE: Off-Policy Confidence Interval Estimation.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Energy-Based Processes for Exchangeable Data.
Proceedings of the 37th International Conference on Machine Learning, 2020

Batch Stationary Distribution Estimation.
Proceedings of the 37th International Conference on Machine Learning, 2020

Scalable Deep Generative Modeling for Sparse Graphs.
Proceedings of the 37th International Conference on Machine Learning, 2020

GenDICE: Generalized Offline Estimation of Stationary Values.
Proceedings of the 8th International Conference on Learning Representations, 2020

Learning to Plan in High Dimensions via Neural Exploration-Exploitation Trees.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
AlgaeDICE: Policy Gradient from Arbitrary Experience.
CoRR, 2019

Overcoming Catastrophic Forgetting by Generative Regularization.
CoRR, 2019

Learning to Plan via Neural Exploration-Exploitation Trees.
CoRR, 2019

Meta Architecture Search.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Energy-Inspired Models: Learning with Sampler-Induced Distributions.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Exponential Family Estimation via Adversarial Dynamics Embedding.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Retrosynthesis Prediction with Conditional Graph Logic Network.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Revisiting Auxiliary Latent Variables in Generative Models.
Proceedings of the Deep Generative Models for Highly Structured Data, 2019

Learning to Defense by Learning to Attack.
Proceedings of the Deep Generative Models for Highly Structured Data, 2019

Kernel Exponential Family Estimation via Doubly Dual Embedding.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018
Learning over functions, distributions and dynamics via stochastic optimization.
PhD thesis, 2018

Bayesian Meta-network Architecture Learning.
CoRR, 2018

Learning to Defense by Learning to Attack.
CoRR, 2018

Learning Deep Hidden Nonlinear Dynamics from Aggregate Data.
Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, 2018

Predictive Approximate Bayesian Computation via Saddle Points.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Cooperative neural networks (CoNN): Exploiting prior independence structure for improved classification.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Learning towards Minimum Hyperspherical Energy.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Coupled Variational Bayes via Optimization Embedding.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Structured Inference for Recurrent Hidden Semi-markov Model.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Towards Black-box Iterative Machine Teaching.
Proceedings of the 35th International Conference on Machine Learning, 2018

SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation.
Proceedings of the 35th International Conference on Machine Learning, 2018

Learning Steady-States of Iterative Algorithms over Graphs.
Proceedings of the 35th International Conference on Machine Learning, 2018

Syntax-Directed Variational Autoencoder for Structured Data.
Proceedings of the 6th International Conference on Learning Representations, 2018

Boosting the Actor with Dual Critic.
Proceedings of the 6th International Conference on Learning Representations, 2018

Decoupled Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Multi-scale Nystrom Method.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

2017
Smoothed Dual Embedding Control.
CoRR, 2017

Deep Hyperspherical Learning.
CoRR, 2017

Towards Black-box Iterative Machine Teaching.
CoRR, 2017

Iterative Machine Teaching.
CoRR, 2017

Deep Hyperspherical Learning.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Learning from semantically dependent multi-tasks.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Iterative Machine Teaching.
Proceedings of the 34th International Conference on Machine Learning, 2017

Stochastic Generative Hashing.
Proceedings of the 34th International Conference on Machine Learning, 2017

Recurrent Hidden Semi-Markov Model.
Proceedings of the 5th International Conference on Learning Representations, 2017

Learning from Conditional Distributions via Dual Embeddings.
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

2016
A Context-Aware Framework for Reducing Bandwidth Usage of Mobile Video Chats.
IEEE Trans. Multim., 2016

Learning from Conditional Distributions via Dual Kernel Embeddings.
CoRR, 2016

Discriminative Embeddings of Latent Variable Models for Structured Data.
Proceedings of the 33nd International Conference on Machine Learning, 2016

Provable Bayesian Inference via Particle Mirror Descent.
Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016

2015
Scalable Bayesian Inference via Particle Mirror Descent.
CoRR, 2015

2014
Information-Theoretic Semi-Supervised Metric Learning via Entropy Regularization.
Neural Comput., 2014

Scalable Kernel Methods via Doubly Stochastic Gradients.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Nonparametric Estimation of Multi-View Latent Variable Models.
Proceedings of the 31th International Conference on Machine Learning, 2014

Transductive Learning with Multi-class Volume Approximation.
Proceedings of the 31th International Conference on Machine Learning, 2014

2013
Maximum volume clustering: a new discriminative clustering approach.
J. Mach. Learn. Res., 2013

Robust Low Rank Kernel Embeddings of Multivariate Distributions.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Squared-loss Mutual Information Regularization: A Novel Information-theoretic Approach to Semi-supervised Learning.
Proceedings of the 30th International Conference on Machine Learning, 2013

2011
Maximum Volume Clustering.
Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, 2011

2010
Minimum Conditional Entropy Clustering: A Discriminative Framework for Clustering.
Proceedings of the 2nd Asian Conference on Machine Learning, 2010

Rough Margin Based Core Vector Machine.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2010

Compact Margin Machine.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2010

A Multiple Instance Approach for Keyword-Based Retrieval in Un-annotated Image Database.
Proceedings of the Advances in Multimedia Modeling, 2010

Neural-network based regression model with prior from ranking information.
Proceedings of the International Joint Conference on Neural Networks, 2010

Bayesian Maximum Margin Clustering.
Proceedings of the ICDM 2010, 2010


  Loading...