Zheng Wen

Affiliations:
  • DeepMind, USA
  • Stanford University, USA (PhD 2013)


According to our database1, Zheng Wen authored at least 60 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Online Bandit Learning with Offline Preference Data.
CoRR, 2024

2023
Bridging Imitation and Online Reinforcement Learning: An Optimistic Tale.
Trans. Mach. Learn. Res., 2023

Ensembles for Uncertainty Estimation: Benefits of Prior Functions and Bootstrapping.
Trans. Mach. Learn. Res., 2023

Reinforcement Learning, Bit by Bit.
Found. Trends Mach. Learn., 2023

RLHF and IIA: Perverse Incentives.
CoRR, 2023

Efficient Online Learning with Offline Datasets for Infinite Horizon MDPs: A Bayesian Approach.
CoRR, 2023

Approximate Thompson Sampling via Epistemic Neural Networks.
Proceedings of the Uncertainty in Artificial Intelligence, 2023

Epistemic Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Leveraging Demonstrations to Improve Online Learning: Quality Matters.
Proceedings of the International Conference on Machine Learning, 2023

2022
Robustness of Epinets against Distributional Shifts.
CoRR, 2022

Evaluating high-order predictive distributions in deep learning.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

An Analysis of Ensemble Sampling.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

The Neural Testbed: Evaluating Joint Predictions.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Evaluating Predictive Distributions: Does Bayesian Deep Learning Work?
CoRR, 2021

Evaluating Probabilistic Inference in Deep Learning: Beyond Marginal Predictions.
CoRR, 2021

Epistemic Neural Networks.
CoRR, 2021

2020
Low-rank Tensor Bandits.
CoRR, 2020

On Efficiency in Hierarchical Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Hypermodels for Exploration.
Proceedings of the 8th International Conference on Learning Representations, 2020

Stochastic Online Learning with Probabilistic Graph Feedback.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Deep Exploration via Randomized Value Functions.
J. Mach. Learn. Res., 2019

Waterfall Bandits: Learning to Sell Ads Online.
CoRR, 2019

Bootstrapping Upper Confidence Bound.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits.
Proceedings of the 36th International Conference on Machine Learning, 2019

2018
A Tutorial on Thompson Sampling.
Found. Trends Mach. Learn., 2018

Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits.
CoRR, 2018

New Insights into Bootstrapping for Bandits.
CoRR, 2018

Scalar Posterior Sampling with Applications.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Offline Evaluation of Ranking Policies with Click Models.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

2017
Efficient Reinforcement Learning in Deterministic Systems with Value Function Generalization.
Math. Oper. Res., 2017

Stochastic Low-Rank Bandits.
CoRR, 2017

Posterior Sampling for Large Scale Reinforcement Learning.
CoRR, 2017

Diffusion Independent Semi-Bandit Influence Maximization.
CoRR, 2017

Does Weather Matter?: Causal Analysis of TV Logs.
Proceedings of the 26th International Conference on World Wide Web Companion, 2017

Get to the Bottom: Causal Analysis for User Modeling.
Proceedings of the 25th Conference on User Modeling, Adaptation and Personalization, 2017

An Interactive Points of Interest Guidance System.
Proceedings of the Companion Publication of the 22nd International Conference on Intelligent User Interfaces, 2017

Bernoulli Rank-1 Bandits for Click Feedback.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Online Learning to Rank in Stochastic Click Models.
Proceedings of the 34th International Conference on Machine Learning, 2017

Model-Independent Online Learning for Influence Maximization.
Proceedings of the 34th International Conference on Machine Learning, 2017

Stochastic Rank-1 Bandits.
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

2016
Generalization and Exploration via Randomized Value Functions.
Proceedings of the 33nd International Conference on Machine Learning, 2016

DCM Bandits: Learning to Rank with Multiple Clicks.
Proceedings of the 33nd International Conference on Machine Learning, 2016

2015
A new incompressibility discretization for a hybrid particle MAC grid representation with surface tension.
J. Comput. Phys., 2015

Cascading Bandits.
CoRR, 2015

Combinatorial Cascading Bandits.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Optimal Greedy Diversity for Recommendation.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Cascading Bandits: Learning to Rank in the Cascade Model.
Proceedings of the 32nd International Conference on Machine Learning, 2015

Tight Regret Bounds for Stochastic Combinatorial Semi-Bandits.
Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015

2014
Generalization and Exploration via Randomized Value Functions.
CoRR, 2014

DUM: Diversity-Weighted Utility Maximization for Recommendations.
CoRR, 2014

Diversified Utility Maximization for Recommendations.
Proceedings of the Poster Proceedings of the 8th ACM Conference on Recommender Systems, 2014

Large-Scale Optimistic Adaptive Submodularity.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
A new incompressibility discretization for a hybrid particle MAC grid representation with surface tension.
PhD thesis, 2013

Efficient Exploration and Value Function Generalization in Deterministic Systems.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Adaptive Submodular Maximization in Bandit Setting.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

2012
Semi-implicit surface tension formulation with a Lagrangian surface mesh on an Eulerian simulation grid.
J. Comput. Phys., 2012

2011
Asynchronous Evolution for Fully-Implicit and Semi-Implicit Time Integration.
Comput. Graph. Forum, 2011

2010
A novel algorithm for incompressible flow using only a coarse grid projection.
ACM Trans. Graph., 2010

On the Dynamic Response of a Saturating Static Feedback-Controlled Single Integrator Driven by White Noise.
IEEE Trans. Autom. Control., 2010

2008
On the disturbance response and external stability of a saturating static-feedback-controlled double integrator.
Autom., 2008


  Loading...