George Tucker

According to our database1, George Tucker authored at least 54 papers between 2013 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Training Language Models to Self-Correct via Reinforcement Learning.
CoRR, 2024

Gemma: Open Models Based on Gemini Research and Technology.
CoRR, 2024

Guided Evolution with Binary Discriminators for ML Program Search.
CoRR, 2024

2023
Gemini: A Family of Highly Capable Multimodal Models.
CoRR, 2023

Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios.
IROS, 2023

Offline Q-learning on Diverse Multi-Task Data Both Scales And Generalizes.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Oracle Inequalities for Model Selection in Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Model Selection in Batch Policy Optimization.
Proceedings of the International Conference on Machine Learning, 2022

DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Offline Policy Selection under Uncertainty.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021
Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization.
CoRR, 2021

Coupled Gradient Estimators for Discrete Latent Variables.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization.
Proceedings of the 9th International Conference on Learning Representations, 2021

Benchmarks for Deep Off-Policy Evaluation.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
RL Unplugged: Benchmarks for Offline Reinforcement Learning.
CoRR, 2020

Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems.
CoRR, 2020

D4RL: Datasets for Deep Data-Driven Reinforcement Learning.
CoRR, 2020

Conservative Q-Learning for Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

DisARM: An Antithetic Gradient Estimator for Binary Latent Variables.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Meta-Learning without Memorization.
Proceedings of the 8th International Conference on Learning Representations, 2020

Model Based Reinforcement Learning for Atari.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
Behavior Regularized Offline Reinforcement Learning.
CoRR, 2019

Reinforcement Learning Driven Heuristic Optimization.
CoRR, 2019

Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction.
CoRR, 2019

Model-Based Reinforcement Learning for Atari.
CoRR, 2019

Learning to Walk Via Deep Reinforcement Learning.
Proceedings of the Robotics: Science and Systems XV, 2019

Don't Blame the ELBO! A Linear VAE Perspective on Posterior Collapse.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Energy-Inspired Models: Learning with Sampler-Induced Distributions.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

On Variational Bounds of Mutual Information.
Proceedings of the 36th International Conference on Machine Learning, 2019

Guided evolutionary strategies: augmenting random search with surrogate gradients.
Proceedings of the 36th International Conference on Machine Learning, 2019

The Laplacian in RL: Learning Representations with Efficient Approximations.
Proceedings of the 7th International Conference on Learning Representations, 2019

Doubly Reparameterized Gradient Estimators for Monte Carlo Objectives.
Proceedings of the 7th International Conference on Learning Representations, 2019

Understanding Posterior Collapse in Generative Latent Variable Models.
Proceedings of the Deep Generative Models for Highly Structured Data, 2019

Revisiting Auxiliary Latent Variables in Generative Models.
Proceedings of the Deep Generative Models for Highly Structured Data, 2019

2018
Soft Actor-Critic Algorithms and Applications.
CoRR, 2018

Guided evolutionary strategies: escaping the curse of dimensionality in random search.
CoRR, 2018

Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

The Mirage of Action-Dependent Baselines in Reinforcement Learning.
Proceedings of the 35th International Conference on Machine Learning, 2018

Smoothed Action Value Functions for Learning Gaussian Policies.
Proceedings of the 35th International Conference on Machine Learning, 2018

Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling.
Proceedings of the 6th International Conference on Learning Representations, 2018

Learning Hard Alignments with Variational Inference.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
An online sequence-to-sequence model for noisy speech recognition.
CoRR, 2017

REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Filtering Variational Objectives.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models.
Proceedings of the 5th International Conference on Learning Representations, 2017

Regularizing Neural Networks by Penalizing Confident Output Distributions.
Proceedings of the 5th International Conference on Learning Representations, 2017

Particle Value Functions.
Proceedings of the 5th International Conference on Learning Representations, 2017

2016
Compacting Neural Network Classifiers via Dropout Training.
CoRR, 2016

Max-pooling loss training of long short-term memory networks for small-footprint keyword spotting.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Model Compression Applied to Small-Footprint Keyword Spotting.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2014
Network topology and parameter estimation: from experimental design methods to gene regulatory network kinetics using a community based approach.
BMC Syst. Biol., 2014

2013
A sampling framework for incorporating quantitative mass spectrometry data in protein interaction analysis.
BMC Bioinform., 2013


  Loading...