Shixiang Gu

According to our database1, Shixiang Gu authored at least 69 papers between 2012 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Optimal Mapping of Soil Erodibility in a Plateau Lake Watershed: Empirical Models Empowered by Machine Learning.
Remote. Sens., August, 2024

Construction of a High-Resolution Waterlogging Disaster Monitoring Framework Based on the APSIM Model: A Case Study of Jingzhou and Bengbu.
Remote. Sens., July, 2024

Scaling Instruction-Finetuned Language Models.
J. Mach. Learn. Res., 2024

Geometric-Averaged Preference Optimization for Soft Preference Labels.
CoRR, 2024

Multimodal Web Navigation with Instruction-Finetuned Foundation Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Collective Intelligence for 2D Push Manipulations With Mobile Robots.
IEEE Robotics Autom. Lett., May, 2023

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Trans. Mach. Learn. Res., 2023

DreamSparse: Escaping from Plato's Cave with 2D Frozen Diffusion Model Given Sparse Views.
CoRR, 2023

Multimodal Web Navigation with Instruction-Finetuned Foundation Models.
CoRR, 2023

Learning a Universal Human Prior for Dexterous Manipulation from Human Preference.
CoRR, 2023

Bi-Manual Block Assembly via Sim-to-Real Reinforcement Learning.
CoRR, 2023

Aligning Text-to-Image Models using Human Feedback.
CoRR, 2023

DreamSparse: Escaping from Plato's Cave with 2D Diffusion Model Given Sparse Views.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

For SALE: State-Action Representation Learning for Deep Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Mind's Eye: Grounded Language Model Reasoning through Simulation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Large Language Models Can Self-Improve.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
Collective Intelligence for Object Manipulation with Mobile Robots.
CoRR, 2022

Scaling Instruction-Finetuned Language Models.
CoRR, 2022

Deep Billboards towards Lossless Real2Sim in Virtual Reality.
CoRR, 2022

Can Wikipedia Help Offline Reinforcement Learning?
CoRR, 2022

World robot challenge 2020 - partner robot: a data-driven approach for room tidying with mobile manipulator.
Adv. Robotics, 2022

Large Language Models are Zero-Shot Reasoners.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Why So Pessimistic? Estimating Uncertainties for Offline RL through Ensembles, and Why Their Independence Matters.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Blocks Assemble! Learning to Assemble with Large-Scale Structured Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2022

Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error.
Proceedings of the International Conference on Machine Learning, 2022

Generalized Decision Transformer for Offline Hindsight Information Matching.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Tool as Embodiment for Recursive Manipulation.
CoRR, 2021

VaxNeRF: Revisiting the Classic for Voxel-Accelerated Neural Radiance Field.
CoRR, 2021

Amortized Prompt: Lightweight Fine-Tuning for CLIP in Domain Generalization.
CoRR, 2021

Braxlines: Fast and Interactive Toolkit for RL-driven Behavior Engineering beyond Reward Maximization.
CoRR, 2021

Variational Empowerment as Representation Learning for Goal-Based Reinforcement Learning.
CoRR, 2021

Identifying Co-Adaptation of Algorithmic and Implementational Innovations in Deep Reinforcement Learning: A Taxonomy and Case Study of Inference-based Algorithms.
CoRR, 2021

Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

A Minimalist Approach to Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL.
Proceedings of the 38th International Conference on Machine Learning, 2021

Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

Variational Empowerment as Representation Learning for Goal-Conditioned Reinforcement Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
Emergent Real-World Robotic Skills via Unsupervised Off-Policy Reinforcement Learning.
Proceedings of the Robotics: Science and Systems XVI, 2020

Weakly-Supervised Reinforcement Learning for Controllable Behavior.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Dynamics-Aware Unsupervised Discovery of Skills.
Proceedings of the 8th International Conference on Learning Representations, 2020

Human-centric dialog training via offline reinforcement learning.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019
Sample-efficient deep reinforcement learning for continuous control.
PhD thesis, 2019

Why Does Hierarchy (Sometimes) Work So Well in Reinforcement Learning?
CoRR, 2019

Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog.
CoRR, 2019

Language as an Abstraction for Hierarchical Deep Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

SMILe: Scalable Meta Inverse Reinforcement Learning through Context-Conditional Policies.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Doubly Reparameterized Gradient Estimators for Monte Carlo Objectives.
Proceedings of the 7th International Conference on Learning Representations, 2019

Near-Optimal Representation Learning for Hierarchical Reinforcement Learning.
Proceedings of the 7th International Conference on Learning Representations, 2019

Multi-Agent Manipulation via Locomotion using Hierarchical Sim2Real.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019

A Divergence Minimization Perspective on Imitation Learning Methods.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019

2018
Data-Efficient Hierarchical Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

The Mirage of Action-Dependent Baselines in Reinforcement Learning.
Proceedings of the 35th International Conference on Machine Learning, 2018

Temporal Difference Models: Model-Free Deep RL for Model-Based Control.
Proceedings of the 6th International Conference on Learning Representations, 2018

Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement Learning.
Proceedings of the 6th International Conference on Learning Representations, 2018

2017
Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates.
Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-control.
Proceedings of the 34th International Conference on Machine Learning, 2017

Tuning Recurrent Neural Networks with Reinforcement Learning.
Proceedings of the 5th International Conference on Learning Representations, 2017

Categorical Reparameterization with Gumbel-Softmax.
Proceedings of the 5th International Conference on Learning Representations, 2017

Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic.
Proceedings of the 5th International Conference on Learning Representations, 2017

2016
MuProp: Unbiased Backpropagation for Stochastic Neural Networks.
Proceedings of the 4th International Conference on Learning Representations, 2016

Deep Reinforcement Learning for Robotic Manipulation.
CoRR, 2016

Continuous Deep Q-Learning with Model-based Acceleration.
Proceedings of the 33nd International Conference on Machine Learning, 2016

2015
Towards Deep Neural Network Architectures Robust to Adversarial Examples.
Proceedings of the 3rd International Conference on Learning Representations, 2015

Particle Gibbs for Infinite Hidden Markov Models.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Neural Adaptive Sequential Monte Carlo.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

2012
Realtime HDR (High Dynamic Range) video for eyetap wearable computers, FPGA-based seeing aids, and glasseyes (EyeTaps).
Proceedings of the 25th IEEE Canadian Conference on Electrical and Computer Engineering, 2012


  Loading...