Yuke Zhu

According to our database1, Yuke Zhu authored at least 134 papers between 2012 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Interpretable machine learning model for predicting acute kidney injury in critically ill patients.
BMC Medical Informatics Decis. Mak., December, 2024

An AI-Aided Algorithm for Multivariate Polynomial Reconstruction on Cartesian Grids and the PLG Finite Difference Method.
J. Sci. Comput., December, 2024

PRIME: Scaffolding Manipulation Tasks With Behavior Primitives for Data-Efficient Imitation Learning.
IEEE Robotics Autom. Lett., October, 2024

Voyager: An Open-Ended Embodied Agent with Large Language Models.
Trans. Mach. Learn. Res., 2024

Granger Causal Interaction Skill Chains.
Trans. Mach. Learn. Res., 2024

Action-conditional implicit visual dynamics for deformable object manipulation.
Int. J. Robotics Res., 2024

SPOT: SE(3) Pose Trajectory Diffusion for Object-Centric Manipulation.
CoRR, 2024

DexMimicGen: Automated Data Generation for Bimanual Dexterous Manipulation via Imitation Learning.
CoRR, 2024

Multi-Task Interactive Robot Fleet Learning with Visual World Models.
CoRR, 2024

One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation.
CoRR, 2024

HOVER: Versatile Neural Whole-Body Controller for Humanoid Robots.
CoRR, 2024

Harmon: Whole-Body Motion Generation of Humanoid Robots from Language Descriptions.
CoRR, 2024

OKAMI: Teaching Humanoid Robots Manipulation Skills through Single Video Imitation.
CoRR, 2024

BUMBLE: Unifying Reasoning and Acting with Vision-Language Models for Building-wide Mobile Manipulation.
CoRR, 2024

KinScene: Model-Based Mobile Manipulation of Articulated Scenes.
CoRR, 2024

PRESTO: Fast motion planning using diffusion models based on key-configuration environment representation.
CoRR, 2024

LongVILA: Scaling Long-Context Visual Language Models for Long Videos.
CoRR, 2024

ARDuP: Active Region Video Diffusion for Universal Policies.
CoRR, 2024

RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots.
CoRR, 2024

DrEureka: Language Model Guided Sim-To-Real Transfer.
CoRR, 2024

Vision-based Manipulation from Single Human Video with Open-World Object Graphs.
CoRR, 2024

InterPreT: Interactive Predicate Learning from Language Feedback for Generalizable Task Planning.
CoRR, 2024

LOTUS: Continual Imitation Learning for Robot Manipulation Through Unsupervised Skill Discovery.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Open X-Embodiment: Robotic Learning Datasets and RT-X Models : Open X-Embodiment Collaboration.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Model-Based Runtime Monitoring with Interactive Imitation Learning.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Doduo: Learning Dense Visual Correspondence from Unsupervised Semantic-Aware Flow.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Eureka: Human-Level Reward Design via Coding Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Building Minimal and Reusable Causal State Abstractions for Reinforcement Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Few-View Object Reconstruction with Unknown Categories and Camera Poses.
Proceedings of the International Conference on 3D Vision, 2024

2023
Foundation Models in Robotics: Applications, Challenges, and the Future.
CoRR, 2023

Edge Wasserstein Distance Loss for Oriented Object Detection.
CoRR, 2023

RotaTR: Detection Transformer for Dense and Rotated Object.
CoRR, 2023

Interactive Robot Learning from Verbal Correction.
CoRR, 2023

Granger-Causal Hierarchical Skill Discovery.
CoRR, 2023

Robot Learning on the Job: Human-in-the-Loop Autonomy and Learning During Deployment.
Proceedings of the Robotics: Science and Systems XIX, Daegu, 2023

Cross-Episodic Curriculum for Transformer Agents.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Symbolic State Space Optimization for Long Horizon Mobile Manipulation Planning.
IROS, 2023

Learning to Walk by Steering: Perceptive Quadrupedal Locomotion in Dynamic Environments.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Ditto in the House: Building Articulation Models of Indoor Scenes through Interactive Perception.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

VIMA: Robot Manipulation with Multimodal Prompts.
Proceedings of the International Conference on Machine Learning, 2023

Deep Imitation Learning for Humanoid Loco-manipulation Through Human Teleoperation.
Proceedings of the 22nd IEEE-RAS International Conference on Humanoid Robots, 2023

Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Fast Monocular Scene Reconstruction with Global-Sparse Local-Dense Grids.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning Generalizable Manipulation Policies with Object-Centric 3D Representations.
Proceedings of the Conference on Robot Learning, 2023

MimicPlay: Long-Horizon Imitation Learning by Watching Human Play.
Proceedings of the Conference on Robot Learning, 2023

MUTEX: Learning Unified Policies from Multimodal Task Specifications.
Proceedings of the Conference on Robot Learning, 2023

MimicGen: A Data Generation System for Scalable Robot Learning using Human Demonstrations.
Proceedings of the Conference on Robot Learning, 2023

Building Compositional Robot Autonomy with Modularity and Abstraction.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Bottom-Up Skill Discovery From Unsegmented Demonstrations for Long-Horizon Robot Manipulation.
IEEE Robotics Autom. Lett., 2022

VIOLA: Imitation Learning for Vision-Based Manipulation with Object Proposal Priors.
CoRR, 2022

VIMA: General Robot Manipulation with Multimodal Prompts.
CoRR, 2022

ACID: Action-Conditional Implicit Visual Dynamics for Deformable Object Manipulation.
Proceedings of the Robotics: Science and Systems XVIII, New York City, NY, USA, June 27, 2022

Pre-Trained Language Models for Interactive Decision-Making.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Visually Grounded Task and Motion Planning for Mobile Manipulation.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

OSCAR: Data-Driven Operational Space Control for Adaptive and Robust Robot Manipulation.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Augmenting Reinforcement Learning with Behavior Primitives for Diverse Manipulation Tasks.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Causal Dynamics Learning for Task-Independent State Abstraction.
Proceedings of the International Conference on Machine Learning, 2022

RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object Interactions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Coopernaut: End-to-End Driving with Cooperative Perception for Networked Vehicles.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Ditto: Building Digital Twins of Articulated Objects from Interaction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

VIOLA: Object-Centric Imitation Learning for Vision-Based Robot Manipulation.
Proceedings of the Conference on Robot Learning, 2022

Learning and Retrieval from Prior Data for Skill-based Imitation Learning.
Proceedings of the Conference on Robot Learning, 2022

1st Place Solution for FungiCLEF 2022 Competition: Fine-grained Open-set Fungi Recognition.
Proceedings of the Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum, Bologna, Italy, September 5th - to, 2022

2021
Reinforcement Learning in Factored Action Spaces using Tensor Decompositions.
CoRR, 2021

Discovering Generalizable Skills via Automated Generation of Diverse Tasks.
Proceedings of the Robotics: Science and Systems XVII, Virtual Event, July 12-16, 2021., 2021

Synergies Between Affordance and Geometry: 6-DoF Grasp Detection via Implicit Representations.
Proceedings of the Robotics: Science and Systems XVII, Virtual Event, July 12-16, 2021., 2021

MultiBench: Multiscale Benchmarks for Multimodal Representation Learning.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Hierarchical Planning for Long-Horizon Manipulation with Geometric and Symbolic Scene Graphs.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Deep Affordance Foresight: Planning Through What Can Be Done in the Future.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Learning Multi-Arm Manipulation Through Collaborative Teleoperation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Fast Uncertainty Quantification for Deep Object Pose Estimation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Emergent Hand Morphology and Control from Optimizing Robust Grasps of Diverse Objects.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Detect, Reject, Correct: Crossmodal Compensation of Corrupted Sensors.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

Coach-Player Multi-agent Reinforcement Learning for Dynamic Team Composition.
Proceedings of the 38th International Conference on Machine Learning, 2021

SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual Policies.
Proceedings of the 38th International Conference on Machine Learning, 2021

Adaptive Procedural Task Generation for Hard-Exploration Problems.
Proceedings of the 9th International Conference on Learning Representations, 2021

DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Dynamic Metric Learning: Towards a Scalable Metric Space To Accommodate Multiple Semantic Scales.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

What Matters in Learning from Offline Human Demonstrations for Robot Manipulation.
Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

Adversarial Skill Chaining for Long-Horizon Robot Manipulation via Terminal State Regularization.
Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

2020
Making Sense of Vision and Touch: Learning Multimodal Representations for Contact-Rich Tasks.
IEEE Trans. Robotics, 2020

Learning task-oriented grasping for tool manipulation from simulated self-supervision.
Int. J. Robotics Res., 2020

Human-in-the-Loop Imitation Learning using Remote Teleoperation.
CoRR, 2020

robosuite: A Modular Simulation Framework and Benchmark for Robot Learning.
CoRR, 2020

OCEAN: Online Task Inference for Compositional Tasks with Context Adaptation.
Proceedings of the Thirty-Sixth Conference on Uncertainty in Artificial Intelligence, 2020

Bongard-LOGO: A New Benchmark for Human-Level Concept Learning and Reasoning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

DualSMC: Tunneling Differentiable Filtering and Planning under Continuous POMDPs.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

6-PACK: Category-level 6D Pose Tracker with Anchor-Based Keypoints.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

KETO: Learning Keypoint Representations for Tool Manipulation.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Spherical Feature Transform for Deep Metric Learning.
Proceedings of the Computer Vision - ECCV 2020, 2020

RubiksNet: Learnable 3D-Shift for Efficient Video Action Recognition.
Proceedings of the Computer Vision - ECCV 2020, 2020

Learning a Contact-Adaptive Controller for Robust, Efficient Legged Locomotion.
Proceedings of the 4th Conference on Robot Learning, 2020

2019
Closing the perception-action loop: towards general-purpose robot autonomy.
PhD thesis, 2019

Causal Induction from Visual Observations for Goal Directed Tasks.
CoRR, 2019

Dual Sequential Monte Carlo: Tunneling Filtering and Planning in Continuous POMDPs.
CoRR, 2019

SURREAL-System: Fully-Integrated Stack for Distributed Deep Reinforcement Learning.
CoRR, 2019

Regression Planning Networks.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Scaling Robot Supervision to Hundreds of Hours with RoboTurk: Robotic Manipulation Dataset through Human Reasoning and Dexterity.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Continuous Relaxation of Symbolic Planner for One-Shot Imitation Learning.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks.
Proceedings of the International Conference on Robotics and Automation, 2019

Situational Fusion of Visual Representation for Visual Navigation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

DenseFusion: 6D Object Pose Estimation by Iterative Dense Fusion.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Neural Task Graphs: Generalizing to Unseen Tasks From a Single Video Demonstration.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Dynamics Learning with Cascaded Variational Inference for Multi-Step Manipulation.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019

2018
Reinforcement and Imitation Learning for Diverse Visuomotor Skills.
Proceedings of the Robotics: Science and Systems XIV, 2018

Neural Task Programming: Learning to Generalize Across Hierarchical Tasks.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Point Pair Features Based Object Recognition with Improved Training Pipeline.
Proceedings of the Intelligent Robotics and Applications - 11th International Conference, 2018

Digital Template System for Measuring Turbine Blade Forging and Its Calibration Method.
Proceedings of the Intelligent Robotics and Applications - 11th International Conference, 2018

ROBOTURK: A Crowdsourcing Platform for Robotic Skill Learning through Imitation.
Proceedings of the 2nd Annual Conference on Robot Learning, 2018

SURREAL: Open-Source Reinforcement Learning Framework and Robot Manipulation Benchmark.
Proceedings of the 2nd Annual Conference on Robot Learning, 2018

2017
Vicus: Exploiting local structures to improve network-based analysis of biological data.
PLoS Comput. Biol., 2017

Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations.
Int. J. Comput. Vis., 2017

AI2-THOR: An Interactive 3D Environment for Visual AI.
CoRR, 2017

AdaPT: Zero-Shot Adaptive Policy Transfer for Stochastic Dynamical Systems.
Proceedings of the Robotics Research, The 18th International Symposium, 2017

Adversarially Robust Policy Learning: Active construction of physically-plausible perturbations.
Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

Target-driven visual navigation in indoor scenes using deep reinforcement learning.
Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Visual Semantic Planning Using Deep Successor Representations.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Knowledge Acquisition for Visual Question Answering via Iterative Querying.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Scene Graph Generation by Iterative Message Passing.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Visual7W: Grounded Question Answering in Images.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Building a Large-scale Multimodal Knowledge Base for Visual Question Answering.
CoRR, 2015

Action Recognition by Hierarchical Mid-Level Action Elements.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

2014
Modelling relational statistics with Bayes Nets.
Mach. Learn., 2014

Reasoning about Object Affordances in a Knowledge Base Representation.
Proceedings of the Computer Vision - ECCV 2014, 2014

StrokeBank: Automating Personalized Chinese Handwriting Generation.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Latent Spatio-temporal Models for Action Localization and Recognition in Nursing Home Surveillance Video.
Proceedings of the 13. IAPR International Conference on Machine Vision Applications, 2013

Graphical Model-Based Learning in High Dimensional Feature Spaces.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012
Random Regression for Bayes Nets Applied to Relational Data.
Proceedings of the 2nd International Workshop on Statistical Relational AI (StaRAI-12), 2012


  Loading...