Yuke Zhu

Zhixuan Li

J. Sci. Comput., December, 2024

PRIME: Scaffolding Manipulation Tasks With Behavior Primitives for Data-Efficient Imitation Learning.

IEEE Robotics Autom. Lett., October, 2024

Voyager: An Open-Ended Embodied Agent with Large Language Models.

Trans. Mach. Learn. Res., 2024

Granger Causal Interaction Skill Chains.

Trans. Mach. Learn. Res., 2024

Action-conditional implicit visual dynamics for deformable object manipulation.

Bokui Shen

Christopher Bongsoo Choy

Int. J. Robotics Res., 2024

SPOT: SE(3) Pose Trajectory Diffusion for Object-Centric Manipulation.

CoRR, 2024

DexMimicGen: Automated Data Generation for Bimanual Dexterous Manipulation via Imitation Learning.

CoRR, 2024

Multi-Task Interactive Robot Fleet Learning with Visual World Models.

CoRR, 2024

One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation.

CoRR, 2024

HOVER: Versatile Neural Whole-Body Controller for Humanoid Robots.

CoRR, 2024

Harmon: Whole-Body Motion Generation of Humanoid Robots from Language Descriptions.

CoRR, 2024

OKAMI: Teaching Humanoid Robots Manipulation Skills through Single Video Imitation.

CoRR, 2024

BUMBLE: Unifying Reasoning and Acting with Vision-Language Models for Building-wide Mobile Manipulation.

CoRR, 2024

KinScene: Model-Based Mobile Manipulation of Articulated Scenes.

Nur Muhammad (Mahi) Shafiullah

Joydeep Biswas

CoRR, 2024

PRESTO: Fast motion planning using diffusion models based on key-configuration environment representation.

CoRR, 2024

LongVILA: Scaling Long-Context Visual Language Models for Long Videos.

CoRR, 2024

ARDuP: Active Region Video Diffusion for Universal Policies.

CoRR, 2024

RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots.

CoRR, 2024

DrEureka: Language Model Guided Sim-To-Real Transfer.

CoRR, 2024

Vision-based Manipulation from Single Human Video with Open-World Object Graphs.

CoRR, 2024

InterPreT: Interactive Predicate Learning from Language Feedback for Generalizable Task Planning.

CoRR, 2024

LOTUS: Continual Imitation Learning for Robot Manipulation Through Unsupervised Skill Discovery.

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Open X-Embodiment: Robotic Learning Datasets and RT-X Models: Open X-Embodiment Collaboration.

Henrik I. Christensen

Keerthana Gopalakrishnan

Lawrence Yunliang Chen

Subramanian Ramamoorthy

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Model-Based Runtime Monitoring with Interactive Imitation Learning.

Huihan Liu

Shivin Dass

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Doduo: Learning Dense Visual Correspondence from Unsupervised Semantic-Aware Flow.

Hanwen Jiang

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Eureka: Human-Level Reward Design via Coding Large Language Models.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents.

Jake Grigsby

Linxi Fan

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Building Minimal and Reusable Causal State Abstractions for Reinforcement Learning.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Few-View Object Reconstruction with Unknown Categories and Camera Poses.

Proceedings of the International Conference on 3D Vision, 2024

2023

Foundation Models in Robotics: Applications, Challenges, and the Future.

CoRR, 2023

Edge Wasserstein Distance Loss for Oriented Object Detection.

CoRR, 2023

RotaTR: Detection Transformer for Dense and Rotated Object.

CoRR, 2023

Interactive Robot Learning from Verbal Correction.

CoRR, 2023

Granger-Causal Hierarchical Skill Discovery.

CoRR, 2023

Robot Learning on the Job: Human-in-the-Loop Autonomy and Learning During Deployment.

Proceedings of the Robotics: Science and Systems XIX, Daegu, 2023

Cross-Episodic Curriculum for Transformer Agents.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Symbolic State Space Optimization for Long Horizon Mobile Manipulation Planning.

IROS, 2023

Learning to Walk by Steering: Perceptive Quadrupedal Locomotion in Dynamic Environments.

Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Ditto in the House: Building Articulation Models of Indoor Scenes through Interactive Perception.

Cheng-Chun Hsu

Proceedings of the IEEE International Conference on Robotics and Automation, 2023

VIMA: Robot Manipulation with Multimodal Prompts.

Proceedings of the International Conference on Machine Learning, 2023

Deep Imitation Learning for Humanoid Loco-manipulation Through Human Teleoperation.

Proceedings of the 22nd IEEE-RAS International Conference on Humanoid Robots, 2023

Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Fast Monocular Scene Reconstruction with Global-Sparse Local-Dense Grids.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning Generalizable Manipulation Policies with Object-Centric 3D Representations.

Proceedings of the Conference on Robot Learning, 2023

MimicPlay: Long-Horizon Imitation Learning by Watching Human Play.

Proceedings of the Conference on Robot Learning, 2023

MUTEX: Learning Unified Policies from Multimodal Task Specifications.

Rutav Shah

Proceedings of the Conference on Robot Learning, 2023

MimicGen: A Data Generation System for Scalable Robot Learning using Human Demonstrations.

Proceedings of the Conference on Robot Learning, 2023

Building Compositional Robot Autonomy with Modularity and Abstraction.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Bottom-Up Skill Discovery From Unsegmented Demonstrations for Long-Horizon Robot Manipulation.

Yifeng Zhu

Peter Stone

IEEE Robotics Autom. Lett., 2022

VIOLA: Imitation Learning for Vision-Based Manipulation with Object Proposal Priors.

CoRR, 2022

VIMA: General Robot Manipulation with Multimodal Prompts.

CoRR, 2022

ACID: Action-Conditional Implicit Visual Dynamics for Deformable Object Manipulation.

Proceedings of the Robotics: Science and Systems XVIII, New York City, NY, USA, June 27, 2022

Pre-Trained Language Models for Interactive Decision-Making.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Visually Grounded Task and Motion Planning for Mobile Manipulation.

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

OSCAR: Data-Driven Operational Space Control for Adaptive and Robust Robot Manipulation.

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Augmenting Reinforcement Learning with Behavior Primitives for Diverse Manipulation Tasks.

Soroush Nasiriany

Huihan Liu

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Causal Dynamics Learning for Task-Independent State Abstraction.

Proceedings of the International Conference on Machine Learning, 2022

RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning.

Proceedings of the Tenth International Conference on Learning Representations, 2022

Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object Interactions.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Coopernaut: End-to-End Driving with Cooperative Perception for Networked Vehicles.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Ditto: Building Digital Twins of Articulated Objects from Interaction.

Cheng-Chun Hsu

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

VIOLA: Object-Centric Imitation Learning for Vision-Based Robot Manipulation.

Proceedings of the Conference on Robot Learning, 2022

Learning and Retrieval from Prior Data for Skill-based Imitation Learning.

Proceedings of the Conference on Robot Learning, 2022

1st Place Solution for FungiCLEF 2022 Competition: Fine-grained Open-set Fungi Recognition.

Proceedings of the Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum, Bologna, Italy, September 5th - to, 2022

2021

Reinforcement Learning in Factored Action Spaces using Tensor Decompositions.

CoRR, 2021

Discovering Generalizable Skills via Automated Generation of Diverse Tasks.

Proceedings of the Robotics: Science and Systems XVII, Virtual Event, July 12-16, 2021, 2021

Synergies Between Affordance and Geometry: 6-DoF Grasp Detection via Implicit Representations.

Proceedings of the Robotics: Science and Systems XVII, Virtual Event, July 12-16, 2021, 2021

MultiBench: Multiscale Benchmarks for Multimodal Representation Learning.

Louis-Philippe Morency

Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Hierarchical Planning for Long-Horizon Manipulation with Geometric and Symbolic Scene Graphs.

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Deep Affordance Foresight: Planning Through What Can Be Done in the Future.

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Learning Multi-Arm Manipulation Through Collaborative Teleoperation.

Albert Tung

Josiah Wong

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Fast Uncertainty Quantification for Deep Object Pose Estimation.

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Emergent Hand Morphology and Control from Optimizing Robust Grasps of Diverse Objects.

Xinlei Pan

Animesh Garg

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Detect, Reject, Correct: Crossmodal Compensation of Corrupted Sensors.

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning.

Proceedings of the 38th International Conference on Machine Learning, 2021

Coach-Player Multi-agent Reinforcement Learning for Dynamic Team Composition.

Proceedings of the 38th International Conference on Machine Learning, 2021

SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual Policies.

Proceedings of the 38th International Conference on Machine Learning, 2021

Adaptive Procedural Task Generation for Hard-Exploration Problems.

Proceedings of the 9th International Conference on Learning Representations, 2021

DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision.

Shiyi Lan

Zhiding Yu

Christopher B. Choy

Subhashree Radhakrishnan

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Dynamic Metric Learning: Towards a Scalable Metric Space To Accommodate Multiple Semantic Scales.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

What Matters in Learning from Offline Human Demonstrations for Robot Manipulation.

Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK, 2021

Adversarial Skill Chaining for Long-Horizon Robot Manipulation via Terminal State Regularization.

Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK, 2021

2020

Making Sense of Vision and Touch: Learning Multimodal Representations for Contact-Rich Tasks.

IEEE Trans. Robotics, 2020

Learning task-oriented grasping for tool manipulation from simulated self-supervision.

Int. J. Robotics Res., 2020

Human-in-the-Loop Imitation Learning using Remote Teleoperation.

CoRR, 2020

robosuite: A Modular Simulation Framework and Benchmark for Robot Learning.

Josiah Wong

CoRR, 2020

OCEAN: Online Task Inference for Compositional Tasks with Context Adaptation.

Hongyu Ren

Jure Leskovec

Animesh Garg

Proceedings of the Thirty-Sixth Conference on Uncertainty in Artificial Intelligence, 2020

Bongard-LOGO: A New Benchmark for Human-Level Concept Learning and Reasoning.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

DualSMC: Tunneling Differentiable Filtering and Planning under Continuous POMDPs.

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

6-PACK: Category-level 6D Pose Tracker with Anchor-Based Keypoints.

Chen Wang

Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

KETO: Learning Keypoint Representations for Tool Manipulation.

Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Spherical Feature Transform for Deep Metric Learning.

Yan Bai

Yichen Wei

Proceedings of the Computer Vision - ECCV 2020, 2020

RubiksNet: Learnable 3D-Shift for Efficient Video Action Recognition.

Proceedings of the Computer Vision - ECCV 2020, 2020

Learning a Contact-Adaptive Controller for Robust, Efficient Legged Locomotion.

Proceedings of the 4th Conference on Robot Learning, 2020

2019

Closing the perception-action loop: towards general-purpose robot autonomy.

PhD thesis, 2019

Causal Induction from Visual Observations for Goal Directed Tasks.

CoRR, 2019

Dual Sequential Monte Carlo: Tunneling Filtering and Planning in Continuous POMDPs.

CoRR, 2019

SURREAL-System: Fully-Integrated Stack for Distributed Deep Reinforcement Learning.

CoRR, 2019

Regression Planning Networks.

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Scaling Robot Supervision to Hundreds of Hours with RoboTurk: Robotic Manipulation Dataset through Human Reasoning and Dexterity.

Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Continuous Relaxation of Symbolic Planner for One-Shot Imitation Learning.

Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks.

Proceedings of the International Conference on Robotics and Automation, 2019

Situational Fusion of Visual Representation for Visual Navigation.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

DenseFusion: 6D Object Pose Estimation by Iterative Dense Fusion.

Chen Wang

Roberto Martín-Martín

Cewu Lu

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Neural Task Graphs: Generalizing to Unseen Tasks From a Single Video Demonstration.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Dynamics Learning with Cascaded Variational Inference for Multi-Step Manipulation.

Proceedings of the 3rd Annual Conference on Robot Learning, 2019

2018

Reinforcement and Imitation Learning for Diverse Visuomotor Skills.

Saran Tunyasuvunakool

Proceedings of the Robotics: Science and Systems XIV, 2018

Neural Task Programming: Learning to Generalize Across Hierarchical Tasks.

Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Point Pair Features Based Object Recognition with Improved Training Pipeline.

Proceedings of the Intelligent Robotics and Applications - 11th International Conference, 2018

Digital Template System for Measuring Turbine Blade Forging and Its Calibration Method.

Proceedings of the Intelligent Robotics and Applications - 11th International Conference, 2018

ROBOTURK: A Crowdsourcing Platform for Robotic Skill Learning through Imitation.

Proceedings of the 2nd Annual Conference on Robot Learning, 2018

SURREAL: Open-Source Reinforcement Learning Framework and Robot Manipulation Benchmark.

Proceedings of the 2nd Annual Conference on Robot Learning, 2018

2017

Vicus: Exploiting local structures to improve network-based analysis of biological data.

PLoS Comput. Biol., 2017

Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations.

Int. J. Comput. Vis., 2017

AI2-THOR: An Interactive 3D Environment for Visual AI.

CoRR, 2017

AdaPT: Zero-Shot Adaptive Policy Transfer for Stochastic Dynamical Systems.

Proceedings of the Robotics Research, The 18th International Symposium, 2017

Adversarially Robust Policy Learning: Active construction of physically-plausible perturbations.

Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

Target-driven visual navigation in indoor scenes using deep reinforcement learning.

Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Visual Semantic Planning Using Deep Successor Representations.

Proceedings of the IEEE International Conference on Computer Vision, 2017

Knowledge Acquisition for Visual Question Answering via Iterative Querying.

Joseph J. Lim

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Scene Graph Generation by Iterative Message Passing.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Visual7W: Grounded Question Answering in Images.

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015

Building a Large-scale Multimodal Knowledge Base for Visual Question Answering.

CoRR, 2015

Action Recognition by Hierarchical Mid-Level Action Elements.

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

2014

Modelling relational statistics with Bayes Nets.

Oliver Schulte

Hassan Khosravi

Arthur E. Kirkpatrick

Tianxiang Gao

Mach. Learn., 2014

Reasoning about Object Affordances in a Knowledge Base Representation.

Alireza Fathi

Proceedings of the Computer Vision - ECCV 2014, 2014

StrokeBank: Automating Personalized Chinese Handwriting Generation.

Alfred Zong

Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013

Latent Spatio-temporal Models for Action Localization and Recognition in Nursing Home Surveillance Video.

Proceedings of the 13th IAPR International Conference on Machine Vision Applications, 2013

Graphical Model-Based Learning in High Dimensional Feature Spaces.

Zhao Song