Fei Xia

Orcid: 0000-0003-4343-1444

Affiliations:
  • Google DeepMind, Mountain View, CA, USA
  • Stanford University, CA, USA (former)


According to our database1, Fei Xia authored at least 67 papers between 2016 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Gen2Act: Human Video Generation in Novel Scenarios enables Generalizable Robot Manipulation.
CoRR, 2024

Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs.
CoRR, 2024

VADER: Visual Affordance Detection and Error Recovery for Multi Robot Human Collaboration.
CoRR, 2024

GenCHiP: Generating Robot Policy Code for High-Precision and Contact-Rich Manipulation Tasks.
CoRR, 2024

CoNVOI: Context-aware Navigation using Vision Language Models in Outdoor and Indoor Environments.
CoRR, 2024

BEHAVIOR-1K: A Human-Centered, Embodied AI Benchmark with 1, 000 Everyday Activities and Realistic Simulation.
CoRR, 2024

Learning to Learn Faster from Human Feedback with Language Model Predictive Control.
CoRR, 2024

AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents.
CoRR, 2024

SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities.
CoRR, 2024

Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024


Open X-Embodiment: Robotic Learning Datasets and RT-X Models : Open X-Embodiment Collaboration.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Physically Grounded Vision-Language Models for Robotic Manipulation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Chain of Code: Reasoning with a Language Model-Augmented Code Emulator.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Video Language Planning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Generative Expressive Robot Behaviors using Large Language Models.
Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, 2024

SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Open X-Embodiment: Robotic Learning Datasets and RT-X Models.
CoRR, 2023

Principles and Guidelines for Evaluating Social Robot Navigation Algorithms.
CoRR, 2023

Open-World Object Manipulation using Pre-trained Vision-Language Models.
CoRR, 2023

Grounded Decoding: Guiding Text Generation with Grounded Models for Robot Control.
CoRR, 2023

Demonstrating Large Language Models on Robots.
Proceedings of the Robotics: Science and Systems XIX, Daegu, 2023

Scaling Robot Learning with Semantically Imagined Experience.
Proceedings of the Robotics: Science and Systems XIX, Daegu, 2023


Grounded Decoding: Guiding Text Generation with Grounded Models for Embodied Agents.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Code as Policies: Language Model Programs for Embodied Control.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Robotic Table Wiping via Reinforcement Learning and Whole-body Trajectory Optimization.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Sonicverse: A Multisensory Simulation Platform for Embodied Household Agents that See and Hear.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Open-vocabulary Queryable Scene Representations for Real World Planning.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023



Open-World Object Manipulation using Pre-Trained Vision-Language Models.
Proceedings of the Conference on Robot Learning, 2023

Navigation with Large Language Models: Semantic Guesswork as a Heuristic for Planning.
Proceedings of the Conference on Robot Learning, 2023

Robots That Ask For Help: Uncertainty Alignment for Large Language Model Planners.
Proceedings of the Conference on Robot Learning, 2023

Large Language Models as General Pattern Machines.
Proceedings of the Conference on Robot Learning, 2023

FindThis: Language-Driven Object Disambiguation in Indoor Environments.
Proceedings of the Conference on Robot Learning, 2023

Gesture-Informed Robot Assistance via Foundation Models.
Proceedings of the Conference on Robot Learning, 2023



2022
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances.
CoRR, 2022

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning Model Predictive Controllers with Real-Time Attention for Real-World Navigation.
Proceedings of the Conference on Robot Learning, 2022


Inner Monologue: Embodied Reasoning through Planning with Language Models.
Proceedings of the Conference on Robot Learning, 2022


2021
PoseRBPF: A Rao-Blackwellized Particle Filter for 6-D Object Pose Tracking.
IEEE Trans. Robotics, 2021

iGibson 1.0: A Simulation Environment for Interactive Tasks in Large Realistic Scenes.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Probabilistic Visual Navigation with Bidirectional Image Prediction.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

ReLMoGen: Integrating Motion Generation in Reinforcement Learning for Mobile Manipulation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

BEHAVIOR: Benchmark for Everyday Household Activities in Virtual, Interactive, and Ecological Environments.
Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

iGibson 2.0: Object-Centric Simulation for Robot Learning of Everyday Household Tasks.
Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

2020
Interactive Gibson Benchmark: A Benchmark for Interactive Navigation in Cluttered Environments.
IEEE Robotics Autom. Lett., 2020

iGibson, a Simulation Environment for Interactive Tasks in Large Realistic Scenes.
CoRR, 2020

ReLMoGen: Leveraging Motion Generation in Reinforcement Learning for Mobile Manipulation.
CoRR, 2020

Probabilistic Visual Navigation with Bidirectional Image Prediction.
CoRR, 2020

ReferIt3D: Neural Listeners for Fine-Grained 3D Object Identification in Real-World Scenes.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Deep Visual MPC-Policy Learning for Navigation.
IEEE Robotics Autom. Lett., 2019

VUNet: Dynamic Scene View Synthesis for Traversability Estimation Using an RGB Camera.
IEEE Robotics Autom. Lett., 2019

Interactive Gibson: A Benchmark for Interactive Navigation in Cluttered Environments.
CoRR, 2019

A Behavioral Approach to Visual Navigation with Graph Localization Networks.
Proceedings of the Robotics: Science and Systems XV, 2019

Composite Shape Modeling via Latent Space Factorization.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

HRL4IN: Hierarchical Reinforcement Learning for Interactive Navigation with Mobile Manipulators.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019

2018
GONet++: Traversability Estimation via Dynamic Scene View Synthesis.
CoRR, 2018

Chromatin accessibility prediction via a hybrid deep convolutional neural network.
Bioinform., 2018

Gibson Env: Real-World Perception for Embodied Agents.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2016
Partial DNA assembly: A rate-distortion perspective.
Proceedings of the IEEE International Symposium on Information Theory, 2016


  Loading...