Xiaolong Wang

Orcid: 0000-0003-3150-778X

Affiliations:

UC San Diego, CA, USA
UC Berkeley, CA, USA (former)
Carnegie Mellon University, Robotics Institute, Pittsburgh, PA, USA (former)
Sun Yat-Sen University, Guangzhou, China (former)

According to our database, Xiaolong Wang authored at least 153 papers between 2011 and 2024.

Collaborative distances:

Dijkstra number of three.
Erdős number of three.

Bibliography

2024

HOVER: Versatile Neural Whole-Body Controller for Humanoid Robots.

CoRR, 2024

ACE: A Cross-Platform Visual-Exoskeletons System for Low-Cost Dexterous Teleoperation.

CoRR, 2024

Lessons from Learning to Spin "Pens".

CoRR, 2024

A Simulation Benchmark for Autonomous Racing with Large-Scale Human Data.

Eduardo E. Veas

CoRR, 2024

Bunny-VisionPro: Real-Time Bimanual Dexterous Teleoperation for Imitation Learning.

CoRR, 2024

Open-TeleVision: Teleoperation with Immersive Active Visual Feedback.

CoRR, 2024

Cross-Embodiment Robot Manipulation Skill Transfer using Latent Space Alignment.

Nikolay Atanasov

CoRR, 2024

SpatialRGPT: Grounded Spatial Reasoning in Vision Language Model.

CoRR, 2024

Hierarchical World Models as Visual Whole-Body Humanoid Controllers.

CoRR, 2024

Dynamic Gaussians Mesh: Consistent Mesh Reconstruction from Monocular Videos.

CoRR, 2024

Feature Splatting: Language-Driven Physics-Based Scene Synthesis and Editing.

CoRR, 2024

Visual Whole-Body Control for Legged Loco-Manipulation.

CoRR, 2024

Learning Generalizable Feature Fields for Mobile Manipulation.

Nikolay Atanasov, Sebastian A. Scherer

CoRR, 2024

DNAct: Diffusion Guided Multi-Task 3D Policy Learning.

CoRR, 2024

Expressive Whole-Body Control for Humanoid Robots.

CoRR, 2024

DexTouch: Learning to Seek and Manipulate Objects with Tactile Dexterity.

CoRR, 2024

A Construct-Optimize Approach to Sparse View Synthesis without Camera Pose.

Mukund Varma T., Ravi Ramamoorthi

Proceedings of the ACM SIGGRAPH 2024 Conference Papers, 2024

Robot Synesthesia: In-Hand Manipulation with Visuotactile Sensing.

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Sim2Real Manipulation on Unknown Objects with Tactile-based Reinforcement Learning.

Annabella Macaluso

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

GenSim: Generating Robotic Simulation Tasks via Large Language Models.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

3D Reconstruction with Generalizable Neural Fields using Scene Priors.

Shalini De Mello

Proceedings of the Twelfth International Conference on Learning Representations, 2024

TUVF: Learning Generalizable Texture UV Radiance Fields.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

TD-MPC2: Scalable, Robust World Models for Continuous Control.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Language-Driven Physics-Based Scene Synthesis and Editing via Feature Splatting.

Proceedings of the Computer Vision - ECCV 2024, 2024

Editable Image Elements for Controllable Synthesis.

Michaël Gharbi, Nuno Vasconcelos

Proceedings of the Computer Vision - ECCV 2024, 2024

HOIDiffusion: Generating Realistic 3D Hand-Object Interaction Data.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Pixel Aligned Language Models.

Cordelia Schmid

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

RGBD Objects in the Wild: Scaling Real-World 3D Object Learning from RGB-D Videos.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

CyberDemo: Augmenting Simulated Human Demonstration for Real-World Dexterous Manipulation.

Akhilan Gurumoorthy

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

COLMAP-Free 3D Gaussian Splatting.

Alexei A. Efros

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Image Neural Field Diffusion Models.

Michaël Gharbi

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

ContactArt: Learning 3D Interaction Priors for Category-level Articulated Object and Hand Poses Estimation.

Proceedings of the International Conference on 3D Vision, 2024

2023

Visual Reinforcement Learning With Self-Supervised 3D Representations.

IEEE Robotics Autom. Lett., May, 2023

Learning Continuous Grasping Function With a Dexterous Hand From Human Demonstrations.

IEEE Robotics Autom. Lett., May, 2023

Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis.

CoRR, 2023

IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks.

Yossi Gandelsman

CoRR, 2023

Generalized Animal Imitator: Agile Locomotion with Versatile Motion Prior.

CoRR, 2023

Test-Time Training on Video Streams.

Yossi Gandelsman, Alexei A. Efros

CoRR, 2023

Rotating without Seeing: Towards In-hand Dexterity through Touch.

Proceedings of the Robotics: Science and Systems XIX, Daegu, 2023

AnyTeleop: A General Vision-Based Dexterous Robot Arm-Hand Teleoperation System.

Proceedings of the Robotics: Science and Systems XIX, Daegu, 2023

Elastic Decision Transformer.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Efficient Bimanual Handover and Rearrangement via Symmetry-Aware Actor-Critic Learning.

Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Learning Dense Correspondences between Photos and Sketches.

Proceedings of the International Conference on Machine Learning, 2023

MonoNeRF: Learning Generalizable NeRFs from Monocular Videos without Camera Poses.

Proceedings of the International Conference on Machine Learning, 2023

On Pre-Training for Visuo-Motor Control: Revisiting a Learning-from-Scratch Baseline.

Aravind Rajeswaran

Proceedings of the International Conference on Machine Learning, 2023

Self-Supervised Geometric Correspondence for Category-Level 6D Object Pose Estimation in the Wild.

Shubhankar Borse

Proceedings of the Eleventh International Conference on Learning Representations, 2023

GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation.

Chenhongyi Yang, Shalini De Mello, Elliot J. Crowley

Proceedings of the Eleventh International Conference on Learning Representations, 2023

MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations.

Aravind Rajeswaran

Proceedings of the Eleventh International Conference on Learning Representations, 2023

FeatureNeRF: Learning Generalizable NeRFs by Distilling Foundation Models.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ActorsNeRF: Animatable Few-shot Human Rendering with Generalizable NeRFs.

Nuno Vasconcelos

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Neural Volumetric Memory for Visual Locomotion Control.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models.

Shalini De Mello

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Zero-shot Pose Transfer for Unrigged Stylized 3D Characters.

Shalini De Mello

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Policy Adaptation from Foundation Model Feedback.

Annabella Macaluso

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DexArt: Benchmarking Generalizable Dexterous Manipulation with Articulated Objects.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

GNFactor: Multi-Task Real Robot Learning with Generalizable Neural Feature Fields.

Annabella Macaluso

Proceedings of the Conference on Robot Learning, 2023

Dynamic Handover: Throw and Catch with Bimanual Hands.

Nikolay Atanasov

Proceedings of the Conference on Robot Learning, 2023

Finetuning Offline World Models in the Real World.

Chandramouli Rajagopalan

Proceedings of the Conference on Robot Learning, 2023

2022

Online Adaptation for Implicit Object Tracking and Shape Reconstruction in the Wild.

IEEE Robotics Autom. Lett., 2022

From One Hand to Multiple Hands: Imitation Learning for Dexterous Manipulation From Single-Camera Teleoperation.

IEEE Robotics Autom. Lett., 2022

Look Closer: Bridging Egocentric and Third-Person Views With Transformers for Robotic Manipulation.

Sambaran Ghosal

IEEE Robotics Autom. Lett., 2022

Self-Play and Self-Describe: Policy Adaptation with Vision-Language Foundation Models.

Annabella Macaluso

CoRR, 2022

Multiplane NeRF-Supervised Disentanglement of Depth and Camera Pose from Videos.

CoRR, 2022

Inverse Reinforcement Learning from Diverse Third-Person Videos via Graph Abstraction.

Jonathan Zamora

CoRR, 2022

Category-Level 6D Object Pose Estimation in the Wild: A Semi-Supervised Learning Approach and A New Dataset.

CoRR, 2022

Category-Level 6D Object Pose Estimation in the Wild: A Semi-Supervised Learning Approach and A New Dataset.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Vision-Guided Quadrupedal Locomotion in the Wild with Multi-Modal Delay Randomization.

Chieko Sarah Imai, Marcin Kierebinski

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Temporal Difference Learning for Model Predictive Control.

Proceedings of the International Conference on Machine Learning, 2022

Learning Vision-Guided Quadrupedal Locomotion End-to-End with Cross-Modal Transformers.

Proceedings of the Tenth International Conference on Learning Representations, 2022

Learning Continuous Environment Fields via Implicit Functions.

Shalini De Mello, Ming-Hsuan Yang

Proceedings of the Tenth International Conference on Learning Representations, 2022

DexMV: Imitation Learning for Dexterous Manipulation from Human Videos.

Proceedings of the Computer Vision - ECCV 2022, 2022

Scraping Textures from Natural Images for Synthesis and Editing.

Ming-Hsuan Yang, Alexei A. Efros

Proceedings of the Computer Vision - ECCV 2022, 2022

Learning Implicit Feature Alignment Function for Semantic Segmentation.

Shubhankar Borse

Proceedings of the Computer Vision - ECCV 2022, 2022

Transformers as Meta-learners for Implicit Neural Representations.

Proceedings of the Computer Vision - ECCV 2022, 2022

GIFS: Neural Implicit Function for General Shape Representation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

GroupViT: Semantic Segmentation Emerges from Text Supervision.

Shalini De Mello, Thomas M. Breuel

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Look Outside the Room: Synthesizing A Consistent Long-Term 3D Scene Video from A Single Image.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

CoordGAN: Self-Supervised Dense Correspondences Emerge from GANs.

Shalini De Mello, Nuno Vasconcelos

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Joint Hand Motion and Interaction Hotspots Prediction from Egocentric Videos.

Subarna Tripathi, Somdeb Majumdar

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-Resolution.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning Generalizable Dexterous Manipulation from Human Grasp Affordance.

Proceedings of the Conference on Robot Learning, 2022

DexPoint: Generalizable Point Cloud Reinforcement Learning for Sim-to-Real Dexterous Manipulation.

Proceedings of the Conference on Robot Learning, 2022

Graph Inverse Reinforcement Learning from Diverse Videos.

Jonathan Zamora

Proceedings of the Conference on Robot Learning, 2022

2021

Single RGB-D Camera Teleoperation for General Robotic Manipulation.

Henrik I. Christensen

CoRR, 2021

Disentangled Attention as Intrinsic Regularization for Bimanual Multi-Object Manipulation.

CoRR, 2021

NovelD: A Simple yet Effective Exploration Criterion.

Joseph E. Gonzalez

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Multi-Person 3D Motion Prediction with Multi-Range Transformers.

Medhini Narasimhan

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Test-Time Personalization with a Transformer for Human Pose Estimation.

Nitesh B. Gundavarapu

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

State-Only Imitation Learning for Dexterous Manipulation.

Ilija Radosavovic

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Generalization in Reinforcement Learning by Soft Data Augmentation.

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Compositional Video Synthesis with Action Graphs.

Proceedings of the 38th International Conference on Machine Learning, 2021

Learning Cross-Domain Correspondence for Control with Dynamics Cycle-Consistency.

Alexei A. Efros

Proceedings of the 9th International Conference on Learning Representations, 2021

What Should Not Be Contrastive in Contrastive Learning.

Alexei A. Efros

Proceedings of the 9th International Conference on Learning Representations, 2021

Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization.

Simon Shaolei Du

Proceedings of the 9th International Conference on Learning Representations, 2021

Learning Long-term Visual Dynamics with Region Proposal Interaction Networks.

Proceedings of the 9th International Conference on Learning Representations, 2021

Solving Compositional Reinforcement Learning Problems via Task Reduction.

Proceedings of the 9th International Conference on Learning Representations, 2021

Self-Supervised Policy Adaptation during Deployment.

Guillem Alenyà, Alexei A. Efros

Proceedings of the 9th International Conference on Learning Representations, 2021

Rethinking Self-supervised Correspondence Learning: A Video Frame-level Similarity Perspective.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Region Similarity Representation Learning.

Colorado J. Reed

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Contrastive Learning of Image Representations with Cross-Video Cycle-Consistency.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

A-SDF: Learning Disentangled Signed Distance Functions for Articulated Shape Representation.

Adam Kortylewski, Nuno Vasconcelos

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Rethinking preventing class-collapsing in metric learning with margin-based losses.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Video Autoencoder: self-supervised disentanglement of static 3D structure and motion.

Alexei A. Efros

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Hand-Object Contact Consistency Reasoning for Human Grasps Generation.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Meta-Baseline: Exploring Simple Meta-Learning for Few-Shot Learning.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Robust Object Detection via Instance-Level Temporal Cycle Confusion.

Thomas E. Huang, Joseph E. Gonzalez

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Synthesizing Long-Term 3D Human Motion and Interaction in 3D Scenes.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Semi-Supervised 3D Hand-Object Poses Estimation With Interactions in Time.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning Continuous Image Representation With Local Implicit Image Function.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

BeBold: Exploration Beyond the Boundary of Explored Regions.

Joseph E. Gonzalez

CoRR, 2020

Multi-Agent Collaboration via Reward Attribution Decomposition.

Joseph E. Gonzalez

CoRR, 2020

Self-Supervised Policy Adaptation during Deployment.

Alexei A. Efros

CoRR, 2020

Compositional Video Synthesis with Action Graphs.

CoRR, 2020

Reducing Class Collapse in Metric Learning with Easy Positive Sampling.

CoRR, 2020

A New Meta-Baseline for Few-Shot Learning.

CoRR, 2020

Multi-Task Reinforcement Learning with Soft Modularization.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Online Adaptation for Consistent Mesh Reconstruction in the Wild.

Shalini De Mello, Ming-Hsuan Yang

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Test-Time Training with Self-Supervision for Generalization under Distribution Shifts.

Alexei A. Efros

Proceedings of the 37th International Conference on Machine Learning, 2020

Deep Isometric Learning for Visual Recognition.

Proceedings of the 37th International Conference on Machine Learning, 2020

Evolutionary Population Curriculum for Scaling Multi-Agent Reinforcement Learning.

Proceedings of the 8th International Conference on Learning Representations, 2020

Hierarchical Style-Based Networks for Motion Synthesis.

Proceedings of the Computer Vision - ECCV 2020, 2020

Something-Else: Compositional Action Recognition With Spatial-Temporal Interaction Networks.

Joanna Materzynska

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Learning and Reasoning with Visual Correspondence in Time.

PhD thesis, 2019

Test-Time Training for Out-of-Distribution Generalization.

Alexei A. Efros

CoRR, 2019

Joint-task Self-supervised Learning for Temporal Correspondence.

Shalini De Mello, Ming-Hsuan Yang

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Visual Semantic Navigation using Scene Priors.

Roozbeh Mottaghi

Proceedings of the 7th International Conference on Learning Representations, 2019

Spatio-Temporal Action Graph Networks.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Learning Correspondence From the Cycle-Consistency of Time.

Alexei A. Efros

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Putting Humans in a Scene: Learning Affordance in 3D Indoor Environments.

Ming-Hsuan Yang

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Interpretable Intuitive Physics Model.

Proceedings of the Computer Vision - ECCV 2018, 2018

Videos as Space-Time Region Graphs.

Proceedings of the Computer Vision - ECCV 2018, 2018

3D Human Pose Estimation in the Wild by Adversarial Learning.

Jimmy S. J. Ren

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Zero-Shot Recognition via Semantic Embeddings and Knowledge Graphs.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Non-Local Neural Networks.

Ross B. Girshick

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

Temporal Dynamic Graph LSTM for Action-Driven Video Object Detection.

Proceedings of the IEEE International Conference on Computer Vision, 2017

Transitive Invariance for Self-Supervised Visual Representation Learning.

Proceedings of the IEEE International Conference on Computer Vision, 2017

A-Fast-RCNN: Hard Positive Generation via Adversary for Object Detection.

Abhinav Shrivastava

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Binge Watching: Scaling Affordance Learning from Sitcoms.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Generative Image Modeling Using Style and Structure Adversarial Networks.

Proceedings of the Computer Vision - ECCV 2016, 2016

Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding.

Gunnar A. Sigurdsson

Proceedings of the Computer Vision - ECCV 2016, 2016

Actions ~ Transformations.

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015

Discriminatively Trained And-Or Graph Models for Object Shape Detection.

IEEE Trans. Pattern Anal. Mach. Intell., 2015

In Defense of the Direct Perception of Affordances.

David F. Fouhey

CoRR, 2015

Unsupervised Learning of Visual Representations Using Videos.

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Designing deep networks for surface normal estimation.

David F. Fouhey

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014

Deep Joint Task Learning for Generic Object Extraction.

Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

3D Human Activity Recognition with Reconfigurable Convolutional Neural Networks.

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

An expressive deep model for human action parsing from a single image.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

2013

Incorporating Structural Alternatives and Sharing into Hierarchy for Multiclass Object Recognition and Detection.

Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012

Dynamical And-Or Graph Learning for Object Shape Modeling and Detection.

Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Learning contour-fragment-based shape model with And-Or tree representation.

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011

Interactive CT image segmentation with online discriminative learning.

Proceedings of the 18th IEEE International Conference on Image Processing, 2011
