Yuxin Liu

CoRR, January, 2025

Vid2Sim: Realistic and Interactive Simulation from Video for Urban Navigation.

[DOI]

CoRR, January, 2025

Joint Optimization for 4D Human-Scene Reconstruction in the Wild.

[DOI]

CoRR, January, 2025

2024

Spatial Steerability of GANs via Self-Supervision from Discriminator.

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Experiment-free exoskeleton assistance via learning in simulation.

[DOI]

Israel Dominguez Silva

Nat., June, 2024

In-Domain GAN Inversion for Faithful Reconstruction and Editability.

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Unsupervised Discovery of Steerable Factors When Graph Deep Generative Models Are Entangled.

[DOI]

Trans. Mach. Learn. Res., 2024

Street-View Image Generation From a Bird's-Eye View Layout.

[DOI]

Alexander Swerdlow

Runsheng Xu

IEEE Robotics Autom. Lett., 2024

Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning.

[DOI]

CoRR, 2024

V2XPnP: Vehicle-to-Everything Spatio-Temporal Fusion for Multi-Agent Perception and Prediction.

[DOI]

CoRR, 2024

Verbalized Representation Learning for Interpretable Few-Shot Generalization.

[DOI]

CoRR, 2024

Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels.

[DOI]

CoRR, 2024

CooPre: Cooperative Pretraining for V2X Cooperative Perception.

[DOI]

CoRR, 2024

MetaUrban: A Simulation Platform for Embodied AI in Urban Spaces.

[DOI]

CoRR, 2024

3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting.

[DOI]

CoRR, 2024

Urban Scene Diffusion through Semantic Occupancy Map.

[DOI]

CoRR, 2024

A Holistic Framework Towards Vision-based Traffic Signal Control with Microscopic Simulation.

[DOI]

CoRR, 2024

SimGen: Simulator-conditioned Driving Scene Generation.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Shared Autonomy with IDA: Interventional Diffusion Assistance.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Towards Text-guided 3D Scene Composition.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Efficient 3D Articulated Human Generation with Layered Surface Volumes.

[DOI]

Proceedings of the International Conference on 3D Vision, 2024

2023

GH-Feat: Learning Versatile Generative Hierarchical Features From GANs.

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

GAN Inversion: A Survey.

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning.

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

ChemSpacE: Interpretable and Interactive Chemical Space Exploration.

[DOI]

Trans. Mach. Learn. Res., 2023

SceneWiz3D: Towards Text-guided 3D Scene Composition.

[DOI]

CoRR, 2023

Improving Out-of-Distribution Robustness of Classifiers via Generative Interpolation.

[DOI]

CoRR, 2023

Next Steps for Human-Centered Generative AI: A Technical Perspective.

[DOI]

CoRR, 2023

Spatial Steerability of GANs via Self-Supervision from Discriminator.

[DOI]

CoRR, 2023

Learning from Active Human Involvement through Proxy Value Propagation.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

ScenarioNet: Open-Source Platform for Large-Scale Traffic Scenario Simulation and Modeling.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

V2XP-ASG: Generating Adversarial Scenes for Vehicle-to-Everything Perception.

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2023

TrafficGen: Learning to Generate Diverse and Realistic Traffic Scenarios.

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Towards Smooth Video Composition.

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Guarded Policy Optimization with Imperfect Online Demonstrations.

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

One-Shot Generative Domain Adaptation.

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-aware Scene Synthesis.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

V2V4Real: A Real-World Large-Scale Dataset for Vehicle-to-Vehicle Cooperative Perception.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CAT: Closed-loop Adversarial Training for Safe End-to-End Driving.

[DOI]

Proceedings of the Conference on Robot Learning, 2023

2022

PlaTe: Visually-Grounded Planning With Transformers in Procedural Tasks.

[DOI]

IEEE Robotics Autom. Lett., 2022

InterFaceGAN: Interpreting the Disentangled Face Representation Learned by GANs.

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Disentangled Inference for GANs With Latently Invertible Autoencoder.

[DOI]

Int. J. Comput. Vis., 2022

Exploiting Reward Shifting in Value-Based Deep RL.

[DOI]

CoRR, 2022

Human-AI Shared Control via Frequency-based Policy Dissection.

[DOI]

CoRR, 2022

Action-Conditioned Contrastive Policy Pretraining.

[DOI]

Qihang Zhang

CoRR, 2022

LocATe: End-to-end Localization of Actions in 3D with Transformers.

[DOI]

CoRR, 2022

Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation.

[DOI]

CoRR, 2022

Improving GANs with A Dynamic Discriminator.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Exploit Reward Shifting in Value-Based Deep-RL: Optimistic Curiosity-Based Exploration and Conservative Exploitation via Linear Reward Shaping.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Human-AI Shared Control via Policy Dissection.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

AutoAlign: Pixel-Instance Feature Aggregation for Multi-Modal 3D Object Detection.

[DOI]

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Efficient Learning of Safe Driving Policy via Human-AI Copilot Optimization.

[DOI]

Quanyi Li

Proceedings of the Tenth International Conference on Learning Representations, 2022

Learning to Drive by Watching YouTube Videos: Action-Conditioned Contrastive Policy Pretraining.

[DOI]

Qihang Zhang

Proceedings of the Computer Vision - ECCV 2022, 2022

Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation.

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

3D-aware Image Synthesis via Learning Structural and Textural Representations.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Improving GAN Equilibrium by Raising Spatial Awareness.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers.

[DOI]

Proceedings of the Conference on Robot Learning, 2022

Visual Sound Localization in the Wild by Cross-Modal Interference Erasing.

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

SimIPU: Simple 2D Image and 3D Point Cloud Unsupervised Pre-training for Spatial-Aware Visual Representations.

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Texture Memory-Augmented Deep Patch-Based Image Inpainting.

[DOI]

IEEE Trans. Image Process., 2021

Adversarial Inverse Reinforcement Learning With Self-Attention Dynamics Model.

[DOI]

IEEE Robotics Autom. Lett., 2021

Semantic Hierarchy Emerges in Deep Generative Representations for Scene Synthesis.

[DOI]

Ceyuan Yang

Int. J. Comput. Vis., 2021

STransGAN: An Empirical Study on Transformer in GANs.

[DOI]

CoRR, 2021

MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning.

[DOI]

CoRR, 2021

Safe Exploration by Solving Early Terminated MDP.

[DOI]

CoRR, 2021

Unsupervised Image Transformation Learning via Generative Adversarial Networks.

[DOI]

Kaiwen Zha

CoRR, 2021

Deep Learning for Scene Classification: A Survey.

[DOI]

CoRR, 2021

Data-Efficient Instance Generation from Instance Discrimination.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning to Simulate Self-driven Particles System with Coordinated Policy Optimization.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization.

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Instance Localization for Self-Supervised Detection Pretraining.

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Positional Encoding As Spatial Inductive Bias in GANs.

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Generative Hierarchical Features From Synthesizing Images.

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Closed-Form Factorization of Latent Semantics in GANs.

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Multimodal Motion Prediction With Stacked Transformers.

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Safe Driving via Expert Guided Policy Optimization.

[DOI]

Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

HiABP: Hierarchical Initialized ABP for Unsupervised Representation Learning.

[DOI]

Jiankai Sun

Rui Liu

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Cross-View Semantic Segmentation for Sensing Surroundings.

[DOI]

IEEE Robotics Autom. Lett., 2020

Understanding the role of individual units in a deep neural network.

[DOI]

Proc. Natl. Acad. Sci. USA, 2020

Moments in Time Dataset: One Million Videos for Event Understanding.

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2020

Improving the Generalization of End-to-End Driving through Procedural Generation.

[DOI]

CoRR, 2020

Improving the Fairness of Deep Generative Models without Retraining.

[DOI]

Shuhan Tan

CoRR, 2020

Unsupervised Landmark Learning from Unpaired Data.

[DOI]

CoRR, 2020

Video Representation Learning with Visual Tempo Consistency.

[DOI]

CoRR, 2020

Non-local Policy Optimization via Diversity-regularized Collaborative Exploration.

[DOI]

Hao Sun

CoRR, 2020

Zeroth-Order Supervised Policy Improvement.

[DOI]

CoRR, 2020

Novel Policy Seeking with Constrained Optimization.

[DOI]

CoRR, 2020

Evolutionary Stochastic Policy Distillation.

[DOI]

CoRR, 2020

Interpreting Generative Adversarial Networks for Interactive Image Generation.

[DOI]

Proceedings of the xxAI - Beyond Explainable AI, 2020

In-Domain GAN Inversion for Real Image Editing.

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

A Unified Framework for Shot Type Classification Based on Subject Centric Lens.

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

TransMoMo: Invariance-Driven Unsupervised Video Motion Retargeting.

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Temporal Pyramid Network for Action Recognition.

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Interpreting the Latent Space of GANs for Semantic Face Editing.

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

A Local-to-Global Approach to Multi-Modal Movie Scene Segmentation.

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Image Processing Using Multi-Code GAN Prior.

[DOI]

Jinjin Gu

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

TPNet: Trajectory Proposal Network for Motion Prediction.

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Neuro-Symbolic Program Search for Autonomous Driving Decision Module Design.

[DOI]

Proceedings of the 4th Conference on Robot Learning, 2020

Learning a Decision Module by Imitating Driver's Control Behaviors.

[DOI]

Proceedings of the 4th Conference on Robot Learning, 2020

Every Frame Counts: Joint Learning of Video Segmentation and Optical Flow.

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Comparing the Interpretability of Deep Networks via Network Dissection.

[DOI]

Proceedings of the Explainable AI: Interpreting, 2019

Semantic photo manipulation with a generative image prior.

[DOI]

ACM Trans. Graph., 2019

Interpreting Deep Visual Representations via Network Dissection.

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2019

Semantic Understanding of Scenes Through the ADE20K Dataset.

[DOI]

Int. J. Comput. Vis., 2019

Learning Driving Decisions by Imitating Drivers' Control Behaviors.

[DOI]

CoRR, 2019

Cross-view Semantic Segmentation for Sensing Surroundings.

[DOI]

CoRR, 2019

Visualizing and Understanding Generative Adversarial Networks (Extended Abstract).

[DOI]

CoRR, 2019

Proceedings of AAAI 2019 Workshop on Network Interpretability for Deep Learning.

[DOI]

Quanshi Zhang

Lixin Fan

CoRR, 2019

Policy Continuation with Hindsight Inverse Dynamics.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Visualizing and Understanding GANs.

[DOI]

Proceedings of the Deep Generative Models for Highly Structured Data, 2019

GAN Dissection: Visualizing and Understanding Generative Adversarial Networks.

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

A Graph-Based Framework to Bridge Movies and Synopses.

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Reasoning About Human-Object Interactions Through Dual Attention Networks.

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Seeing What a GAN Cannot Generate.

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

DrivingStereo: A Large-Scale Dataset for Stereo Matching in Autonomous Driving Scenarios.

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Deep Flow-Guided Video Inpainting.

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Interpretable representation learning for visual intelligence.

[DOI]

PhD thesis, 2018

Places: A 10 Million Image Database for Scene Recognition.

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2018

FaceFeat-GAN: a Two-Stage Approach for Identity-Preserving Face Synthesis.

[DOI]

CoRR, 2018

Factorizable Net: An Efficient Subgraph-based Framework for Scene Graph Generation.

[DOI]

CoRR, 2018

Revisiting the Importance of Individual Units in CNNs via Ablation.

[DOI]

CoRR, 2018

DeepMiner: Discovering Interpretable Representations for Mammogram Classification and Explanation.

[DOI]

CoRR, 2018

Expert identification of visual primitives used by CNNs during mammogram classification.

[DOI]

Proceedings of the Medical Imaging 2018: Computer-Aided Diagnosis, 2018

Real-Time Object Pose Estimation with Pose Interpreter Networks.

[DOI]

Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

Interpretable Basis Decomposition for Visual Explanation.

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Temporal Relational Reasoning in Videos.

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Unified Perceptual Parsing for Scene Understanding.

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Single Image Intrinsic Decomposition Without a Single Intrinsic Image.

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Factorizable Net: An Efficient Subgraph-Based Framework for Scene Graph Generation.

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Recurrent Residual Module for Fast Inference in Videos.

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Visual Question Generation as Dual Task of Visual Question Answering.

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

Temporal Relational Reasoning in Videos.

[DOI]

Alex Andonian

Antonio Torralba

CoRR, 2017

Visual Question Generation as Dual Task of Visual Question Answering.

[DOI]

CoRR, 2017

Scene Graph Generation from Objects, Phrases and Caption Regions.

[DOI]

CoRR, 2017

SegICP: Integrated deep semantic segmentation and pose estimation.

[DOI]

Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

Open Vocabulary Scene Parsing.

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

Scene Graph Generation from Objects, Phrases and Region Captions.

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

Scene Parsing through ADE20K Dataset.

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Person Search with Natural Language Description.

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Network Dissection: Quantifying Interpretability of Deep Visual Representations.

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Semantic Understanding of Scenes through the ADE20K Dataset.

[DOI]

CoRR, 2016

Places: An Image Database for Deep Scene Understanding.

[DOI]

CoRR, 2016

Learning Deep Features for Discriminative Localization.

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Optimization as Estimation with Gaussian Processes in Bandit Settings.

[DOI]

Zi Wang

Stefanie Jegelka

Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016

2015

Learning Collective Crowd Behaviors with Dynamic Pedestrian-Agents.

[DOI]

Int. J. Comput. Vis., 2015

Simple Baseline for Visual Question Answering.

[DOI]

CoRR, 2015

Object Detectors Emerge in Deep Scene CNNs.

[DOI]

Proceedings of the 3rd International Conference on Learning Representations, 2015

Understanding Intra-Class Knowledge Inside CNN.

[DOI]

CoRR, 2015

ConceptLearner: Discovering visual concepts from weakly labeled image collections.

[DOI]

Vignesh Jagadeesh

Robinson Piramuthu

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014

Measuring Crowd Collectiveness.

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2014

Learning Deep Features for Scene Recognition using Places Database.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Recognizing City Identity via Attribute Analysis of Geo-tagged Images.

[DOI]

Proceedings of the Computer Vision - ECCV 2014, 2014

2013

Measuring Crowd Collectiveness.

[DOI]

Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012

Coherent Filtering: Detecting Coherent Motions from Crowd Clutters.

[DOI]

Proceedings of the Computer Vision - ECCV 2012, 2012

Understanding collective crowd behaviors: Learning a Mixture model of Dynamic pedestrian-Agents.

[DOI]

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011

Modeling Manifold Ways of Scene Perception.

[DOI]

Mengyuan Zhu

Proceedings of the Neural Information Processing - 18th International Conference, 2011

Random field topic model for semantic region analysis in crowded scenes from tracklets.

[DOI]

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010

A Phase Discrepancy Analysis of Object Motion.

[DOI]

Xiaodi Hou

Liqing Zhang

Proceedings of the Computer Vision - ACCV 2010, 2010

2009

Scene Gist: A Holistic Generative Model of Natural Image.

[DOI]