We stand with Ukraine

We stand with Ukraine

Ziwei Liu

Orcid: 0000-0002-4220-5958

Affiliations:

Nanyang Technological University, S-Lab, Singapore
Chinese University of Hong Kong, Department of Information Engineering, Hong Kong (PhD)

According to our database¹, Ziwei Liu authored at least 339 papers between 2014 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

Online presence:

On csauthors.net:

Bibliography

2024

Fast-Vid2Vid++: Spatial-Temporal Distillation for Real-Time Video-to-Video Synthesis.

[BibT_eX]

[DOI]

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Class-Incremental Learning: A Survey.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Pair Then Relation: Pair-Net for Panoptic Scene Graph Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Transformer-Based Visual Segmentation: A Survey.

[BibT_eX]

[DOI]

,

,

,

,

,

Guangliang Cheng

,

,

,

Chen Change Loy

IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Playing for 3D Human Recovery.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Chen Change Loy

,

IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Generalized Out-of-Distribution Detection: A Survey.

[BibT_eX]

[DOI]

,

,

,

Int. J. Comput. Vis., December, 2024

PERF: Panoramic Neural Radiance Field From a Single Panorama.

[BibT_eX]

[DOI]

,

,

,

,

Chen Change Loy

,

IEEE Trans. Pattern Anal. Mach. Intell., October, 2024

Exploring Point-BEV Fusion for 3D Point Cloud Object Tracking With Transformer.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., September, 2024

Detecting and Grounding Multi-Modal Media Manipulation and Beyond.

[BibT_eX]

[DOI]

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., August, 2024

ReliTalk: Relightable Talking Portrait Generation from a Single Video.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Int. J. Comput. Vis., August, 2024

MotionDiffuse: Text-Driven Human Motion Generation With Diffusion Model.

[BibT_eX]

[DOI]

,

,

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., June, 2024

Talk-to-Edit: Fine-Grained 2D and 3D Facial Editing via Dialog.

[BibT_eX]

[DOI]

,

,

,

,

Chen Change Loy

,

IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Unified 3D and 4D Panoptic Segmentation via Dynamic Shifting Networks.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Exploiting Hierarchical Interactions for Protein Surface Learning.

[BibT_eX]

[DOI]

,

,

,

,

IEEE J. Biomed. Health Informatics, April, 2024

Guest Editorial: Special Issue on the Promises and Dangers of Large Vision Models.

[BibT_eX]

[DOI]

,

,

,

,

Int. J. Comput. Vis., April, 2024

Open Long-Tailed Recognition in a Dynamic World.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

Robust Partial-to-Partial Point Cloud Registration in a Full Range.

[BibT_eX]

[DOI]

,

,

IEEE Robotics Autom. Lett., 2024

Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2024

FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2024

Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2024

FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Gordon Wetzstein

CoRR, 2024

Imagine360: Immersive 360 Video Generation from Perspective Anchor.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2024

SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters.

[BibT_eX]

[DOI]

,

,

,

Huaizhong Zhang

,

,

,

,

,

,

CoRR, 2024

Material Anything: Generating Materials for Any 3D Object via Diffusion.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2024

Large Multi-modal Models Can Interpret Features in Large Multi-modal Models.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2024

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2024

VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Nattapol Chanpaisit

,

,

,

,

,

,

,

,

,

CoRR, 2024

MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

High-Fidelity Virtual Try-on with Large-Scale Unpaired Learning.

[BibT_eX]

[DOI]

,

,

CoRR, 2024

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Kwan-Yee K. Wong

CoRR, 2024

DynamicCity: Large-Scale LiDAR Generation from Dynamic Scenes.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

VistaDream: Sampling multiview consistent images for single-view scene reconstruction.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation.

[BibT_eX]

[DOI]

,

,

,

Kwan-Yee K. Wong

,

CoRR, 2024

EgoLM: Multi-Modal Language Model of Egocentric Motions.

[BibT_eX]

[DOI]

,

,

,

,

Richard A. Newcombe

,

,

CoRR, 2024

Disco4D: Disentangled 4D Human Generation and Animation from a Single Image.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion.

[BibT_eX]

[DOI]

,

,

,

,

,

Rynson W. H. Lau

CoRR, 2024

LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

Gordon Wetzstein

,

,

CoRR, 2024

LLaVA-OneVision: Easy Visual Task Transfer.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, 2024

Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Toshihiko Yamasaki

,

Kiyoharu Aizawa

CoRR, 2024

Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Hengshuang Zhao

CoRR, 2024

LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models.

[BibT_eX]

[DOI]

,

,

,

,

Joshua Adrian Cahyono

,

,

,

,

,

,

CoRR, 2024

VEnhancer: Generative Space-Time Enhancement for Video Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2024

CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2024

WildAvatar: Web-scale In-the-wild Video Dataset for 3D Avatar Creation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2024

Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

Long Context Transfer from Language to Vision.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, 2024

GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2024

Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2024

Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2024

The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition.

[BibT_eX]

[DOI]

,

,

,

,

,

Benoit R. Cottereau

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Xingliang Huang

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Xiaoqiang Cheng

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

DiffTF++: 3D-aware Diffusion Transformer for Large-Vocabulary 3D Generation.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2024

Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2024

WorldQA: Multimodal World Knowledge in Videos through Long-Chain Reasoning.

[BibT_eX]

[DOI]

,

,

,

,

Christopher Arif Setiadharma

,

,

CoRR, 2024

Make-it-Real: Unleashing Large Multimodal Model's Ability for Painting 3D Objects with Realistic Materials.

[BibT_eX]

[DOI]

,

,

,

,

,

Gordon Wetzstein

,

CoRR, 2024

MMInA: Benchmarking Multihop Multimodal Internet Agents.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2024

Move Anything with Layered Scene Diffusion.

[BibT_eX]

[DOI]

,

,

,

,

,

Antoine Toisoul

CoRR, 2024

FashionEngine: Interactive Generation and Editing of 3D Clothed Humans.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2024

Large Motion Model for Unified Multi-Modal Motion Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Kiyoharu Aizawa

CoRR, 2024

Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2024

InTeX: Interactive Text-to-texture Synthesis via Unified Depth-aware Inpainting.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

3DTopia: Large Text-to-3D Generation Model with Hybrid Diffusion Priors.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, 2024

A Comprehensive Survey on 3D Content Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

Multi-scale 2D Temporal Map Diffusion Models for Natural Language Video Localization.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2024

Latte: Latent Diffusion Transformer for Video Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2024

ReVersion: Diffusion-Based Relation Inversion from Images.

[BibT_eX]

[DOI]

,

,

,

Kelvin C. K. Chan

,

Proceedings of the SIGGRAPH Asia 2024 Conference Papers, 2024

TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the SIGGRAPH Asia 2024 Conference Papers, 2024

ThemeStation: Generating Theme-Aware 3D Assets from Few Exemplars.

[BibT_eX]

[DOI]

,

,

Gerhard P. Hancke

,

,

Rynson W. H. Lau

Proceedings of the ACM SIGGRAPH 2024 Conference Papers, 2024

MMHead: Towards Fine-grained Multi-modal 3D Facial Animation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Duolando: Follower GPT with Off-Policy Reinforcement Learning for Dance Accompaniment.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Chen Change Loy

Proceedings of the Twelfth International Conference on Learning Representations, 2024

FreeNoise: Tuning-Free Longer Video Diffusion via Noise Rescheduling.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion.

[BibT_eX]

[DOI]

,

,

Aliaksandr Siarohin

,

Ivan Skorokhodov

,

,

,

,

,

Sergey Tulyakov

Proceedings of the Twelfth International Conference on Learning Representations, 2024

SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Large-Vocabulary 3D Diffusion Model with Transformer.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

Large Motion Model for Unified Multi-modal Motion Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

WHAC: World-Grounded Humans and Cameras.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Atsushi Yamashita

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time.

[BibT_eX]

[DOI]

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

Octopus: Embodied Vision-Language Programmer from Environmental Feedback.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Chencheng Jiang

,

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

4D Contrastive Superflows are Dense 3D Representation Learners.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

[inline-graphic not available: see fulltext] FunQA: Towards Surprising Video Comprehension.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

FreeInit: Bridging Initialization Gap in Video Diffusion Models.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

LGM: Large Multi-view Gaussian Model for High-Resolution 3D Content Creation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

Nymeria: A Massive Collection of Multimodal Egocentric Daily Motion in the Wild.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Alexander Gamino

,

,

,

,

David Soriano Fosas

,

,

,

,

,

Richard A. Newcombe

Proceedings of the Computer Vision - ECCV 2024, 2024

MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

MMBench: Is Your Multi-modal Model an All-Around Player?

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

GroupDiff: Diffusion-Based Group Portrait Editing.

[BibT_eX]

[DOI]

,

,

,

Krishna Kumar Singh

,

,

Chen Change Loy

,

Proceedings of the Computer Vision - ECCV 2024, 2024

StructLDM: Structured Latent Diffusion for 3D Human Generation.

[BibT_eX]

[DOI]

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

ReSyncer: Rewiring Style-Based Generator for Unified Audio-Visually Synced Facial Performer.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

TC4D: Trajectory-Conditioned Text-to-4D Generation.

[BibT_eX]

[DOI]

Sherwin Bahmani

,

,

,

Ivan Skorokhodov

,

,

,

,

Jeong Joon Park

,

Sergey Tulyakov

,

Gordon Wetzstein

,

Andrea Tagliasacchi

,

David B. Lindell

Proceedings of the Computer Vision - ECCV 2024, 2024

Vlogger: Make Your Dream A Vlog.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

InstructVideo: Instructing Video Diffusion Models with Human Feedback.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Fresco: Spatial-Temporal Correspondence for Zero-Shot Video Translation.

[BibT_eX]

[DOI]

,

,

,

Chen Change Loy

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

CityDreamer: Compositional Generative Model of Unbounded 3D Cities.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

Leonidas J. Guibas

,

,

Gordon Wetzstein

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Towards Language-Driven Video Inpainting via Multimodal Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

Jiangning Zhang

,

,

,

,

,

Chen Change Loy

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SinSR: Diffusion-Based Image Super-Resolution in a Single Step.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

FreeU: Free Lunch in Diffusion U-Net.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Move Anything with Layered Scene Diffusion.

[BibT_eX]

[DOI]

,

,

,

,

,

Antoine Toisoul

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HumanGaussian: Text-Driven 3D Human Generation with Gaussian Splatting.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Multi-Space Alignments Towards Universal LiDAR Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

VBench: Comprehensive Benchmark Suite for Video Generative Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Nattapol Chanpaisit

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SurMo: Surface-based 4D Motion Modeling for Dynamic Human Rendering.

[BibT_eX]

[DOI]

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Digital Life Project: Autonomous 3D Characters with Social Intelligence.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Chen Change Loy

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

URHand: Universal Relightable Hands.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

VideoBooth: Diffusion-based Video Generation with Image Prompts.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Chen Change Loy

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Few-shot forgery detection via Guided Adversarial Interpolation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Pattern Recognit., December, 2023

Bailando++: 3D Dance GPT With Choreographic Memory.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Chen Change Loy

,

IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

SceneDreamer: Unbounded 3D Scene Generation From 2D Image Collections.

[BibT_eX]

[DOI]

,

,

IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Towards Real-World Visual Tracking With Temporal Contexts.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

GP-UNIT: Generative Prior for Versatile Unsupervised Image-to-Image Translation.

[BibT_eX]

[DOI]

,

,

,

Chen Change Loy

IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Full-Spectrum Out-of-Distribution Detection.

[BibT_eX]

[DOI]

,

,

Int. J. Comput. Vis., October, 2023

Variational Relational Point Completion Network for Robust 3D Classification.

[BibT_eX]

[DOI]

,

,

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., September, 2023

Semi-Supervised Domain Generalization with Stochastic StyleMatch.

[BibT_eX]

[DOI]

,

Chen Change Loy

,

Int. J. Comput. Vis., September, 2023

Reference-Based Image and Video Super-Resolution via $C^{2}$-Matching.

[BibT_eX]

[DOI]

,

Kelvin C. K. Chan

,

,

Chen Change Loy

,

IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

Computation-Efficient Knowledge Distillation via Uncertainty-Aware Mixup.

[BibT_eX]

[DOI]

,

,

Chen Change Loy

Pattern Recognit., June, 2023

Lifting 2D Human Pose to 3D with Domain Adapted 3D Body Concept.

[BibT_eX]

[DOI]

,

,

Int. J. Comput. Vis., May, 2023

Domain Generalization: A Survey.

[BibT_eX]

[DOI]

,

,

,

,

Chen Change Loy

IEEE Trans. Pattern Anal. Mach. Intell., April, 2023

DreamGaussian4D: Generative 4D Gaussian Splatting.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2023

Gemini vs GPT-4V: A Preliminary Comparison and Combination of Vision-Language Models Through Qualitative Cases.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Hengshuang Zhao

CoRR, 2023

GauHuman: Articulated Gaussian Splatting from Monocular Human Videos.

[BibT_eX]

[DOI]

,

CoRR, 2023

OtterHD: A High-Resolution Multi-modality Model.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2023

Multimodal Foundation Models for Zero-shot Animal Species Recognition in Camera Trap Images.

[BibT_eX]

[DOI]

,

,

,

,

,

Andrés Hernández

,

Andrés Montes-Rojas

,

,

,

,

Pablo Arbeláez

,

,

Juan Lavista Ferres

CoRR, 2023

SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

Chen Change Loy

,

,

CoRR, 2023

LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Chen Change Loy

,

,

,

,

CoRR, 2023

Robust Sequential DeepFake Detection.

[BibT_eX]

[DOI]

,

,

CoRR, 2023

MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

Chen Change Loy

CoRR, 2023

DeformToon3D: Deformable 3D Toonification from Neural Radiance Fields.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Chen Change Loy

CoRR, 2023

PointHPS: Cascaded 3D Human Pose and Shape Estimation from Point Clouds.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Chen Change Loy

,

,

CoRR, 2023

HumanLiff: Layer-wise 3D Human Generation with Diffusion Model.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2023

Temporally-Adaptive Models for Efficient Video Understanding.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Marcelo H. Ang Jr.

CoRR, 2023

Benchmarking and Analyzing Generative Data for Visual Recognition.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2023

DNA-Rendering: A Diverse Neural Actor Repository for High-Fidelity Human-centric Rendering.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Chen Change Loy

,

,

,

,

,

CoRR, 2023

InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2023

FunQA: Towards Surprising Video Comprehension.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2023

OpenOOD v1.5: Enhanced Benchmark for Out-of-Distribution Detection.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2023

MIMIC-IT: Multi-Modal In-Context Instruction Tuning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2023

DeepFake-Adapter: Dual-Level Adapter for DeepFake Detection.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2023

Learning without Forgetting for Vision-Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2023

SAD: Segment Any RGBD.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2023

RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Chen Change Loy

,

,

,

,

CoRR, 2023

ConsistentNeRF: Enhancing Neural Radiance Fields with 3D Consistency for Sparse View Synthesis.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2023

Otter: A Multi-Modal Model with In-Context Instruction Tuning.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2023

RoboBEV: Towards Robust Bird's Eye View Perception under Corruptions.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2023

SynBody: Synthetic Dataset with Layered Human Models for 3D Human Perception and Modeling.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2023

Revisiting Class-Incremental Learning with Pre-Trained Models: Generalizability and Adaptivity are All You Need.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2023

Make Your Brief Stroke Real and Stereoscopic: 3D-Aware Simplified Sketch to Portrait Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

Chen-Chieh Liao

,

,

,

,

,

,

,

CoRR, 2023

Deep Class-Incremental Learning: A Survey.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2023

OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2023

Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation.

[BibT_eX]

[DOI]

,

,

,

Chen Change Loy

Proceedings of the SIGGRAPH Asia 2023 Conference Papers, 2023

HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the SIGGRAPH Asia 2023 Conference Papers, 2023

Efficient Video Portrait Reenactment via Grid-based Codebook.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

Proceedings of the ACM SIGGRAPH 2023 Conference Proceedings, 2023

What Makes Good Examples for Visual In-Context Learning?

[BibT_eX]

[DOI]

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

FineMoGen: Fine-Grained Spatio-Temporal Motion Generation and Editing.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

4D Panoptic Scene Graph Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

InsActor: Instruction-driven Physics-based Characters.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Towards Robust and Expressive Whole-body Human Pose and Shape Estimation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Chen Change Loy

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Segment Any Point Cloud Sequences by Distilling Vision Foundation Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Large Language Models are Visual Reasoning Coordinators.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

Chen Change Loy

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PrimDiffusion: Volumetric Primitives Diffusion for 3D Human Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Make-A-Volume: Leveraging Latent Diffusion Models for Cross-Modality 3D Brain MRI Synthesis.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Multi-Modal Generative AI with Foundation Models.

[BibT_eX]

[DOI]

Proceedings of the 1st Workshop on Large Generative Models Meet Multimodal Applications, 2023

BiBench: Benchmarking and Analyzing Network Binarization.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the International Conference on Machine Learning, 2023

Make Your Brief Stroke Real and Stereoscopic: 3D-Aware Simplified Sketch to Portrait Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

Chen-Chieh Liao

,

,

,

Proceedings of the 25th International Conference on Multimodal Interaction, 2023

Voxurf: Voxel-based Efficient and Accurate Neural Surface Reconstruction.

[BibT_eX]

[DOI]

,

,

,

,

Christian Theobalt

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

DiffMimic: Efficient Motion Mimicking with Differentiable Physics.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Sparse Mixture-of-Experts are Domain Generalizable Learners.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

EVA3D: Compositional 3D Human Generation from 2D Image Collections.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Masked Frequency Modeling for Self-Supervised Visual Pre-Training.

[BibT_eX]

[DOI]

,

,

,

,

,

Chen Change Loy

Proceedings of the Eleventh International Conference on Learning Representations, 2023

DeformToon3d: Deformable Neural Radiance Fields for 3D Toonification.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Chen Change Loy

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

StyleGANEX: StyleGAN-Based Manipulation Beyond Cropped Aligned Faces.

[BibT_eX]

[DOI]

,

,

,

Chen Change Loy

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SynBody: Synthetic Dataset with Layered Human Models for 3D Human Perception and Modeling.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SparseNeRF: Distilling Depth Ranking for Few-shot Novel View Synthesis.

[BibT_eX]

[DOI]

,

,

Chen Change Loy

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Deep Geometrized Cartoon Line Inbetweening.

[BibT_eX]

[DOI]

,

,

,

,

,

Chen Change Loy

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Robo3D: Towards Robust and Reliable 3D Perception against Corruptions.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Rethinking Range View Representation for LiDAR Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SHERF: Generalizable Human NeRF from a Single Image.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

UnitedHuman: Harnessing Multi-Source Data for High-Resolution Human Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DNA-Rendering: A Diverse Neural Actor Repository for High-Fidelity Human-centric Rendering.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

Chen Change Loy

,

,

,

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Text2Performer: Text-Driven Human Video Generation.

[BibT_eX]

[DOI]

,

,

,

,

Chen Change Loy

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Panoptic Video Scene Graph Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Chen Change Loy

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Detecting and Grounding Multi-Modal Media Manipulation.

[BibT_eX]

[DOI]

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

LaserMix for Semi-Supervised LiDAR Semantic Segmentation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Collaborative Diffusion for Multi-Modal Face Generation and Editing.

[BibT_eX]

[DOI]

,

Kelvin C. K. Chan

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-Based Generator.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

F<sup>2</sup>-NeRF: Fast Neural Radiance Field Training with Free Camera Trajectories.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Christian Theobalt

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Robust Video Portrait Reenactment via Personalized Representation Quantization.

[BibT_eX]

[DOI]

,

Changcheng Liang

,

,

,

,

,

,

,

,

,

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

VToonify: Controllable High-Resolution Portrait Video Style Transfer.

[BibT_eX]

[DOI]

,

,

,

Chen Change Loy

ACM Trans. Graph., 2022

Text2Human: text-driven controllable human image generation.

[BibT_eX]

[DOI]

,

,

,

,

Chen Change Loy

,

ACM Trans. Graph., 2022

AvatarCLIP: zero-shot text-driven generation and animation of 3D avatars.

[BibT_eX]

[DOI]

,

,

,

,

,

ACM Trans. Graph., 2022

Text2Light: Zero-Shot Text-Driven HDR Panorama Generation.

[BibT_eX]

[DOI]

,

,

ACM Trans. Graph., 2022

Chasing the Tail in Monocular 3D Human Reconstruction With Prototype Memory.

[BibT_eX]

[DOI]

,

,

Chen Change Loy

IEEE Trans. Image Process., 2022

CARAFE++: Unified Content-Aware ReAssembly of FEatures.

[BibT_eX]

[DOI]

,

,

,

,

Chen Change Loy

,

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Learning to Prompt for Vision-Language Models.

[BibT_eX]

[DOI]

,

,

Chen Change Loy

,

Int. J. Comput. Vis., 2022

Delving into Inter-Image Invariance for Unsupervised Visual Representations.

[BibT_eX]

[DOI]

,

,

,

,

Chen Change Loy

Int. J. Comput. Vis., 2022

Reference-based Image and Video Super-Resolution via C2-Matching.

[BibT_eX]

[DOI]

,

Kelvin C. K. Chan

,

,

Chen Change Loy

,

CoRR, 2022

TripleE: Easy Domain Generalization via Episodic Replay.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2022

On-Device Domain Generalization.

[BibT_eX]

[DOI]

,

,

,

,

Chen Change Loy

,

CoRR, 2022

StyleFaceV: Face Video Generation via Decomposing and Recomposing Pretrained StyleGAN3.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2022

Neural Prompt Search.

[BibT_eX]

[DOI]

,

,

CoRR, 2022

Sparse Fusion Mixture-of-Experts are Domain Generalizable Learners.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2022

Robust Face Anti-Spoofing with Dual Probabilistic Modeling.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2022

SeCo: Separating Unknown Musical Visual Sounds with Consistency Guidance.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2022

X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, 2022

Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine Synergy.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, 2022

LiDAR-based 4D Panoptic Segmentation via Dynamic Shifting Network.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2022

Masked Lip-Sync Prediction by Audio-Visual Contextual Exploitation in Transformers.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the SIGGRAPH Asia 2022 Conference Papers, 2022

OpenOOD: Benchmarking Generalized Out-of-Distribution Detection.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Benchmarking and Analyzing 3D Human Pose and Shape Estimation Beyond Algorithms.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Audio-Driven Co-Speech Gesture Video Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

AnimeRun: 2D Animation Visual Correspondence from Open Source 3D Movies.

[BibT_eX]

[DOI]

,

,

,

,

,

Chen Change Loy

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Free Lunch for Surgical Video Understanding by Distilling Self-supervisions.

[BibT_eX]

[DOI]

,

,

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Benchmarking and Analyzing Point Cloud Classification under Corruptions.

[BibT_eX]

[DOI]

,

,

Proceedings of the International Conference on Machine Learning, 2022

BiBERT: Accurate Fully Binarized BERT.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

TAda! Temporally-Adaptive Convolutions for Video Understanding.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Marcelo H. Ang Jr.

Proceedings of the Tenth International Conference on Learning Representations, 2022

Fast-Vid2Vid: Spatial-Temporal Compression for Video-to-Video Synthesis.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Computer Vision - ECCV 2022, 2022

CelebV-HQ: A Large-Scale Video Facial Attributes Dataset.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Chen Change Loy

Proceedings of the Computer Vision - ECCV 2022, 2022

Benchmarking Omni-Vision Representation Through the Lens of Visual Realms.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Computer Vision - ECCV 2022, 2022

Panoptic Scene Graph Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2022, 2022

StyleSwap: Style-Based Generator Empowers Robust Face Swapping.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2022, 2022

Mind the Gap in Distilling StyleGANs.

[BibT_eX]

[DOI]

,

,

,

Chen Change Loy

Proceedings of the Computer Vision - ECCV 2022, 2022

StyleLight: HDR Panorama Generation for Lighting Estimation and Editing.

[BibT_eX]

[DOI]

,

,

Chen Change Loy

,

Proceedings of the Computer Vision - ECCV 2022, 2022

Detecting and Recovering Sequential DeepFake Manipulation.

[BibT_eX]

[DOI]

,

,

Proceedings of the Computer Vision - ECCV 2022, 2022

UNIF: United Neural Implicit Functions for Clothed Human Reconstruction and Animation.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Computer Vision - ECCV 2022, 2022

X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2022, 2022

StyleGAN-Human: A Data-Centric Odyssey of Human Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

Chen Change Loy

,

,

Proceedings of the Computer Vision - ECCV 2022, 2022

Relighting4D: Neural Relightable Human from Videos.

[BibT_eX]

[DOI]

,

Proceedings of the Computer Vision - ECCV 2022, 2022

HuMMan: Multi-modal 4D Human Dataset for Versatile Sensing and Modeling.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

Chen Change Loy

,

,

Proceedings of the Computer Vision - ECCV 2022, 2022

Conditional Prompt Learning for Vision-Language Models.

[BibT_eX]

[DOI]

,

,

Chen Change Loy

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Delving Deep into the Generalization of Vision Transformers under Distribution Shifts.

[BibT_eX]

[DOI]

,

,

Shanghang Zhang

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Full-Range Virtual Try-On with Recurrent Tri-Level Transform.

[BibT_eX]

[DOI]

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Unsupervised Image-to-Image Translation with Generative Prior.

[BibT_eX]

[DOI]

,

,

,

Chen Change Loy

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer.

[BibT_eX]

[DOI]

,

,

,

Chen Change Loy

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Bailando: 3D Dance Generation by Actor-Critic GPT with Choreographic Memory.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Chen Change Loy

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Balanced MSE for Imbalanced Visual Regression.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Versatile Multi-Modal Pre-Training for Human-Centric Perception.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

TCTrack: Temporal Contexts for Aerial Tracking.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SepFusion: Finding Optimal Fusion Structures for Visual Sound Separation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Visual Sound Localization in the Wild by Cross-Modal Interference Erasing.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Learning Diverse Fashion Collocation by Neural Graph Filtering.

[BibT_eX]

[DOI]

,

,

,

IEEE Trans. Multim., 2021

Iterative human and automated identification of wildlife images.

[BibT_eX]

[DOI]

,

,

Kaitlyn M. Gaynor

,

Meredith S. Palmer

,

,

Nat. Mach. Intell., 2021

Multi-View Partial (MVP) Point Cloud Challenge 2021 on Completion and Registration: Methods and Results.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Francisco Gómez Fernández

,

,

CoRR, 2021

ForgeryNet - Face Forgery Analysis Challenge 2021: Methods and Results.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2021

Density-aware Chamfer Distance as a Comprehensive Metric for Point Cloud Completion.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2021

Playing for 3D Human Recovery.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

Chen Change Loy

,

CoRR, 2021

Delving Deep into the Generalization of Vision Transformers under Distribution Shifts.

[BibT_eX]

[DOI]

,

,

Shanghang Zhang

,

,

,

,

,

,

,

CoRR, 2021

CelebA-Spoof Challenge 2020 on Face Anti-Spoofing: Methods and Results.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2021

DeeperForensics Challenge 2020 on Real-World Face Forgery Detection: Methods and Results.

[BibT_eX]

[DOI]

,

,

,

,

,

Chen Change Loy

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2021

Person-in-Context Synthesis with Compositional Structural Space.

[BibT_eX]

[DOI]

,

,

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

ShineOn: Illuminating Design Choices for Practical Video-based Virtual Clothing Try-on.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE Winter Conference on Applications of Computer Vision Workshops, 2021

Unsupervised Object-Level Representation Learning from Scene Images.

[BibT_eX]

[DOI]

,

,

,

,

Chen Change Loy

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Balanced Chamfer Distance as a Comprehensive Metric for Point Cloud Completion.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Garment4D: Garment Reconstruction from Point Cloud Sequences.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Few-Shot Object Detection via Association and DIscrimination.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

MMFashion: An Open-Source Toolbox for Visual Fashion Analysis.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Speech2Talking-Face: Inferring and Driving a Face with Synchronized Audio-Visual Representation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Long-tailed Recognition by Routing Diverse Distribution-Aware Experts.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 9th International Conference on Learning Representations, 2021

Do 2D GANs Know 3D Shape? Unsupervised 3D Shape Reconstruction from 2D Image GANs.

[BibT_eX]

[DOI]

,

,

,

Chen Change Loy

,

Proceedings of the 9th International Conference on Learning Representations, 2021

Differentiable Dynamic Wirings for Neural Networks.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Incorporating Convolution Designs into Visual Transformers.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Semantically Coherent Out-of-Distribution Detection.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

BlockPlanner: City Block Generation with Vectorized Graph Representation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Energy-Based Open-World Uncertainty Modeling for Confidence Calibration.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Unsupervised Domain Adaptive 3D Detection with Multi-Level Consistency.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Shanghang Zhang

,

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Talk-to-Edit: Fine-Grained Facial Editing via Dialog.

[BibT_eX]

[DOI]

,

,

,

Chen Change Loy

,

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation.

[BibT_eX]

[DOI]

,

,

,

Chen Change Loy

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Visually Informed Binaural Audio Generation without Binaural Audios.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Adversarial Robustness Under Long-Tailed Distribution.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Seesaw Loss for Long-Tailed Instance Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Chen Change Loy

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Unsupervised Feature Learning by Cross-Level Instance-Group Discrimination.

[BibT_eX]

[DOI]

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Variational Relational Point Completion Network.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Deep Animation Video Interpolation in the Wild.

[BibT_eX]

[DOI]

,

,

,

,

Dimitris N. Metaxas

,

Chen Change Loy

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

LiDAR-Based Panoptic Segmentation via Dynamic Shifting Network.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Robust Reference-Based Super-Resolution via C2-Matching.

[BibT_eX]

[DOI]

,

Kelvin C. K. Chan

,

,

Chen Change Loy

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

PTeacher: a Computer-Aided Personalized Pronunciation Training System with Exaggerated Audio-Visual Corrective Feedback.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the CHI '21: CHI Conference on Human Factors in Computing Systems, 2021

Monocular 3D Reconstruction of Interacting Hands via Collision-Aware Factorized Refinements.

[BibT_eX]

[DOI]

,

,

,

Chen Change Loy

Proceedings of the International Conference on 3D Vision, 2021

2020

ShineOn: Illuminating Design Choices for Practical Video-based Virtual Clothing Try-on.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2020

Person-in-Context Synthesiswith Compositional Structural Space.

[BibT_eX]

[DOI]

,

,

CoRR, 2020

Unsupervised Feature Learning by Cross-Level Discrimination between Instances and Groups.

[BibT_eX]

[DOI]

,

,

CoRR, 2020

Unsupervised Human 3D Pose Representation with Viewpoint and Pose Disentanglement.

[BibT_eX]

[DOI]

,

,

CoRR, 2020

Unsupervised Landmark Learning from Unpaired Data.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2020

Sensing, Understanding and Synthesizing Humans in an Open World.

[BibT_eX]

[DOI]

Proceedings of the HuMA'20: Proceedings of the 1st International Workshop on Human-centric Multimedia Analysis, 2020

Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Computer Vision - ECCV 2020, 2020

CelebA-Spoof: Large-Scale Face Anti-spoofing Dataset with Rich Annotations.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2020, 2020

Knowledge Distillation Meets Self-supervision.

[BibT_eX]

[DOI]

,

,

,

Chen Change Loy

Proceedings of the Computer Vision - ECCV 2020, 2020

Distribution-Balanced Loss for Multi-label Classification in Long-Tailed Datasets.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Computer Vision - ECCV 2020, 2020

Unsupervised 3D Human Pose Representation with Viewpoint and Pose Disentanglement.

[BibT_eX]

[DOI]

,

,

Proceedings of the Computer Vision - ECCV 2020, 2020

Placepedia: Comprehensive Place Understanding with Multi-faceted Annotations.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2020, 2020

Rotate-and-Render: Unsupervised Photorealistic Face Rotation From Single-View Images.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Online Deep Clustering for Unsupervised Representation Learning.

[BibT_eX]

[DOI]

,

,

,

,

Chen Change Loy

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Self-Supervised Scene De-Occlusion.

[BibT_eX]

[DOI]

,

,

,

,

,

Chen Change Loy

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

MaskGAN: Towards Diverse and Interactive Facial Image Manipulation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

When NAS Meets Robustness: In Search of Robust Architectures Against Adversarial Attacks.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Open Compound Domain Adaptation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Dynamic Graph CNN for Learning on Point Clouds.

[BibT_eX]

[DOI]

,

,

,

Sanjay E. Sarma

,

Michael M. Bronstein

,

Justin M. Solomon

ACM Trans. Graph., 2019

When NAS Meets Robustness: In Search of Robust Architectures against Adversarial Attacks.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2019

Learning to Synthesize Fashion Textures.

[BibT_eX]

[DOI]

,

,

,

,

Chen Change Loy

CoRR, 2019

Compound Domain Adaptation in an Open World.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2019

MMDetection: Open MMLab Detection Toolbox and Benchmark.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Chen Change Loy

,

CoRR, 2019

Vision-Infused Deep Audio Inpainting.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

CARAFE: Content-Aware ReAssembly of FEatures.

[BibT_eX]

[DOI]

,

,

,

,

Chen Change Loy

,

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Delving Deep Into Hybrid Annotations for 3D Human Recovery in the Wild.

[BibT_eX]

[DOI]

,

,

,

,

Chen Change Loy

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Self-Supervised Learning via Conditional Motion Propagation.

[BibT_eX]

[DOI]

,

,

,

,

Chen Change Loy

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Hybrid Task Cascade for Instance Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

Chen Change Loy

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Large-Scale Long-Tailed Recognition in an Open World.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

One-shot Face Reenactment.

[BibT_eX]

[DOI]

,

,

,

,

Chen Change Loy

,

Proceedings of the 30th British Machine Vision Conference 2019, 2019

Talking Face Generation by Adversarially Disentangled Audio-Visual Representation.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Instance-Level Facial Attributes Transfer with Geometry-Aware Flow.

[BibT_eX]

[DOI]

,

,

Chen Change Loy

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

DPATCH: An Adversarial Patch Attack on Object Detectors.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Workshop on Artificial Intelligence Safety 2019 co-located with the Thirty-Third AAAI Conference on Artificial Intelligence 2019 (AAAI-19), 2019

2018

Vision-Based Calibration of Dual RCM-Based Robot Arms in Human-Robot Collaborative Minimally Invasive Surgery.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Peter Kazanzides

,

Russell H. Taylor

IEEE Robotics Autom. Lett., 2018

Deep Learning Markov Random Field for Semantic Segmentation.

[BibT_eX]

[DOI]

,

,

,

Chen Change Loy

,

IEEE Trans. Pattern Anal. Mach. Intell., 2018

Adaptive Affinity Field for Semantic Segmentation.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2018

Consensus-Driven Propagation in Massive Unlabeled Data for Face Recognition.

[BibT_eX]

[DOI]

,

,

,

,

Chen Change Loy

Proceedings of the Computer Vision - ECCV 2018, 2018

Adaptive Affinity Fields for Semantic Segmentation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Computer Vision - ECCV 2018, 2018

Mix-and-Match Tuning for Self-Supervised Semantic Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

Chen Change Loy

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Video Object Segmentation with Re-identification.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Chen Change Loy

CoRR, 2017

Unconstrained Fashion Landmark Detection via Hierarchical Recurrent Transformer Networks.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Video Frame Synthesis Using Deep Voxel Flow.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE International Conference on Computer Vision, 2017

Not All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade.

[BibT_eX]

[DOI]

,

,

,

Chen Change Loy

,

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Semantic Facial Expression Editing using Autoencoded Flow.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2016

Fashion Landmark Detection in the Wild.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Computer Vision - ECCV 2016, 2016

DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Face Model Compression by Distilling Knowledge from Neurons.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015

Deep Learning Face Attributes in the Wild.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Semantic Image Segmentation via Deep Parsing Network.

[BibT_eX]

[DOI]

,

,

,

Chen Change Loy

,

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

2014

Fast burst images denoising.

[BibT_eX]

[DOI]

,

,

,

Matt Uyttendaele

,

ACM Trans. Graph., 2014

Loading...