2024
Spatial Steerability of GANs via Self-Supervision from Discriminator.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024
In-Domain GAN Inversion for Faithful Reconstruction and Editability.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024
Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation.
CoRR, 2024
Edicho: Consistent Image Editing in the Wild.
CoRR, 2024
DepthLab: From Partial to Complete.
CoRR, 2024
EnvGS: Modeling View-Dependent Appearance with Environment Gaussian.
CoRR, 2024
LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis.
CoRR, 2024
AniDoc: Animation Creation Made Easier.
CoRR, 2024
Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning.
CoRR, 2024
Learning Visual Generative Priors without Text.
CoRR, 2024
PlanarSplatting: Accurate Planar Surface Reconstruction in 3 Minutes.
CoRR, 2024
Mimir: Improving Video Diffusion Models for Precise Text Understanding.
CoRR, 2024
MagicQuill: An Intelligent Interactive Image Editing System.
CoRR, 2024
Framer: Interactive Frame Interpolation.
CoRR, 2024
Rectified Diffusion Guidance for Conditional Generation.
CoRR, 2024
Learning Temporally Consistent Video Depth from Video Diffusion Priors.
CoRR, 2024
InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior.
CoRR, 2024
FlashFace: Human Image Personalization with High-fidelity Identity Preservation.
CoRR, 2024
Contextual AD Narration with Interleaved Multimodal Sequence.
CoRR, 2024
Bridging 3D Gaussian and Mesh for Freeview Video Rendering.
CoRR, 2024
GaussianPrediction: Dynamic 3D Gaussian Prediction for Motion Extrapolation and Free View Synthesis.
Proceedings of the ACM SIGGRAPH 2024 Conference Papers, 2024
MaPa: Text-driven Photorealistic Material Painting for 3D Shapes.
Proceedings of the ACM SIGGRAPH 2024 Conference Papers, 2024
HeadArtist: Text-conditioned 3D Head Generation with Self Score Distillation.
Proceedings of the ACM SIGGRAPH 2024 Conference Papers, 2024
LoTLIP: Improving Language-Image Pre-training for Long Text Understanding.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
UKnow: A Unified Knowledge Protocol with Multimodal Knowledge Graph Datasets for Reasoning and Vision-Language Pre-Training.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Zero-shot Image Editing with Reference Imitation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Research on Intelligent Control Methods for Community Real Population Based on Big Data and Object Detection.
Proceedings of the 2024 7th International Conference on Machine Vision and Applications, 2024
SMaRt: Improving GANs with Score Matching Regularity.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
CCM: Real-Time Controllable Visual Content Creation Using Text-to-Image Consistency Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Lipschitz Singularities in Diffusion Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
DreamLIP: Language-Image Pre-training with Long Captions.
Proceedings of the Computer Vision - ECCV 2024, 2024
Exploring Guided Sampling of Conditional GANs.
Proceedings of the Computer Vision - ECCV 2024, 2024
GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024
SAM-Guided Graph Cut for 3D Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024
LivePhoto: Real Image Animation with Text-Guided Motion Control.
Proceedings of the Computer Vision - ECCV 2024, 2024
Learning 3D-Aware GANs from Unposed Images with Template Feature Field.
Proceedings of the Computer Vision - ECCV 2024, 2024
Real-Time 3D-Aware Portrait Editing from a Single Image.
Proceedings of the Computer Vision - ECCV 2024, 2024
BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
SpatialTracker: Tracking Any 2D Pixels in 3D Space.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Towards More Accurate Diffusion Model Acceleration with a Timestep Tuner.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
A Recipe for Scaling up Text-to-Video Generation with Text-free Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
CoDeF: Content Deformation Fields for Temporally Consistent Video Processing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
AnyDoor: Zero-shot Object-level Image Customization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
4K4D: Real-Time 4D View Synthesis at 4K Resolution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
NEAT: Distilling 3D Wireframes from Neural Attraction Fields.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
ScaNeRF: Scalable Bundle-Adjusting Neural Radiance Fields for Large-Scale Scene Rendering.
ACM Trans. Graph., December, 2023
GH-Feat: Learning Versatile Generative Hierarchical Features From GANs.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023
TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification.
CoRR, 2023
CCM: Adding Conditional Controls to Text-to-Image Consistency Models.
CoRR, 2023
Learning Naturally Aggregated Appearance for Efficient 3D Editing.
CoRR, 2023
GenDeF: Learning Generative Deformation Field for Video Generation.
CoRR, 2023
Towards More Accurate Diffusion Model Acceleration with A Timestep Aligner.
CoRR, 2023
Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis.
CoRR, 2023
Eliminating Lipschitz Singularities in Diffusion Models.
CoRR, 2023
Using Unreliable Pseudo-Labels for Label-Efficient Semantic Segmentation.
CoRR, 2023
Cones 2: Customizable Image Synthesis with Multiple Subjects.
CoRR, 2023
Pulling Target to Source: A New Perspective on Domain Adaptive Semantic Segmentation.
CoRR, 2023
VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation.
CoRR, 2023
UKnow: A Unified Knowledge Protocol for Common-Sense Reasoning and Vision-Language Pre-training.
CoRR, 2023
Spatial Steerability of GANs via Self-Supervision from Discriminator.
CoRR, 2023
LinkGAN: Linking GAN Latents to Pixels for Controllable Image Synthesis.
CoRR, 2023
Learning Modulated Transformation in GANs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Revisiting the Evaluation of Image Synthesis with GANs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
FaceComposer: A Unified Model for Versatile Facial Content Creation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
VideoComposer: Compositional Video Synthesis with Motion Controllability.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Customizable Image Synthesis with Multiple Subjects.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Res-Tuning: A Flexible and Efficient Tuning Paradigm via Unbinding Tuner from Backbone.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Compact Neural Volumetric Video Representations with Dynamic Codebooks.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Composer: Creative and Controllable Image Synthesis with Composable Conditions.
Proceedings of the International Conference on Machine Learning, 2023
Towards Smooth Video Composition.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
LinkGAN: Linking GAN Latents to Pixels for Controllable Image Synthesis.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Regularized Mask Tuning: Uncovering Hidden Knowledge in Pre-trained Vision-Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
One-Shot Generative Domain Adaptation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Scanning Only Once: An End-to-end Framework for Fast Temporal Grounding in Long Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
ViM: Vision Middleware for Unified Downstream Transferring.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Dimensionality-Varying Diffusion Process.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-aware Scene Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
LipFormer: High-fidelity and Generalizable Talking Face Generation with A Pre-learned Facial Codebook.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Balancing Logit Variation for Long-Tailed Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Learning 3D-Aware Image Synthesis with Unknown Pose Distribution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Neural Dependencies Emerging from Learning Massive Categories.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
GLeaD: Improving GANs with A Generator-Leading Task.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
InterFaceGAN: Interpreting the Disentangled Face Representation Learned by GANs.
IEEE Trans. Pattern Anal. Mach. Intell., 2022
Deep Generative Models on 3D Representations: A Survey.
CoRR, 2022
Interpreting Class Conditional GANs with Channel Awareness.
CoRR, 2022
A Unified Model for Multi-class Anomaly Detection.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Improving GANs with A Dynamic Discriminator.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Learning from Future: A Novel Self-Training Framework for Semantic Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Region-Based Semantic Factorization in GANs.
Proceedings of the International Conference on Machine Learning, 2022
3D-Aware Indoor Scene Synthesis with Depth Priors.
Proceedings of the Computer Vision - ECCV 2022, 2022
High-Fidelity GAN Inversion with Padding Space.
Proceedings of the Computer Vision - ECCV 2022, 2022
Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
3D-aware Image Synthesis via Learning Structural and Textural Representations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Improving GAN Equilibrium by Raising Spatial Awareness.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
2021
Semantic Hierarchy Emerges in Deep Generative Representations for Scene Synthesis.
Int. J. Comput. Vis., 2021
Decorating Your Own Bedroom: Locally Controlling Image Generation with Generative Adversarial Networks.
CoRR, 2021
Unsupervised Image Transformation Learning via Generative Adversarial Networks.
CoRR, 2021
Low-Rank Subspaces in GANs.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Data-Efficient Instance Generation from Instance Discrimination.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
CompConv: A Compact Convolution Module for Efficient Feature Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021
Generative Hierarchical Features From Synthesizing Images.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Glancing at the Patch: Anomaly Localization With Global and Local Feature Comparison.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Closed-Form Factorization of Latent Semantics in GANs.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
2020
Improving the Fairness of Deep Generative Models without Retraining.
CoRR, 2020
Residual Knowledge Distillation.
CoRR, 2020
In-Domain GAN Inversion for Real Image Editing.
Proceedings of the Computer Vision - ECCV 2020, 2020
Interpreting the Latent Space of GANs for Semantic Face Editing.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Image Processing Using Multi-Code GAN Prior.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
2018
Feature Matters: A Stage-by-Stage Approach for Knowledge Transfer.
CoRR, 2018
FaceFeat-GAN: a Two-Stage Approach for Identity-Preserving Face Synthesis.
CoRR, 2018
FaceID-GAN: Learning a Symmetry Three-Player GAN for Identity-Preserving Face Synthesis.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018