2024

Spatial Steerability of GANs via Self-Supervision from Discriminator.

Jianyuan Wang

Lalit Bhagat

IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

In-Domain GAN Inversion for Faithful Reconstruction and Editability.

IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation.

CoRR, 2024

Edicho: Consistent Image Editing in the Wild.

CoRR, 2024

DepthLab: From Partial to Complete.

CoRR, 2024

EnvGS: Modeling View-Dependent Appearance with Environment Gaussian.

CoRR, 2024

LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis.

CoRR, 2024

AniDoc: Animation Creation Made Easier.

CoRR, 2024

Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning.

CoRR, 2024

Learning Visual Generative Priors without Text.

CoRR, 2024

PlanarSplatting: Accurate Planar Surface Reconstruction in 3 Minutes.

CoRR, 2024

Mimir: Improving Video Diffusion Models for Precise Text Understanding.

CoRR, 2024

MagicQuill: An Intelligent Interactive Image Editing System.

CoRR, 2024

Framer: Interactive Frame Interpolation.

CoRR, 2024

Rectified Diffusion Guidance for Conditional Generation.

CoRR, 2024

Learning Temporally Consistent Video Depth from Video Diffusion Priors.

CoRR, 2024

InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior.

CoRR, 2024

FlashFace: Human Image Personalization with High-fidelity Identity Preservation.

CoRR, 2024

Contextual AD Narration with Interleaved Multimodal Sequence.

CoRR, 2024

Bridging 3D Gaussian and Mesh for Freeview Video Rendering.

CoRR, 2024

GaussianPrediction: Dynamic 3D Gaussian Prediction for Motion Extrapolation and Free View Synthesis.

Proceedings of the ACM SIGGRAPH 2024 Conference Papers, 2024

MaPa: Text-driven Photorealistic Material Painting for 3D Shapes.

Proceedings of the ACM SIGGRAPH 2024 Conference Papers, 2024

HeadArtist: Text-conditioned 3D Head Generation with Self Score Distillation.

Proceedings of the ACM SIGGRAPH 2024 Conference Papers, 2024

LoTLIP: Improving Language-Image Pre-training for Long Text Understanding.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

UKnow: A Unified Knowledge Protocol with Multimodal Knowledge Graph Datasets for Reasoning and Vision-Language Pre-Training.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Zero-shot Image Editing with Reference Imitation.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Research on Intelligent Control Methods for Community Real Population Based on Big Data and Object Detection.

Proceedings of the 2024 7th International Conference on Machine Vision and Applications, 2024

SMaRt: Improving GANs with Score Matching Regularity.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

CCM: Real-Time Controllable Visual Content Creation Using Text-to-Image Consistency Models.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Lipschitz Singularities in Diffusion Models.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

DreamLIP: Language-Image Pre-training with Long Captions.

Proceedings of the Computer Vision - ECCV 2024, 2024

Exploring Guided Sampling of Conditional GANs.

Proceedings of the Computer Vision - ECCV 2024, 2024

GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation.

Proceedings of the Computer Vision - ECCV 2024, 2024

SAM-Guided Graph Cut for 3D Instance Segmentation.

Proceedings of the Computer Vision - ECCV 2024, 2024

LivePhoto: Real Image Animation with Text-Guided Motion Control.

Proceedings of the Computer Vision - ECCV 2024, 2024

Learning 3D-Aware GANs from Unposed Images with Template Feature Field.

Proceedings of the Computer Vision - ECCV 2024, 2024

Real-Time 3D-Aware Portrait Editing from a Single Image.

Proceedings of the Computer Vision - ECCV 2024, 2024

BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SpatialTracker: Tracking Any 2D Pixels in 3D Space.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Towards More Accurate Diffusion Model Acceleration with a Timestep Tuner.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

A Recipe for Scaling up Text-to-Video Generation with Text-free Videos.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

CoDeF: Content Deformation Fields for Temporally Consistent Video Processing.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

AnyDoor: Zero-shot Object-level Image Customization.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

4K4D: Real-Time 4D View Synthesis at 4K Resolution.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

NEAT: Distilling 3D Wireframes from Neural Attraction Fields.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

ScaNeRF: Scalable Bundle-Adjusting Neural Radiance Fields for Large-Scale Scene Rendering.

ACM Trans. Graph., December, 2023

GH-Feat: Learning Versatile Generative Hierarchical Features From GANs.

IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification.

CoRR, 2023

CCM: Adding Conditional Controls to Text-to-Image Consistency Models.

CoRR, 2023

Learning Naturally Aggregated Appearance for Efficient 3D Editing.

CoRR, 2023

GenDeF: Learning Generative Deformation Field for Video Generation.

CoRR, 2023

Towards More Accurate Diffusion Model Acceleration with A Timestep Aligner.

CoRR, 2023

Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis.

CoRR, 2023

Eliminating Lipschitz Singularities in Diffusion Models.

CoRR, 2023

Using Unreliable Pseudo-Labels for Label-Efficient Semantic Segmentation.

CoRR, 2023

Cones 2: Customizable Image Synthesis with Multiple Subjects.

CoRR, 2023

Pulling Target to Source: A New Perspective on Domain Adaptive Semantic Segmentation.

CoRR, 2023

VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation.

CoRR, 2023

UKnow: A Unified Knowledge Protocol for Common-Sense Reasoning and Vision-Language Pre-training.

CoRR, 2023

Spatial Steerability of GANs via Self-Supervision from Discriminator.

CoRR, 2023

LinkGAN: Linking GAN Latents to Pixels for Controllable Image Synthesis.

CoRR, 2023

Learning Modulated Transformation in GANs.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Revisiting the Evaluation of Image Synthesis with GANs.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

FaceComposer: A Unified Model for Versatile Facial Content Creation.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

VideoComposer: Compositional Video Synthesis with Motion Controllability.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Customizable Image Synthesis with Multiple Subjects.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Res-Tuning: A Flexible and Efficient Tuning Paradigm via Unbinding Tuner from Backbone.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Compact Neural Volumetric Video Representations with Dynamic Codebooks.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Composer: Creative and Controllable Image Synthesis with Composable Conditions.

Proceedings of the International Conference on Machine Learning, 2023

Towards Smooth Video Composition.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

LinkGAN: Linking GAN Latents to Pixels for Controllable Image Synthesis.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Regularized Mask Tuning: Uncovering Hidden Knowledge in Pre-trained Vision-Language Models.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

One-Shot Generative Domain Adaptation.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Scanning Only Once: An End-to-end Framework for Fast Temporal Grounding in Long Videos.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ViM: Vision Middleware for Unified Downstream Transferring.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Dimensionality-Varying Diffusion Process.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-aware Scene Synthesis.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

LipFormer: High-fidelity and Generalizable Talking Face Generation with A Pre-learned Facial Codebook.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Balancing Logit Variation for Long-Tailed Semantic Segmentation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning 3D-Aware Image Synthesis with Unknown Pose Distribution.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Neural Dependencies Emerging from Learning Massive Categories.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

GLeaD: Improving GANs with A Generator-Leading Task.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

InterFaceGAN: Interpreting the Disentangled Face Representation Learned by GANs.

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Deep Generative Models on 3D Representations: A Survey.

CoRR, 2022

Interpreting Class Conditional GANs with Channel Awareness.

CoRR, 2022

A Unified Model for Multi-class Anomaly Detection.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Improving GANs with A Dynamic Discriminator.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning from Future: A Novel Self-Training Framework for Semantic Segmentation.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Region-Based Semantic Factorization in GANs.

Proceedings of the International Conference on Machine Learning, 2022

3D-Aware Indoor Scene Synthesis with Depth Priors.

Proceedings of the Computer Vision - ECCV 2022, 2022

High-Fidelity GAN Inversion with Padding Space.

Proceedings of the Computer Vision - ECCV 2022, 2022

Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

3D-aware Image Synthesis via Learning Structural and Textural Representations.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Improving GAN Equilibrium by Raising Spatial Awareness.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Semantic Hierarchy Emerges in Deep Generative Representations for Scene Synthesis.

Ceyuan Yang

Yujun Shen

Bolei Zhou

Int. J. Comput. Vis., 2021

Decorating Your Own Bedroom: Locally Controlling Image Generation with Generative Adversarial Networks.

Chen Zhang

Yinghao Xu

Yujun Shen

CoRR, 2021

Unsupervised Image Transformation Learning via Generative Adversarial Networks.

Kaiwen Zha

Yujun Shen

Bolei Zhou

CoRR, 2021

Low-Rank Subspaces in GANs.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Data-Efficient Instance Generation from Instance Discrimination.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

CompConv: A Compact Convolution Module for Efficient Feature Learning.

Chen Zhang

Yinghao Xu

Yujun Shen

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Generative Hierarchical Features From Synthesizing Images.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Glancing at the Patch: Anomaly Localization With Global and Local Feature Comparison.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Closed-Form Factorization of Latent Semantics in GANs.

Yujun Shen

Bolei Zhou

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Improving the Fairness of Deep Generative Models without Retraining.

Shuhan Tan

Yujun Shen

Bolei Zhou

CoRR, 2020

Residual Knowledge Distillation.

CoRR, 2020

In-Domain GAN Inversion for Real Image Editing.

Proceedings of the Computer Vision - ECCV 2020, 2020

Interpreting the Latent Space of GANs for Semantic Face Editing.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Image Processing Using Multi-Code GAN Prior.

Jinjin Gu

Yujun Shen

Bolei Zhou

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2018

Feature Matters: A Stage-by-Stage Approach for Knowledge Transfer.

CoRR, 2018

FaceFeat-GAN: a Two-Stage Approach for Identity-Preserving Face Synthesis.

CoRR, 2018

FaceID-GAN: Learning a Symmetry Three-Player GAN for Identity-Preserving Face Synthesis.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018