Dongdong Chen

Orcid: 0000-0002-4642-4373

Affiliations:
  • Microsoft Cloud AI
  • University of Science and Technology of China, Hefei, China (former)


According to our database1, Dongdong Chen authored at least 150 papers between 2017 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
SGEdit: Bridging LLM with Text2Image Generative Model for Scene Graph-based Image Editing.
ACM Trans. Graph., December, 2024

High-Fidelity and Efficient Pluralistic Image Completion With Transformers.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Robust Model Watermarking for Image Processing Networks via Structure Consistency.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2024

Transformer Based Pluralistic Image Completion With Reduced Information Loss.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2024

Learning a Single Network for Robust Medical Image Segmentation With Noisy Labels.
IEEE Trans. Medical Imaging, September, 2024

NeRF-Art: Text-Driven Neural Radiance Fields Stylization.
IEEE Trans. Vis. Comput. Graph., August, 2024

3D Question Answering.
IEEE Trans. Vis. Comput. Graph., March, 2024

Deep Image Matting With Sparse User Interactions.
IEEE Trans. Pattern Anal. Mach. Intell., February, 2024

AnimeDiff: Customized Image Generation of Anime Characters Using Diffusion Model.
IEEE Trans. Multim., 2024

PersonMAE: Person Re-Identification Pre-Training With Masked AutoEncoders.
IEEE Trans. Multim., 2024

PointCAT: Contrastive Adversarial Training for Robust Point Cloud Recognition.
IEEE Trans. Image Process., 2024

ToolBridge: An Open-Source Dataset to Equip LLMs with External Tool Capabilities.
CoRR, 2024

SynChart: Synthesizing Charts from Language Models.
CoRR, 2024

Pluralistic Salient Object Detection.
CoRR, 2024

Chat2Layout: Interactive 3D Furniture Layout with a Multimodal LLM.
CoRR, 2024

Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge.
CoRR, 2024

Generative Enhancement for 3D Medical Images.
CoRR, 2024

Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search.
CoRR, 2024

i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

TrafficMOT: A Challenging Dataset for Multi-Object Tracking in Complex Traffic Scenarios.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Attribute-Aware Head Swapping Guided by 3d Modeling.
Proceedings of the IEEE International Conference on Acoustics, 2024

Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

OmniViD: A Generative Framework for Universal Video Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Towards More Unified In-Context Visual Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

Robust Point Cloud Segmentation With Noisy Annotations.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Semantic Probability Distribution Modeling for Diverse Semantic Image Synthesis.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Cross-Domain and Disentangled Face Manipulation With 3D Guidance.
IEEE Trans. Vis. Comput. Graph., April, 2023

Perceptual Hashing of Deep Convolutional Neural Networks for Model Copy Detection.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Coherent adversarial deepfake video generation.
Signal Process., 2023

Old Photo Restoration via Deep Latent Space Translation.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Mesh-Guided Neural Implicit Field Editing.
CoRR, 2023

Video-Bench: A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models.
CoRR, 2023

On the Hidden Waves of Image.
CoRR, 2023

HQ-50K: A Large-scale, High-quality Dataset for Image Restoration.
CoRR, 2023

Designing a Better Asymmetric VQGAN for StableDiffusion.
CoRR, 2023

Image is First-order Norm+Linear Autoregressive.
CoRR, 2023

Album Storytelling with Iterative Story-aware Captioning and Large Language Models.
CoRR, 2023

i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data.
CoRR, 2023

ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System.
CoRR, 2023

OmniTracker: Unifying Object Tracking by Tracking-with-Detection.
CoRR, 2023

Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using CLIP and StableDiffusion.
Proceedings of the International Conference on Machine Learning, 2023

Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Label-Efficient Representations.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

HairCLIPv2: Unifying Hair Editing via Proxy Feature Blending.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

AvatarCraft: Transforming Text into Neural Human Avatars with Parameterized Shape and Pose Control.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Improving Adversarial Robustness of Masked Autoencoders via Test-time Frequency-domain Prompting.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Streaming Video Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Look Before You Match: Instance Understanding Matters in Video Object Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Diversity-Aware Meta Visual Prompting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

i-Code: An Integrative and Composable Multimodal Learning Framework.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Meta-PU: An Arbitrary-Scale Upsampling Network for Point Cloud.
IEEE Trans. Vis. Comput. Graph., 2022

JPEG Robust Invertible Grayscale.
IEEE Trans. Vis. Comput. Graph., 2022

TERA: Screen-to-Camera Image Code With Transparency, Efficiency, Robustness and Adaptability.
IEEE Trans. Multim., 2022

Poison Ink: Robust and Invisible Backdoor Attack.
IEEE Trans. Image Process., 2022

E2Style: Improve the Efficiency and Effectiveness of StyleGAN Inversion.
IEEE Trans. Image Process., 2022

Translation of Aerial Image Into Digital Map via Discriminative Segmentation and Creative Generation.
IEEE Trans. Geosci. Remote. Sens., 2022

Distribution-Preserving Steganography Based on Text-to-Speech Generative Models.
IEEE Trans. Dependable Secur. Comput., 2022

Deep Model Intellectual Property Protection via Deep Watermarking.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Efficient Semantic Image Synthesis via Class-Adaptive Normalization.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Online multi-object tracking with unsupervised re-identification learning and occlusion estimation.
Neurocomputing, 2022

CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet.
CoRR, 2022

X-Paste: Revisit Copy-Paste at Scale with CLIP and StableDiffusion.
CoRR, 2022

Self-Supervised Learning based on Heat Equation.
CoRR, 2022

SinDiffusion: Learning a Diffusion Model from a Single Natural Image.
CoRR, 2022

PointCAT: Contrastive Adversarial Training for Robust Point Cloud Recognition.
CoRR, 2022

Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling.
CoRR, 2022

Should All Proposals be Treated Equally in Object Detection?
CoRR, 2022

Semantic Image Synthesis via Diffusion Models.
CoRR, 2022

Residual Mixture of Experts.
CoRR, 2022

Protecting Celebrities with Identity Consistency Transformer.
CoRR, 2022

Self-supervised Transformer for Deepfake Detection.
CoRR, 2022

OmniVL: One Foundation Model for Image-Language and Video-Language Tasks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Should All Proposals Be Treated Equally in Object Detection?
Proceedings of the Computer Vision - ECCV 2022, 2022

Bootstrapped Masked Autoencoders for Vision BERT Pretraining.
Proceedings of the Computer Vision - ECCV 2022, 2022

General Facial Representation Learning in a Visual-Linguistic Manner.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

HairCLIP: Design Your Hair by Text and Reference Image.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

BEVT: BERT Pretraining of Video Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

CLIP-NeRF: Text-and-Image Driven Manipulation of Neural Radiance Fields.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Bringing Old Films Back to Life.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Reduce Information Loss in Transformers for Pluralistic Image Inpainting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Shape-invariant 3D Adversarial Point Clouds.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Vector Quantized Diffusion Model for Text-to-Image Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Large-Scale Pre-training for Person Re-identification with Noisy Labels.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Protecting Celebrities from DeepFake with Identity Consistency Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Mobile-Former: Bridging MobileNet and Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Deep Template-Based Watermarking.
IEEE Trans. Circuits Syst. Video Technol., 2021

A General Decoupled Learning Framework for Parameterized Image Operators.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Explicit Filterbank Learning for Neural Image Style Transfer and Image Processing.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

<i>CDAE</i>: Color decomposition-based adversarial examples for screen devices.
Inf. Sci., 2021

Adversarial defense via self-orthogonal randomization super-network.
Neurocomputing, 2021

Visual Structure Constraint for Transductive Zero-Shot Learning in the Wild.
Int. J. Comput. Vis., 2021

PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers.
CoRR, 2021

Florence: A New Foundation Model for Computer Vision.
CoRR, 2021

Unsupervised Finetuning.
CoRR, 2021

Poison Ink: Robust and Invisible Backdoor Attack.
CoRR, 2021

Exploring Structure Consistency for Deep Model Watermarking.
CoRR, 2021

A Simple Baseline for StyleGAN Inversion.
CoRR, 2021

Weak NAS Predictors Are All You Need.
CoRR, 2021

Stronger NAS with Weaker Predictors.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Revisiting Dynamic Convolution via Matrix Decomposition.
Proceedings of the 9th International Conference on Learning Representations, 2021

Learning with Noisy Labels for Robust Point Cloud Segmentation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

High-Fidelity Pluralistic Image Completion with Transformers.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

MicroNet: Improving Image Recognition with Extremely Low FLOPs.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Improve Unsupervised Pretraining for Few-label Transfer.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Multi-Attentional Deepfake Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Improved Image Matting via Real-Time User Clicks and Uncertainty Estimation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Diverse Semantic Image Synthesis via Probability Distribution Modeling.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Unsupervised Pre-Training for Person Re-Identification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Dynamic Head: Unifying Object Detection Heads With Attentions.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
MichiGAN: multi-input-conditioned hair image generation for portrait editing.
ACM Trans. Graph., 2020

Improving Person Re-Identification With Iterative Impression Aggregation.
IEEE Trans. Image Process., 2020

Controllable Image Processing via Adaptive FilterBank Pyramid.
IEEE Trans. Image Process., 2020

Are Fewer Labels Possible for Few-shot Learning?
CoRR, 2020

Semantic Image Synthesis via Efficient Class-Adaptive Normalization.
CoRR, 2020

Identity-Driven DeepFake Detection.
CoRR, 2020

MicroNet: Towards Image Recognition with Extremely Low FLOPs.
CoRR, 2020

Rethinking Spatially-Adaptive Normalization.
CoRR, 2020

Passport-aware Normalization for Deep Model Protection.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

GreedyFool: Distortion-Aware Sparse Adversarial Attack.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

DA-NAS: Data Adapted Pruning for Efficient Neural Architecture Search.
Proceedings of the Computer Vision - ECCV 2020, 2020

Dynamic ReLU.
Proceedings of the Computer Vision - ECCV 2020, 2020

LG-GAN: Label Guided Adversarial Network for Flexible Targeted Attack of Point Cloud Based Deep Networks.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Bringing Old Photos Back to Life.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Density-Aware Graph for Deep Semi-Supervised Visual Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Robust Superpixel-Guided Attentional Adversarial Attack.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Self-Robust 3D Point Recognition via Gather-Vector Guidance.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Dynamic Convolution: Attention Over Convolution Kernels.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Model Watermarking for Image Processing Networks.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Progressive Color Transfer With Dense Semantic Correspondences.
ACM Trans. Graph., 2019

Mirror, Mirror, on the Wall, Who's Got the Clearest Image of Them All? - A Tailored Approach to Single Image Reflection Removal.
IEEE Trans. Image Process., 2019

Deep Reflection Prior.
CoRR, 2019

Gated Context Aggregation Network for Image Dehazing and Deraining.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Transductive Zero-Shot Learning with Visual Structure Constraint.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Emerging applications of reversible data hiding.
Proceedings of the 2nd International Conference on Image and Graphics Processing, 2019

Once a MAN: Towards Multi-Target Attack via Learning Multi-Target Adversarial Network Once.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018
Deep exemplar-based colorization.
ACM Trans. Graph., 2018

Decouple Learning for Parameterized Image Operators.
Proceedings of the Computer Vision - ECCV 2018, 2018

Stereoscopic Neural Style Transfer.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Coherent Online Video Style Transfer.
Proceedings of the IEEE International Conference on Computer Vision, 2017

StyleBank: An Explicit Representation for Neural Image Style Transfer.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017


  Loading...