Jiangning Zhang

Orcid: 0000-0001-8891-6766

According to our database, Jiangning Zhang authored at least 104 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

EATFormer: Improving Vision Transformer Inspired by Evolutionary Algorithm.
Int. J. Comput. Vis., September, 2024

SFNet: Faster and Accurate Semantic Segmentation via Semantic Flow.
Int. J. Comput. Vis., February, 2024

Towards Open Vocabulary Learning: A Survey.
IEEE Trans. Pattern Anal. Mach. Intell., 2024

Dual-path Frequency Discriminators for Few-shot Anomaly Detection.
Knowl. Based Syst., 2024

LLaVA-KD: A Framework of Distilling Multimodal Large Language Models.
CoRR, 2024

OSV: One Step is Enough for High-Quality Image to Video Generation.
CoRR, 2024

SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation.
CoRR, 2024

Temporal and Interactive Modeling for Efficient Human-Human Motion Generation.
CoRR, 2024

LiCROcc: Teach Radar for Accurate Semantic Occupancy Prediction using LiDAR and Camera.
CoRR, 2024

Learning Multi-view Anomaly Detection.
CoRR, 2024

RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D Facial Prior-guided Identity Alignment Network.
CoRR, 2024

MotionBooth: Motion-Aware Customized Text-to-Video Generation.
CoRR, 2024

Portrait3D: 3D Head Generation from Single In-the-wild Portrait Image.
CoRR, 2024

AnyMaker: Zero-shot General Object Customization via Decoupled Dual-Level ID Injection.
CoRR, 2024

ADer: A Comprehensive Benchmark for Multi-class Visual Anomaly Detection.
CoRR, 2024

M3DM-NR: RGB-3D Noisy-Resistant Industrial Anomaly Detection via Multimodal Denoising.
CoRR, 2024

VividPose: Advancing Stable Video Diffusion for Realistic Human Image Animation.
CoRR, 2024

Open-Vocabulary SAM3D: Understand Any 3D Scene.
CoRR, 2024

StyleMaster: Towards Flexible Stylized Image Generation with Diffusion Models.
CoRR, 2024

PointRWKV: Efficient RWKV-Like Model for Hierarchical Point Cloud Learning.
CoRR, 2024

MotionMaster: Training-free Camera Motion Transfer For Video Generation.
CoRR, 2024

Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark.
CoRR, 2024

MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection.
CoRR, 2024

Deepfake Generation and Detection: A Benchmark and Survey.
CoRR, 2024

DiffFAE: Advancing High-fidelity One-shot Facial Appearance Editing with Space-sensitive Customization and Semantic Preservation.
CoRR, 2024

DMAD: Dual Memory Bank for Real-World Anomaly Detection.
CoRR, 2024

Explore In-Context Segmentation via Latent Diffusion Models.
CoRR, 2024

PointSeg: A Training-Free Paradigm for 3D Scene Segmentation via Foundation Models.
CoRR, 2024

Dual-path Frequency Discriminators for Few-shot Anomaly Detection.
CoRR, 2024

MMoFusion: Multi-modal Co-Speech Motion Generation with Diffusion Model.
CoRR, 2024

A Survey on Visual Anomaly Detection: Challenge, Approach, and Prospect.
CoRR, 2024

Self-supervised Feature Adaptation for 3D Industrial Anomaly Detection.
CoRR, 2024

A Generalist FaceX via Learning Unified Facial Representation.
CoRR, 2024

MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

COMD: Training-free Video Motion Transfer With Camera-Object Motion Disentanglement.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

MambaGesture: Enhancing Co-Speech Gesture Generation with Mamba and Disentangled Multi-Modality Fusion.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

UniM-OV3D: Uni-Modality Open-Vocabulary 3D Scene Understanding with Fine-Grained Feature Representation.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

TransAVS: End-to-End Audio-Visual Segmentation with Transformer.
Proceedings of the IEEE International Conference on Acoustics, 2024

Self-supervised Feature Adaptation for 3D Industrial Anomaly Detection.
Proceedings of the Computer Vision - ECCV 2024, 2024

TexDreamer: Towards Zero-Shot High-Fidelity 3D Human Texture Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

DiffuMatting: Synthesizing Arbitrary Objects with Matting-Level Annotation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Face-Adapter for Pre-trained Diffusion Models with Fine-Grained ID and Attribute Control.
Proceedings of the Computer Vision - ECCV 2024, 2024

FreeMotion: A Unified Framework for Number-Free Text-to-Motion Synthesis.
Proceedings of the Computer Vision - ECCV 2024, 2024

AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection.
Proceedings of the Computer Vision - ECCV 2024, 2024

Towards Language-Driven Video Inpainting via Multimodal Large Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Real-IAD: A Real-World Multi-View Dataset for Benchmarking Versatile Industrial Anomaly Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

PortraitBooth: A Versatile Portrait Model for Fast Identity-Preserved Personalization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SuperSVG: Superpixel-Based Scalable Vector Graphics Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Self-Supervised Likelihood Estimation with Energy Guidance for Anomaly Segmentation in Urban Scenes.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

AnomalyDiffusion: Few-Shot Anomaly Image Generation with Diffusion Model.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

A Diffusion-Based Framework for Multi-Class Anomaly Detection.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Rethinking Reverse Distillation for Multi-Modal Anomaly Detection.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Adaptive Recurrent Forward Network for Dense Point Cloud Completion.
IEEE Trans. Multim., 2023

Omni-Frequency Channel-Selection Representations for Unsupervised Anomaly Detection.
IEEE Trans. Image Process., 2023

Exploring Plain ViT Reconstruction for Multi-class Unsupervised Anomaly Detection.
CoRR, 2023

DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection.
CoRR, 2023

Exploring Grounding Potential of VQA-oriented GPT-4V for Zero-shot Anomaly Detection.
CoRR, 2023

CLIP-AD: A Language-Guided Staged Dual-Path Model for Zero-shot Anomaly Detection.
CoRR, 2023

A Zero-/Few-Shot Anomaly Classification and Segmentation Method for CVPR 2023 VAND Workshop Challenge Tracks 1&2: 1st Place on Zero-shot AD and 4th Place on Few-shot AD.
CoRR, 2023

Dual Path Transformer with Partition Attention.
CoRR, 2023

Hear to Segment: Unmixing the Audio to Guide the Semantic Segmentation.
CoRR, 2023

Multimodal-driven Talking Face Generation via a Unified Diffusion-based Generator.
CoRR, 2023

Self-supervised Likelihood Estimation with Energy Guidance for Anomaly Segmentation in Urban Scenes.
CoRR, 2023

Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation.
CoRR, 2023

Rethinking Mobile Block for Efficient Neural Models.
CoRR, 2023

PVG: Progressive Vision Graph for Vision Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Toward High Quality Facial Representation Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Rethinking Mobile Block for Efficient Attention-based Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning Global-aware Kernel for Image Harmonization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Phasic Content Fusing Diffusion Model with Directional Distribution Consistency for Few-Shot Model Adaption.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Remembering Normality: Memory-guided Knowledge Distillation for Unsupervised Anomaly Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

High-Fidelity Generalized Emotional Talking Face Generation with Multi-Modal Emotion Space Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning with Noisy labels via Self-supervised Adversarial Noisy Masking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MixTeacher: Mining Promising Labels with Mixed Scale Teacher for Semi-Supervised Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning to Measure the Point Cloud Reconstruction Loss in a Representation Space.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Better "CMOS" Produces Clearer Images: Learning Space-Variant Blur Estimation for Blind Image Super-Resolution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Multimodal Industrial Anomaly Detection via Hybrid Fusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Calibrated Teacher for Sparsely Annotated Object Detection.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
3QNet: 3D Point Cloud Geometry Quantization Compression Network.
ACM Trans. Graph., 2022

Multilevel Spatial-Temporal Feature Aggregation for Video Object Detection.
IEEE Trans. Circuits Syst. Video Technol., 2022

Real-Time Audio-Guided Multi-Face Reenactment.
IEEE Signal Process. Lett., 2022

Fast Point Cloud Sampling Network.
Pattern Recognit. Lett., 2022

SFNet: Faster, Accurate, and Domain Agnostic Semantic Segmentation via Semantic Flow.
CoRR, 2022

Iterative Few-shot Semantic Segmentation from Image Label Text.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Designing One Unified Framework for High-Fidelity Face Reenactment and Swapping.
Proceedings of the Computer Vision - ECCV 2022, 2022

Resolution-Free Point Cloud Sampling Network with Data Distillation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Learning to Train a Point Cloud Reconstruction Network Without Matching.
Proceedings of the Computer Vision - ECCV 2022, 2022

Region-Aware Face Swapping.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SCSNet: An Efficient Paradigm for Learning Simultaneously Image Colorization and Super-resolution.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Learning Hierarchical and Efficient Person Re-Identification for Robotic Navigation.
Int. J. Intell. Robotics Appl., 2021

SelFSR: Self-Conditioned Face Super-Resolution in the Wild via Flow Field Degradation Network.
CoRR, 2021

Analogous to Evolutionary Algorithm: Designing a Unified Sequence Model.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

RFNet: Recurrent Forward Network for Dense Point Cloud Completion.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
APB2FaceV2: Real-Time Audio-Guided Multi-Face Reenactment.
CoRR, 2020

Hierarchical and Efficient Learning for Person Re-Identification.
CoRR, 2020

APB2FACE: Audio-Guided Face Reenactment with Auxiliary Pose and Blink Signals.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

DTVNet: Dynamic Time-Lapse Video Generation via Single Still Image.
Proceedings of the Computer Vision - ECCV 2020, 2020

FReeNet: Multi-Identity Face Reenactment.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Learning by Analogy: Reliable Supervision From Transformations for Unsupervised Optical Flow Estimation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Realistic Face Reenactment via Self-Supervised Disentangling of Identity and Pose.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Learning-Based Hand Motion Capture and Understanding in Assembly Process.
IEEE Trans. Ind. Electron., 2019

FaceSwapNet: Landmark Guided Many-to-Many Face Reenactment.
CoRR, 2019

