Huaibo Huang

Orcid: 0000-0001-5866-2283

According to our database1, Huaibo Huang authored at least 91 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Uncertainty-aware image inpainting with adaptive feedback network.
Expert Syst. Appl., January, 2024

RISTRA: Recursive Image Super-Resolution Transformer With Relativistic Assessment.
IEEE Trans. Multim., 2024

Deep Learning Technology for Face Forgery Detection: A Survey.
CoRR, 2024

InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning.
CoRR, 2024

MMFakeBench: A Mixed-Source Multimodal Misinformation Detection Benchmark for LVLMs.
CoRR, 2024

Visual Anchors Are Strong Information Aggregators For Multimodal Large Language Model.
CoRR, 2024

Semantic Equitable Clustering: A Simple, Fast and Effective Strategy for Vision Transformer.
CoRR, 2024

Vision Transformer with Sparse Scan Prior.
CoRR, 2024

ViTAR: Vision Transformer with Any Resolution.
CoRR, 2024

DiffMAC: Diffusion Manifold Hallucination Correction for High Generalization Blind Face Restoration.
CoRR, 2024

FakeNewsGPT4: Advancing Multimodal Fake News Detection through Knowledge-Augmented LVLMs.
CoRR, 2024

InfiMM-HD: A Leap Forward in High-Resolution Multimodal Understanding.
CoRR, 2024

Variational Capsules for Image Analysis and Synthesis.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Learning Fine-Grained and Semantically Aware Mamba Representations for Tampered Text Detection in Images.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

FKA-Owl: Advancing Multimodal Fake News Detection through Knowledge-Augmented LVLMs.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

ZePo: Zero-Shot Portrait Stylization with Faster Sampling.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Parallel Augmentation and Dual Enhancement for Occluded Person Re-Identification.
Proceedings of the IEEE International Conference on Acoustics, 2024

Semantic-Aware Detail Enhancement for Blind Face Restoration.
Proceedings of the 18th IEEE International Conference on Automatic Face and Gesture Recognition, 2024

INSTASTYLE: Inversion Noise of a Stylized Image is Secretly a Style Adviser.
Proceedings of the Computer Vision - ECCV 2024, 2024

RMT: Retentive Networks Meet Vision Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Uncertainty-Aware Source-Free Adaptive Image Super-Resolution with Wavelet Augmentation Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Multimodal Prompt Perceiver: Empower Adaptiveness, Generalizability and Fidelity for All-in-One Image Restoration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

DeVAn: Dense Video Annotation for Video-Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Heterogeneous Test-Time Training for Multi-Modal Person Re-identification.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Memory Uncertainty Learning for Real-World Single Image Deraining.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

Diverse features discovery transformer for pedestrian attribute recognition.
Eng. Appl. Artif. Intell., March, 2023

Contextual Measures for Iris Recognition.
IEEE Trans. Inf. Forensics Secur., 2023

Portrait Diffusion: Training-free Face Stylization with Chain-of-Painting.
CoRR, 2023

Exploring Straighter Trajectories of Flow Matching with Diffusion Guidance.
CoRR, 2023

InstaStyle: Inversion Noise of a Stylized Image is Secretly a Style Adviser.
CoRR, 2023

Video-CSR: Complex Video Digest Creation for Visual-Language Models.
CoRR, 2023

Video-Teller: Enhancing Cross-Modal Generation with Fusion and Decoupling.
CoRR, 2023

Rethinking Local Perception in Lightweight Vision Transformer.
CoRR, 2023

SOSR: Source-Free Image Super-Resolution with Wavelet Augmentation Transformer.
CoRR, 2023

Learning-to-Rank Meets Language: Boosting Language-Driven Ordering Alignment for Ordinal Classification.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Lightweight Vision Transformer with Bidirectional Interaction.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Pluralistic Aging Diffusion Autoencoder.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023



Fine-Grained Face Sketch-Photo Synthesis with Text-Guided Diffusion Models.
Proceedings of the Pattern Recognition - 7th Asian Conference, 2023

Uncertainty-Guided Test-Time Training for Face Forgery Detection.
Proceedings of the Pattern Recognition - 7th Asian Conference, 2023

2022
Towards More Discriminative and Robust Iris Recognition by Learning Uncertain Factors.
IEEE Trans. Inf. Forensics Secur., 2022

Memory-Modulated Transformer Network for Heterogeneous Face Recognition.
IEEE Trans. Inf. Forensics Secur., 2022

Contrastive attention network with dense field estimation for face completion.
Pattern Recognit., 2022

DVG-Face: Dual Variational Generation for Heterogeneous Face Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Vision Transformer with Super Token Sampling.
CoRR, 2022

Prior-Guided Multi-scale Fusion Transformer for Face Attribute Recognition.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

Style-Based Attentive Network for Real-World Face Hallucination.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

Orthogonal Transformer: An Efficient Vision Transformer Backbone with Token Orthogonalization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Fine-Grained Cross-Modal Retrieval with Triple-Streamed Memory Fusion Transformer Encoder.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Video Forgery Detection Using Spatio-Temporal Dual Transformer.
Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition, 2022

Artistic Style Discovery with Independent Components.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022


Rethinking Image Cropping: Exploring Diverse Compositions from Global Views.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Confidence-Calibrated Face Image Forgery Detection with Contrastive Representation Distillation.
Proceedings of the Computer Vision - ACCV 2022, 2022

2021
LAMP-HQ: A Large-Scale Multi-pose High-Quality Database and Benchmark for NIR-VIS Face Recognition.
Int. J. Comput. Vis., 2021

Selective Wavelet Attention Learning for Single Image Deraining.
Int. J. Comput. Vis., 2021

Toward Accurate and Reliable Iris Segmentation Using Uncertainty Learning.
CoRR, 2021

Learning Causal Representation for Face Transfer across Large Appearance Gap.
CoRR, 2021

Universal Face Restoration With Memorized Modulation.
CoRR, 2021

LightCvT: Audio Forgery Detection via Fusion of Light CNN and Transformer.
Proceedings of the ICCPR '21: 10th International Conference on Computing and Pattern Recognition, Shanghai, China, October 15, 2021

Visual-Semantic Transformer for Face Forgery Detection.
Proceedings of the International IEEE Joint Conference on Biometrics, 2021

Memory Oriented Transfer Learning for Semi-Supervised Image Deraining.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Information Bottleneck Disentanglement for Identity Swapping.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Heterogeneous Facial Analysis and Synthesis
Springer Briefs in Computer Science, Springer, ISBN: 978-981-13-9147-7, 2020

BLAN: Bi-directional ladder attentive network for facial attribute prediction.
Pattern Recognit., 2020

A Survey of Deep Facial Attribute Analysis.
Int. J. Comput. Vis., 2020

Disentangled Representation Learning of Makeup Portraits in the Wild.
Int. J. Comput. Vis., 2020

Cosmetic-Aware Makeup Cleanser.
CoRR, 2020

Arbitrary Talking Face Generation via Attentional Audio-Visual Coherence Learning.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Exemplar Guided Cross-Spectral Face Hallucination via Mutual Information Disentanglement.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Attentional Wavelet Network for Traditional Chinese Painting Transfer.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Free-Form Image Inpainting via Contrastive Attention Network.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Hierarchical Face Aging Through Disentangled Latent Characteristics.
Proceedings of the Computer Vision - ECCV 2020, 2020

Informative Sample Mining Network for Multi-domain Image-to-Image Translation.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Wavelet Domain Generative Adversarial Network for Multi-scale Face Hallucination.
Int. J. Comput. Vis., 2019

Style-based Variational Autoencoder for Real-World Super-Resolution.
CoRR, 2019

LAMP-HQ: A Large-Scale Multi-Pose High-Quality Database for NIR-VIS Face Recognition.
CoRR, 2019

Biphasic Learning of GANs for High-Resolution Image-to-Image Translation.
CoRR, 2019

UVA: A Universal Variational Framework for Continuous Age Analysis.
CoRR, 2019

Dual Variational Generation for Low Shot Heterogeneous Face Recognition.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Learning Disentangled Representation for Cross-Modal Retrieval with Deep Mutual Information Estimation.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

A Novel Distance Learning for Elastic Cross-Modal Audio-Visual Matching.
Proceedings of the IEEE International Conference on Multimedia & Expo Workshops, 2019

Adversarial Iris Super Resolution.
Proceedings of the 2019 International Conference on Biometrics, 2019

Semantic-Aware Makeup Cleanser.
Proceedings of the 10th IEEE International Conference on Biometrics Theory, 2019

Disentangled Variational Representation for Heterogeneous Face Recognition.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
A Survey to Deep Facial Attribute Analysis.
CoRR, 2018

High-Resolution Talking Face Generation via Mutual Information Approximation.
CoRR, 2018

Variational Capsules for Image Analysis and Synthesis.
CoRR, 2018

IntroVAE: Introspective Variational Autoencoders for Photographic Image Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

2017
Wavelet-SRNet: A Wavelet-Based CNN for Multi-scale Face Super Resolution.
Proceedings of the IEEE International Conference on Computer Vision, 2017


  Loading...