Yan Huang

Orcid: 0000-0002-8239-7229

Affiliations:
  • Chinese Academy of Sciences, Institute of Automation, National Laboratory of Pattern Recognition, Beijing, China


According to our database, Yan Huang authored at least 123 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
An Overview of Text-Based Person Search: Recent Advances and Future Directions.
IEEE Trans. Circuits Syst. Video Technol., September, 2024

Text-to-Image Vehicle Re-Identification: Multi-Scale Multi-View Cross-Modal Alignment Network and a Unified Benchmark.
IEEE Trans. Intell. Transp. Syst., July, 2024

Customized meta-dataset for automatic classifier accuracy evaluation.
Pattern Recognit., February, 2024

Comprehensive Attribute Prediction Learning for Person Search by Language.
IEEE Trans. Image Process., 2024

Meta Clothing Status Calibration for Long-Term Person Re-Identification.
IEEE Trans. Image Process., 2024

Enhancing Person Re-Identification Performance Through In Vivo Learning.
IEEE Trans. Image Process., 2024

Self-Supervised Recovery and Guide for Low-Resolution Person Re-Identification.
IEEE Trans. Inf. Forensics Secur., 2024

Memory-Adaptive Vision-and-Language Navigation.
Pattern Recognit., 2024

GR-MG: Leveraging Partially Annotated Data via Multi-Modal Goal Conditioned Policy.
CoRR, 2024

SliceMamba for Medical Image Segmentation.
CoRR, 2024

Free Lunch for Gait Recognition: A Novel Relation Descriptor.
Proceedings of the Computer Vision - ECCV 2024, 2024

Investigating Compositional Challenges in Vision-Language Models for Visual Grounding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Attribute-Guided Pedestrian Retrieval: Bridging Person Re-ID with Internal Attribute Variability.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
End-to-End Alternating Optimization for Real-World Blind Super Resolution.
Int. J. Comput. Vis., December, 2023

Efficient Image and Sentence Matching.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

Microstructure Turbulence Measurement in the Northern South China Sea from a Long-Range Hybrid AUV.
Sensors, February, 2023

A Reconstruction-Based Visual-Acoustic-Semantic Embedding Method for Speech-Image Retrieval.
IEEE Trans. Multim., 2023

Efficient Token-Guided Image-Text Retrieval With Consistent Multimodal Contrastive Training.
IEEE Trans. Image Process., 2023

SiamON: Siamese Occlusion-Aware Network for Visual Tracking.
IEEE Trans. Circuits Syst. Video Technol., 2023

Cyclic Differentiable Architecture Search.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

AI in Human-computer Gaming: Techniques, Challenges and Opportunities.
Int. J. Autom. Comput., 2023

VI-Diff: Unpaired Visible-Infrared Translation Diffusion Model for Single Modality Labeled Visible-Infrared Person Re-identification.
CoRR, 2023

Illumination Distillation Framework for Nighttime Person Re-Identification and A New Benchmark.
CoRR, 2023

Target-Grounded Graph-Aware Transformer for Aerial Vision-and-Dialog Navigation.
CoRR, 2023

Free Lunch for Gait Recognition: A Novel Relation Descriptor.
CoRR, 2023

ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments.
CoRR, 2023

VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation.
CoRR, 2023

Frequency-Enhanced Data Augmentation for Vision-and-Language Navigation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Causal Intervention for Sparse-View Gait Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Towards Few-shot Image Captioning with Cycle-based Compositional Semantic Enhancement Framework.
Proceedings of the International Joint Conference on Neural Networks, 2023

2022
Actor and Action Modular Network for Text-Based Video Segmentation.
IEEE Trans. Image Process., 2022

Joint Token and Feature Alignment Framework for Text-Based Person Search.
IEEE Signal Process. Lett., 2022

Design and Motion Performance Analysis of Turbulent AUV Measuring Platform.
Sensors, 2022

Distilled light GaitSet: Towards scalable gait recognition.
Pattern Recognit. Lett., 2022

Few-Shot Image and Sentence Matching via Aligned Cross-Modal Memory.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Two-Branch Relational Prototypical Network for Weakly Supervised Temporal Action Localization.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Learning a Robust Part-Aware Monocular 3D Human Pose Estimator via Neural Architecture Search.
Int. J. Comput. Vis., 2022

BEVBert: Topo-Metric Map Pre-training for Language-guided Navigation.
CoRR, 2022

CNTN: Cyclic Noise-tolerant Network for Gait Recognition.
CoRR, 2022

1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022).
CoRR, 2022

Learning the Degradation Distribution for Blind Image Super-Resolution.
CoRR, 2022

MACK: Multimodal Aligned Conceptual Knowledge for Unpaired Image-text Matching.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Part Based Interaction Learning for Group Activity Recognition.
Proceedings of the 24th IEEE International Workshop on Multimedia Signal Processing, 2022

Generalized Inter-class Loss for Gait Recognition.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Cross-modal Co-occurrence Attributes Alignments for Person Search by Language.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Regularized Graph Structure Learning with Semantic Knowledge for Multi-variates Time-Series Forecasting.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Dynamic Collaboration Convolution for Robust RGBT Tracking.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

The Tenth Visual Object Tracking VOT2022 Challenge Results.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

3D Shape Temporal Aggregation for Video-Based Clothing-Change Person Re-identification.
Proceedings of the Computer Vision - ACCV 2022, 2022

Generalizable Person Re-identification via Self-Supervised Batch Norm Test-Time Adaption.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Modeling Sub-Actions for Weakly Supervised Temporal Action Localization.
IEEE Trans. Image Process., 2021

Attribute Prototype Learning for Interactive Face Retrieval.
IEEE Trans. Inf. Forensics Secur., 2021

End-to-end video text detection with online tracking.
Pattern Recognit., 2021

Adaptive super-resolution for person re-identification with low-resolution images.
Pattern Recognit., 2021

Mask-guided contrastive attention and two-stream metric co-learning for person Re-identification.
Neurocomputing, 2021

PokerNet: Expanding Features Cheaply via Depthwise Convolutions.
Int. J. Autom. Comput., 2021

Adaptive Dilated Convolution For Human Pose Estimation.
CoRR, 2021

End-to-end Alternating Optimization for Blind Super Resolution.
CoRR, 2021

FDAN: Flow-guided Deformable Alignment Network for Video Super-Resolution.
CoRR, 2021

Landmark-RxR: Solving Vision-and-Language Navigation with Fine-Grained Alignment Supervision.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Neighbor-view Enhanced Model for Vision and Language Navigation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Efficient Human Pose Estimation by Learning Deeply Aggregated Representations.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

CMF: Cascaded Multi-Model Fusion For Referring Image Segmentation.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Rethinking the Heatmap Regression for Bottom-Up Human Pose Estimation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Improving Description-Based Person Re-Identification by Multi-Granularity Image-Text Alignments.
IEEE Trans. Image Process., 2020

Long video question answering: A Matching-guided Attention Model.
Pattern Recognit., 2020

Re-ranking image-text matching by adaptive metric fusion.
Pattern Recognit., 2020

Image and Sentence Matching via Semantic Concepts and Order Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Frame-GAN: Increasing the frame rate of gait videos with generative adversarial networks.
Neurocomputing, 2020

Rethinking the Heatmap Regression for Bottom-up Human Pose Estimation.
CoRR, 2020

Actor and Action Modular Network for Text-based Video Segmentation.
CoRR, 2020

Recurrent Deconvolutional Generative Adversarial Networks with Application to Text Guided Video Generation.
CoRR, 2020

Global Context Enhanced Multi-modal Fusion for Referring Image Segmentation.
Proceedings of the Pattern Recognition and Computer Vision, Third Chinese Conference, 2020

Unfolding the Alternating Optimization for Blind Super Resolution.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Textual Dependency Embedding for Person Search by Language.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

VSR++: Improving Visual Semantic Reasoning for Fine-Grained Image-Text Matching.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Efficient Super Resolution by Recursive Aggregation.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

On the Robustness of 3D Human Pose Estimation.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Prediction and Recovery for Adaptive Low-Resolution Person Re-Identification.
Proceedings of the Computer Vision - ECCV 2020, 2020

Towards Part-Aware Monocular 3D Human Pose Estimation: An Architecture Search Approach.
Proceedings of the Computer Vision - ECCV 2020, 2020

Relational Prototypical Network for Weakly Supervised Temporal Action Localization.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Part-Level Graph Convolutional Network for Skeleton-Based Action Recognition.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
GaitNet: An end-to-end network for gait based human identification.
Pattern Recognit., 2019

Part-aligned pose-guided recurrent network for action recognition.
Pattern Recognit., 2019

Learning view invariant gait features with Two-Stream GAN.
Neurocomputing, 2019

A hierarchical contextual attention-based network for sequential recommendation.
Neurocomputing, 2019

Learning Compact Target-Oriented Feature Representations for Visual Tracking.
CoRR, 2019

Recurrent Deconvolutional Generative Adversarial Networks with Application to Video Generation.
Proceedings of the Pattern Recognition and Computer Vision - Second Chinese Conference, 2019

Relational Network for Skeleton-Based Action Recognition.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Augmented Visual-Semantic Embeddings for Image and Sentence Matching.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Hierarchical Graph Convolutional Network for Skeleton-Based Action Recognition.
Proceedings of the Image and Graphics - 10th International Conference, 2019

Fusing Two Directions in Cross-Domain Adaption for Real Life Person Search by Language.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

The Seventh Visual Object Tracking VOT2019 Challenge Results.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

ACMM: Aligned Cross-Modal Memory for Few-Shot Image and Sentence Matching.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Language-Driven Temporal Activity Localization: A Semantic Matching Reinforcement Learning Model.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Box-Driven Class-Wise Region Masking and Filling Rate Guided Loss for Weakly Supervised Semantic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Local Relationship Learning With Person-Specific Shape Regularization for Facial Action Unit Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Learning Depth-aware Heatmaps for 3D Human Pose Estimation in the Wild.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

Few-Shot Image and Sentence Matching via Gated Visual-Semantic Embedding.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Video Super-Resolution via Bidirectional Recurrent Convolutional Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Skeleton-Based Relational Modeling for Action Recognition.
CoRR, 2018

Hierarchical Memory Modelling for Video Captioning.
Proceedings of the 2018 ACM Multimedia Conference, 2018

Deep Temporal Feature Encoding for Action Recognition.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Towards Unconstrained Pointing Problem of Visual Question Answering: A Retrieval-based Method.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Automatic Engagement Prediction with GAP Feature.
Proceedings of the 2018 International Conference on Multimodal Interaction, 2018

Cross-Modal Ranking with Soft Consistency and Noisy Labels for Robust RGB-T Tracking.
Proceedings of the Computer Vision - ECCV 2018, 2018

M3: Multimodal Memory Modelling for Video Captioning.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Mask-Guided Contrastive Attention Model for Person Re-Identification.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning Semantic Concepts and Order for Image and Sentence Matching.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Conditional High-Order Boltzmann Machines for Supervised Relation Learning.
IEEE Trans. Image Process., 2017

Learning Semantic Concepts and Order for Image and Sentence Matching.
CoRR, 2017

See the Forest for the Trees: Joint Spatial and Temporal Recurrent Neural Networks for Video-Based Person Re-identification.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Instance-Aware Image and Sentence Matching with Selective Multimodal LSTM.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Learning Shared and Specific Factors for Multi-modal Data.
Proceedings of the Computer Vision - Second CCF Chinese Conference, 2017

2016
Multimodal Memory Modelling for Video Captioning.
CoRR, 2016

2015
Unconstrained Multimodal Multi-Label Learning.
IEEE Trans. Multim., 2015

Bidirectional Recurrent Convolutional Networks for Multi-Frame Super-Resolution.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Conditional High-Order Boltzmann Machine: A Supervised Learning Model for Relation Learning.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

2014
A General Nonlinear Embedding Framework Based on Deep Neural Network.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Deep Embedding Network for Clustering.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Generalized Autoencoder: A Neural Network Framework for Dimensionality Reduction.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Multi-task deep neural network for multi-label learning.
Proceedings of the IEEE International Conference on Image Processing, 2013

2012
An effective regional saliency model based on extended site entropy rate.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

