Yan Huang

Yuming Wang

IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

Microstructure Turbulence Measurement in the Northern South China Sea from a Long-Range Hybrid AUV.

Sensors, February, 2023

A Reconstruction-Based Visual-Acoustic-Semantic Embedding Method for Speech-Image Retrieval.

IEEE Trans. Multim., 2023

Efficient Token-Guided Image-Text Retrieval With Consistent Multimodal Contrastive Training.

IEEE Trans. Image Process., 2023

SiamON: Siamese Occlusion-Aware Network for Visual Tracking.

IEEE Trans. Circuits Syst. Video Technol., 2023

Cyclic Differentiable Architecture Search.

IEEE Trans. Pattern Anal. Mach. Intell., 2023

AI in Human-computer Gaming: Techniques, Challenges and Opportunities.

Int. J. Autom. Comput., 2023

VI-Diff: Unpaired Visible-Infrared Translation Diffusion Model for Single Modality Labeled Visible-Infrared Person Re-identification.

Han Huang

CoRR, 2023

Illumination Distillation Framework for Nighttime Person Re-Identification and A New Benchmark.

CoRR, 2023

Target-Grounded Graph-Aware Transformer for Aerial Vision-and-Dialog Navigation.

CoRR, 2023

Free Lunch for Gait Recognition: A Novel Relation Descriptor.

CoRR, 2023

ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments.

CoRR, 2023

VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation.

CoRR, 2023

Frequency-Enhanced Data Augmentation for Vision-and-Language Navigation.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Causal Intervention for Sparse-View Gait Recognition.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Towards Few-shot Image Captioning with Cycle-based Compositional Semantic Enhancement Framework.

Proceedings of the International Joint Conference on Neural Networks, 2023

2022

Actor and Action Modular Network for Text-Based Video Segmentation.

IEEE Trans. Image Process., 2022

Joint Token and Feature Alignment Framework for Text-Based Person Search.

IEEE Signal Process. Lett., 2022

Design and Motion Performance Analysis of Turbulent AUV Measuring Platform.

Sensors, 2022

Distilled light GaitSet: Towards scalable gait recognition.

Pattern Recognit. Lett., 2022

Few-Shot Image and Sentence Matching via Aligned Cross-Modal Memory.

Jingdong Wang

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Two-Branch Relational Prototypical Network for Weakly Supervised Temporal Action Localization.

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Learning a Robust Part-Aware Monocular 3D Human Pose Estimator via Neural Architecture Search.

Int. J. Comput. Vis., 2022

BEVBert: Topo-Metric Map Pre-training for Language-guided Navigation.

CoRR, 2022

CNTN: Cyclic Noise-tolerant Network for Gait Recognition.

CoRR, 2022

1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022).

CoRR, 2022

Learning the Degradation Distribution for Blind Image Super-Resolution.

CoRR, 2022

MACK: Multimodal Aligned Conceptual Knowledge for Unpaired Image-text Matching.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Part Based Interaction Learning for Group Activity Recognition.

Proceedings of the 24th IEEE International Workshop on Multimedia Signal Processing, 2022

Generalized Inter-class Loss for Gait Recognition.

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Cross-modal Co-occurrence Attributes Alignments for Person Search by Language.

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Regularized Graph Structure Learning with Semantic Knowledge for Multi-variates Time-Series Forecasting.

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Dynamic Collaboration Convolution for Robust RGBT Tracking.

Proceedings of the 26th International Conference on Pattern Recognition, 2022

The Tenth Visual Object Tracking VOT2022 Challenge Results.

Joni-Kristian Kämäräinen

Alireza Memarmoghadam

Christian Micheloni

Payman Moallem

Le Thanh Nguyen-Meidine

Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

3D Shape Temporal Aggregation for Video-Based Clothing-Change Person Re-identification.

Proceedings of the Computer Vision - ACCV 2022, 2022

Generalizable Person Re-identification via Self-Supervised Batch Norm Test-Time Adaption.

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Modeling Sub-Actions for Weakly Supervised Temporal Action Localization.

IEEE Trans. Image Process., 2021

Attribute Prototype Learning for Interactive Face Retrieval.

IEEE Trans. Inf. Forensics Secur., 2021

End-to-end video text detection with online tracking.

Pattern Recognit., 2021

Adaptive super-resolution for person re-identification with low-resolution images.

Pattern Recognit., 2021

Mask-guided contrastive attention and two-stream metric co-learning for person Re-identification.

Neurocomputing, 2021

PokerNet: Expanding Features Cheaply via Depthwise Convolutions.

Wei Tang

Int. J. Autom. Comput., 2021

Adaptive Dilated Convolution For Human Pose Estimation.

CoRR, 2021

End-to-end Alternating Optimization for Blind Super Resolution.

CoRR, 2021

FDAN: Flow-guided Deformable Alignment Network for Video Super-Resolution.

Jiayi Lin

CoRR, 2021

Landmark-RxR: Solving Vision-and-Language Navigation with Fine-Grained Alignment Supervision.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Neighbor-view Enhanced Model for Vision and Language Navigation.

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Efficient Human Pose Estimation by Learning Deeply Aggregated Representations.

Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

CMF: Cascaded Multi-Model Fusion For Referring Image Segmentation.

Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Rethinking the Heatmap Regression for Bottom-Up Human Pose Estimation.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Improving Description-Based Person Re-Identification by Multi-Granularity Image-Text Alignments.

IEEE Trans. Image Process., 2020

Long video question answering: A Matching-guided Attention Model.

Weining Wang

Pattern Recognit., 2020

Re-ranking image-text matching by adaptive metric fusion.

Kai Niu

Pattern Recognit., 2020

Image and Sentence Matching via Semantic Concepts and Order Learning.

IEEE Trans. Pattern Anal. Mach. Intell., 2020

Frame-GAN: Increasing the frame rate of gait videos with generative adversarial networks.

Neurocomputing, 2020

Rethinking the Heatmap Regression for Bottom-up Human Pose Estimation.

CoRR, 2020

Actor and Action Modular Network for Text-based Video Segmentation.

CoRR, 2020

Recurrent Deconvolutional Generative Adversarial Networks with Application to Text Guided Video Generation.

CoRR, 2020

Global Context Enhanced Multi-modal Fusion for Referring Image Segmentation.

Proceedings of the Pattern Recognition and Computer Vision, Third Chinese Conference, 2020

Unfolding the Alternating Optimization for Blind Super Resolution.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Textual Dependency Embedding for Person Search by Language.

Kai Niu

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

VSR++: Improving Visual Semantic Reasoning for Fine-Grained Image-Text Matching.

Proceedings of the 25th International Conference on Pattern Recognition, 2020

Efficient Super Resolution by Recursive Aggregation.

Proceedings of the 25th International Conference on Pattern Recognition, 2020

On the Robustness of 3D Human Pose Estimation.

Zerui Chen

Proceedings of the 25th International Conference on Pattern Recognition, 2020

Prediction and Recovery for Adaptive Low-Resolution Person Re-Identification.

Proceedings of the Computer Vision - ECCV 2020, 2020

Towards Part-Aware Monocular 3D Human Pose Estimation: An Architecture Search Approach.

Proceedings of the Computer Vision - ECCV 2020, 2020

Relational Prototypical Network for Weakly Supervised Temporal Action Localization.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Part-Level Graph Convolutional Network for Skeleton-Based Action Recognition.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

GaitNet: An end-to-end network for gait based human identification.

Pattern Recognit., 2019

Part-aligned pose-guided recurrent network for action recognition.

Pattern Recognit., 2019

Learning view invariant gait features with Two-Stream GAN.

Neurocomputing, 2019

A hierarchical contextual attention-based network for sequential recommendation.

Neurocomputing, 2019

Learning Compact Target-Oriented Feature Representations for Visual Tracking.

CoRR, 2019

Recurrent Deconvolutional Generative Adversarial Networks with Application to Video Generation.

Proceedings of the Pattern Recognition and Computer Vision - Second Chinese Conference, 2019

Relational Network for Skeleton-Based Action Recognition.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Augmented Visual-Semantic Embeddings for Image and Sentence Matching.

Zerui Chen

Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Hierarchical Graph Convolutional Network for Skeleton-Based Action Recognition.

Proceedings of the Image and Graphics - 10th International Conference, 2019

Fusing Two Directions in Cross-Domain Adaption for Real Life Person Search by Language.

Kai Niu

Rama Krishna Sai Subrahmanyam Gorthi

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

The Seventh Visual Object Tracking VOT2019 Challenge Results.

Abdelrahman Eldesokey

Alireza Memarmoghadam

Ardhendu Shekhar Tripathi

Arnold W. M. Smeulders

Joni-Kristian Kämäräinen

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

ACMM: Aligned Cross-Modal Memory for Few-Shot Image and Sentence Matching.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Language-Driven Temporal Activity Localization: A Semantic Matching Reinforcement Learning Model.

Weining Wang

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Box-Driven Class-Wise Region Masking and Filling Rate Guided Loss for Weakly Supervised Semantic Segmentation.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Local Relationship Learning With Person-Specific Shape Regularization for Facial Action Unit Detection.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Learning Depth-aware Heatmaps for 3D Human Pose Estimation in the Wild.

Proceedings of the 30th British Machine Vision Conference 2019, 2019

Few-Shot Image and Sentence Matching via Gated Visual-Semantic Embedding.

Yang Long

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Video Super-Resolution via Bidirectional Recurrent Convolutional Networks.

IEEE Trans. Pattern Anal. Mach. Intell., 2018

Skeleton-Based Relational Modeling for Action Recognition.

CoRR, 2018

Hierarchical Memory Modelling for Video Captioning.

Proceedings of the 2018 ACM Multimedia Conference, 2018

Deep Temporal Feature Encoding for Action Recognition.

Proceedings of the 24th International Conference on Pattern Recognition, 2018

Towards Unconstrained Pointing Problem of Visual Question Answering: A Retrieval-based Method.

Wenlong Cheng

Proceedings of the 24th International Conference on Pattern Recognition, 2018

Automatic Engagement Prediction with GAP Feature.

Proceedings of the 2018 International Conference on Multimodal Interaction, 2018

Cross-Modal Ranking with Soft Consistency and Noisy Labels for Robust RGB-T Tracking.

Proceedings of the Computer Vision - ECCV 2018, 2018

M3: Multimodal Memory Modelling for Video Captioning.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Mask-Guided Contrastive Attention Model for Person Re-Identification.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning Semantic Concepts and Order for Image and Sentence Matching.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

Conditional High-Order Boltzmann Machines for Supervised Relation Learning.

IEEE Trans. Image Process., 2017

Learning Semantic Concepts and Order for Image and Sentence Matching.

Qi Wu

CoRR, 2017

See the Forest for the Trees: Joint Spatial and Temporal Recurrent Neural Networks for Video-Based Person Re-identification.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Instance-Aware Image and Sentence Matching with Selective Multimodal LSTM.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Learning Shared and Specific Factors for Multi-modal Data.

Proceedings of the Computer Vision - Second CCF Chinese Conference, 2017

2016

Multimodal Memory Modelling for Video Captioning.

CoRR, 2016

2015

Unconstrained Multimodal Multi-Label Learning.

IEEE Trans. Multim., 2015

Bidirectional Recurrent Convolutional Networks for Multi-Frame Super-Resolution.

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Conditional High-Order Boltzmann Machine: A Supervised Learning Model for Relation Learning.
