2025
STPM: Spatial-Temporal Token Pruning and Merging for Complex Activity Recognition.
IEEE Trans. Circuits Syst. Video Technol., June, 2025
Toward Physically Stable Motion Generation: A New Paradigm of Human Pose Representation.
IEEE Trans. Circuits Syst. Video Technol., May, 2025
MUST: Multi-Scale Structural-Temporal Link Prediction Model for UAV Ad Hoc Networks.
CoRR, May, 2025
Leveraging Frame- and Feature-level Progressive Augmentation for Semi-supervised Action Recognition.
ACM Trans. Multim. Comput. Commun. Appl., April, 2025
Coarse-Fine Nested Network for Weakly Supervised Group Activity Recognition.
IEEE Trans. Neural Networks Learn. Syst., April, 2025
Appearance-Agnostic Representation Learning for Compositional Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., April, 2025
Hierarchical Relation-augmented Representation Generalization for Few-shot Action Recognition.
CoRR, April, 2025
Global-Local Multiple Granularity Learning for Cross-Modality Visible-Infrared Person Reidentification.
IEEE Trans. Neural Networks Learn. Syst., March, 2025
Seeing What Matters: Empowering CLIP with Patch Generation-to-Selection.
CoRR, March, 2025
Learning Clustering-based Prototypes for Compositional Zero-shot Learning.
CoRR, February, 2025
Hierarchical Motion-Enhanced Matching Framework for Few-Shot Action Recognition.
IEEE Trans. Multim., 2025
GPT4Ego: Unleashing the Potential of Pre-Trained Models for Zero-Shot Egocentric Action Recognition.
IEEE Trans. Multim., 2025
MoCoLSK: Modality-Conditioned High-Resolution Downscaling for Land Surface Temperature.
IEEE Trans. Geosci. Remote. Sens., 2025
Revisiting Few-Shot Compositional Action Recognition With Knowledge Calibration.
IEEE Signal Process. Lett., 2025
Kernel-Aware Graph Prompt Learning for Few-Shot Anomaly Detection.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
3D-aware Select, Expand, and Squeeze Token for Aerial Action Recognition.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
Motion-Aware Mask Feature Reconstruction for Skeleton-Based Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., November, 2024
Spatiotemporal Decouple-and-Squeeze Contrastive Learning for Semisupervised Skeleton-Based Action Recognition.
IEEE Trans. Neural Networks Learn. Syst., August, 2024
Dilation-erosion for single-frame supervised temporal action localization.
Multim. Tools Appl., January, 2024
Semantic-Disentangled Transformer With Noun-Verb Embedding for Compositional Action Recognition.
IEEE Trans. Image Process., 2024
EventCrab: Harnessing Frame and Point Synergy for Event-based Action Recognition and Beyond.
CoRR, 2024
FTMoMamba: Motion Generation with Frequency and Text State Space Models.
CoRR, 2024
UnitedVLN: Generalizable Gaussian Splatting for Continuous Vision-Language Navigation.
CoRR, 2024
FedMLLM: Federated Fine-tuning MLLM on Multimodal Heterogeneity Data.
CoRR, 2024
GrokLST: Towards High-Resolution Benchmark and Toolkit for Land Surface Temperature Downscaling.
CoRR, 2024
HazyDet: Open-source Benchmark for Drone-view Object Detection with Depth-cues in Hazy Scenes.
CoRR, 2024
Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization.
CoRR, 2024
The SkatingVerse Workshop & Challenge: Methods and Results.
CoRR, 2024
MVP-Shot: Multi-Velocity Progressive-Alignment Framework for Few-Shot Action Recognition.
CoRR, 2024
GPT4Ego: Unleashing the Potential of Pre-trained Models for Zero-Shot Egocentric Action Recognition.
CoRR, 2024
Rethinking attribute localization for zero-shot learning.
Sci. China Inf. Sci., 2024
DoFIT: Domain-aware Federated Instruction Tuning with Alleviated Catastrophic Forgetting.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
AdaFPP: Adapt-Focused Bi-Propagating Prototype Learning for Panoramic Activity Recognition.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
DTS-TPT: Dual Temporal-Sync Test-time Prompt Tuning for Zero-shot Activity Recognition.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
Leveraging Multimodal Knowledge for Spatio-Temporal Action Localization.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
2023
Com-STAL: Compositional Spatio-Temporal Action Localization.
IEEE Trans. Circuits Syst. Video Technol., December, 2023
Supervised Learning Strategy for Spiking Neurons Based on Their Segmental Running Characteristics.
Neural Process. Lett., December, 2023
A simple yet effective image stitching with computational suture zone.
Vis. Comput., October, 2023
Progressive Instance-Aware Feature Learning for Compositional Action Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023
HiGCIN: Hierarchical Graph-Based Cross Inference Network for Group Activity Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023
Multi-Granularity Anchor-Contrastive Representation Learning for Semi-Supervised Skeleton-Based Action Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023
BCMask: a finer leaf instance segmentation with bilayer convolution mask.
Multim. Syst., June, 2023
Attack is Good Augmentation: Towards Skeleton-Contrastive Representation Learning.
CoRR, 2023
Pyramid Self-attention Polymerization Learning for Semi-supervised Skeleton-based Action Recognition.
CoRR, 2023
Spatiotemporal Decouple-and-Squeeze Contrastive Learning for Semi-Supervised Skeleton-based Action Recognition.
CoRR, 2023
Pedestrian-specific Bipartite-aware Similarity Learning for Text-based Person Retrieval.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Slowfast Diversity-aware Prototype Learning for Egocentric Action Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Foreground/Background-Masked Interaction Learning for Spatio-temporal Action Detection.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
MUP: Multi-granularity Unified Perception for Panoramic Activity Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
UATVR: Uncertainty-Adaptive Text-Video Retrieval.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
2022
Position-Aware Participation-Contributed Temporal Dynamic Model for Group Activity Recognition.
IEEE Trans. Neural Networks Learn. Syst., 2022
X-Invariant Contrastive Augmentation and Representation Learning for Semi-Supervised Skeleton-Based Action Recognition.
IEEE Trans. Image Process., 2022
Expansion-Squeeze-Excitation Fusion Network for Elderly Activity Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2022
Coherence Constrained Graph LSTM for Group Activity Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2022
Spatiotemporal Co-Attention Recurrent Neural Networks for Human-Skeleton Motion Prediction.
IEEE Trans. Pattern Anal. Mach. Intell., 2022
Wavelet-Attention CNN for image classification.
Multim. Syst., 2022
Skip-attention encoder-decoder framework for human motion prediction.
Multim. Syst., 2022
Progressive enhancement network with pseudo labels for weakly supervised temporal action localization.
J. Vis. Commun. Image Represent., 2022
Spatiotemporal Perturbation Based Dynamic Consistency for Semi-supervised Temporal Action Detection.
Proceedings of the MultiMedia Modeling - 28th International Conference, 2022
Look Less Think More: Rethinking Compositional Action Recognition.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
PNP: Robust Learning from Noisy Labels by Probabilistic Noise Prediction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
2021
Host-Parasite: Graph LSTM-in-LSTM for Group Activity Recognition.
IEEE Trans. Neural Networks Learn. Syst., 2021
Hierarchical Long Short-Term Concurrent Memory for Human Interaction Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2021
Attention-aware conditional generative adversarial networks for facial age synthesis.
Neurocomputing, 2021
2020
Bi-Modal Progressive Mask Attention for Fine-Grained Recognition.
IEEE Trans. Image Process., 2020
Facial Age Synthesis With Label Distribution-Guided Generative Adversarial Network.
IEEE Trans. Inf. Forensics Secur., 2020
Deep supervised feature selection for social relationship recognition.
Pattern Recognit. Lett., 2020
CAN-GAN: Conditioned-attention normalized GAN for face age synthesis.
Pattern Recognit. Lett., 2020
Deep multi-person kinship matching and recognition for family photos.
Pattern Recognit., 2020
Interactive Fusion of Multi-level Features for Compositional Activity Recognition.
CoRR, 2020
Data-driven Meta-set Based Fine-Grained Visual Classification.
CoRR, 2020
A Novel CNN Architecture for Real-Time Point Cloud Recognition in Road Environment.
Proceedings of the Pattern Recognition and Computer Vision, Third Chinese Conference, 2020
Cross Fusion for Egocentric Interactive Action Recognition.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020
Storyboard relational model for group activity recognition.
Proceedings of the MMAsia 2020: ACM Multimedia Asia, 2020
Data-driven Meta-set Based Fine-Grained Visual Recognition.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Social Adaptive Module for Weakly-Supervised Group Activity Recognition.
Proceedings of the Computer Vision - ECCV 2020, 2020
Web-Supervised Network with Softly Update-Drop Training for Fine-Grained Visual Classification.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Deep Ordinal Hashing With Spatial Attention.
IEEE Trans. Image Process., 2019
Social Anchor-Unit Graph Regularized Tensor Completion for Large-Scale Image Retagging.
IEEE Trans. Pattern Anal. Mach. Intell., 2019
Image annotation refinement via 2P-KNN based group sparse reconstruction.
Multim. Tools Appl., 2019
Region-Manipulated Fusion Networks for Pancreatitis Recognition.
CoRR, 2019
Temporal Action Localization Based on Temporal Evolution Model and Multiple Instance Learning.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019
Semantic BI-Embedded GRU for Fill-in-the-Blank Image Question Answering.
Proceedings of the 2nd International Conference on Computer Science and Software Engineering, 2019
2018
Image Classification With Tailored Fine-Grained Dictionaries.
IEEE Trans. Circuits Syst. Video Technol., 2018
Personalized Age Progression with Bi-Level Aging Dictionary Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2018
A Feature Selection Method for Projection Twin Support Vector Machine.
Neural Process. Lett., 2018
Two Dimensional Slow Feature Discriminant Analysis via L2,1 Norm Minimization for Feature Extraction.
KSII Trans. Internet Inf. Syst., 2018
Global and Local C3D Ensemble System for First Person Interactive Action Recognition.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018
Participation-Contributed Temporal Dynamic Model for Group Activity Recognition.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018
DotaNet: Two-Stream Match-Recurrent Neural Networks for Predicting Social Game Result.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018
2017
Tri-Clustered Tensor Completion for Social-Aware Image Tag Refinement.
IEEE Trans. Pattern Anal. Mach. Intell., 2017
Computational face reader based on facial attribute estimation.
Neurocomputing, 2017
Recovering Overlapping Partials for Monaural Perfect Harmonic Musical Sound Separation Using Modified Common Amplitude Modulation.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017
Mini Neural Networks for Effective and Efficient Mobile Album Organization.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017
Face Aging with Contextual Generative Adversarial Nets.
Proceedings of the 2017 ACM on Multimedia Conference, 2017
Multi-part boosting LSTMs for skeleton based human activity analysis.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017
Image Retrieval Based on Optimized Visual Dictionary and Adaptive Soft Assignment.
Proceedings of the Internet Multimedia Computing and Service, 2017
Concurrence-Aware Long Short-Term Sub-Memories for Person-Person Action Recognition.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017
2016
Generalized Deep Transfer Networks for Knowledge Propagation in Heterogeneous Domains.
ACM Trans. Multim. Comput. Commun. Appl., 2016
Instance-Aware Hashing for Multi-Label Image Retrieval.
IEEE Trans. Image Process., 2016
Kinship-Guided Age Progression.
Pattern Recognit., 2016
Age progression: Current technologies and applications.
Neurocomputing, 2016
Computational Face Reader.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016
2015
Deep kinship verification.
Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015
What Shall I Look Like after N Years?
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
Weakly-Shared Deep Transfer Networks for Heterogeneous-Domain Knowledge Propagation.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
Deep Face Beautification.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
Partially Common-Semantic Pursuit for RGB-D Object Recognition.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
Task-Driven Feature Pooling for Image Classification.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015
Personalized Age Progression with Aging Dictionary.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015