2025
Text-guided dynamic mouth motion capturing for person-generic talking face generation.
Knowl. Based Syst., 2025

CDTR: Semantic Alignment for Video Moment Retrieval Using Concept Decomposition Transformer.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Set of Diverse Queries With Uncertainty Regularization for Composed Image Retrieval.
IEEE Trans. Circuits Syst. Video Technol., October, 2024

Instance-Dictionary Learning for Open-World Object Detection in Autonomous Driving Scenarios.
IEEE Trans. Circuits Syst. Video Technol., May, 2024

Align and Retrieve: Composition and Decomposition Learning in Image Retrieval With Text Feedback.
IEEE Trans. Multim., 2024

Runge-Kutta Guided Feature Augmentation for Few-Sample Learning.
IEEE Trans. Multim., 2024

Towards Robust Person Re-Identification by Adversarial Training With Dynamic Attack Strategy.
IEEE Trans. Multim., 2024

Adaptive Multi-scale Degradation-Based Attack for Boosting the Adversarial Transferability.
IEEE Trans. Multim., 2024

Improving Pre-Trained Model-Based Speech Emotion Recognition From a Low-Level Speech Feature Perspective.
IEEE Trans. Multim., 2024

Boosting Adversarial Training with Hardness-Guided Attack Strategy.
IEEE Trans. Multim., 2024

Semantics Disentangling for Cross-Modal Retrieval.
IEEE Trans. Image Process., 2024

LoLDU: Low-Rank Adaptation via Lower-Diag-Upper Decomposition for Parameter-Efficient Fine-Tuning.
CoRR, 2024

SVFit: Parameter-Efficient Fine-Tuning of Large Pre-Trained Models Using Singular Values.
CoRR, 2024

DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion.
CoRR, 2024

Cascaded Adversarial Attack: Simultaneously Fooling Rain Removal and Semantic Segmentation Networks.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

CDPNet: Cross-Modal Dual Phases Network for Point Cloud Completion.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Less is Better: Exponential Loss for Cross-Modal Matching.
IEEE Trans. Circuits Syst. Video Technol., September, 2023

Multi-Modal Transformer With Global-Local Alignment for Composed Query Image Retrieval.
IEEE Trans. Multim., 2023

Quaternion Representation Learning for cross-modal matching.
Knowl. Based Syst., 2023

Instance-Variant Loss with Gaussian RBF Kernel for 3D Cross-modal Retriveal.
CoRR, 2023

Learning Semantic-Aware Knowledge Guidance for Low-Light Image Enhancement.
CoRR, 2023

Cross-modal Consistency Learning with Fine-grained Fusion Network for Multimodal Fake News Detection.
Proceedings of the ACM Multimedia Asia 2023, 2023

Open-Scenario Domain Adaptive Object Detection in Autonomous Driving.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Learning Semantic-Aware Knowledge Guidance for Low-Light Image Enhancement.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Universal Weighting Metric Learning for Cross-Modal Retrieval.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Semantic guided knowledge graph for large-scale zero-shot learning.
J. Vis. Commun. Image Represent., 2022

Semantic Enhanced Knowledge Graph for Large-Scale Zero-Shot Learning.
CoRR, 2022

2021
Semantic Enhanced Cross-modal GAN for Zero-shot Learning.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

Meta Self-Paced Learning for Cross-Modal Matching.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

2020
Scene graph generation via multi-relation classification and cross-modal attention coordinator.
Proceedings of the MMAsia 2020: ACM Multimedia Asia, 2020

Graph-based variational auto-encoder for generalized zero-shot learning.
Proceedings of the MMAsia 2020: ACM Multimedia Asia, 2020

Universal Weighting Metric Learning for Cross-Modal Matching.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Residual Graph Convolutional Networks for Zero-Shot Learning.
Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019, 2019

2005
On Intelligent Agent-based Decoy Systems.
Proceedings of The 2005 International Conference on Security and Management, 2005