2025
Reliability-Guided Hierarchical Memory Network for Scribble-Supervised Video Object Segmentation.
IEEE Trans. Neural Networks Learn. Syst., April, 2025

Learning Compatible Multi-Prize Subnetworks for Asymmetric Retrieval.
CoRR, April, 2025

Prototype Perturbation for Relaxing Alignment Constraints in Backward-Compatible Learning.
CoRR, March, 2025

ZeroPose: CAD-Prompted Zero-Shot Object 6D Pose Estimation in Cluttered Scenes.
IEEE Trans. Circuits Syst. Video Technol., February, 2025

ZeroBP: Learning Position-Aware Correspondence for Zero-shot 6D Pose Estimation in Bin-Picking.
CoRR, February, 2025

Improving Federated Domain Generalization Through Dynamical Weights Calculated from Data Influences on Global Model Update.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Instruction-guided Multi-Granularity Segmentation and Captioning with Large Multimodal Model.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Robust Tracking via Fully Exploring Background Prior Knowledge.
IEEE Trans. Circuits Syst. Video Technol., May, 2024

Cross-Modality Proposal-Guided Feature Mining for Unregistered RGB-Thermal Pedestrian Detection.
IEEE Trans. Multim., 2024

MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking.
CoRR, 2024

Driving Referring Video Object Segmentation with Vision-Language Pre-trained Models.
CoRR, 2024

Motion-aware Latent Diffusion Models for Video Frame Interpolation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Simplifying Cross-modal Interaction via Modality-Shared Features for RGBT Tracking.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Interaction-based Retrieval-augmented Diffusion Models for Protein-specific 3D Molecule Generation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

RTracker: Recoverable Tracking via PN Tree Structured Memory.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Bilateral Event Mining and Complementary for Event Stream Super-Resolution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Robust 3D Tracking with Quality-Aware Shape Completion.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Siamese residual network for efficient visual tracking.
Inf. Sci., May, 2023

Channel and Spatial Relation-Propagation Network for RGB-Thermal Semantic Segmentation.
CoRR, 2023

Reliability-Hierarchical Memory Network for Scribble-Supervised Video Object Segmentation.
CoRR, 2023

Joint Visual Grounding and Tracking with Natural Language Specification.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
SiamCorners: Siamese Corner Networks for Visual Tracking.
IEEE Trans. Multim., 2022

Object Tracking via Spatial-Temporal Memory Network.
IEEE Trans. Circuits Syst. Video Technol., 2022

Target-Aware State Estimation for Visual Tracking.
IEEE Trans. Circuits Syst. Video Technol., 2022

Noise-Suppressing Deep Tracking.
IEEE Trans. Circuits Syst. Video Technol., 2022

Global Tracking via Ensemble of Local Trackers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Adaptive ensemble perception tracking.
Neural Networks, 2021

Learning dual-margin model for visual tracking.
Neural Networks, 2021

Interactive convolutional learning for visual tracking.
Knowl. Based Syst., 2021

Crop-Transform-Paste: Self-Supervised Learning for Visual Tracking.
CoRR, 2021

Saliency-Associated Object Tracking.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Dual-regression model for visual tracking.
Neural Networks, 2020

SiamAtt: Siamese attention network for visual tracking.
Knowl. Based Syst., 2020

LSOTB-TIR: A Large-Scale High-Diversity Thermal Infrared Object Tracking Benchmark.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020