A Summary on GUI Agents with Foundation Models Enhanced by Reinforcement Learning.
CoRR, April, 2025
EffOWT: Transfer Visual Language Models to Open-World Tracking Efficiently and Effectively.
CoRR, April, 2025
ArtCrafter: Text-Image Aligning Style Transfer via Embedding Reframing.
CoRR, January, 2025
Integrating Low-Level Visual Cues for Enhanced Unsupervised Semantic Segmentation.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
PCIE_EgoHandPose Solution for EgoExo4D Hand Pose Challenge.
CoRR, 2024
ACTrack: Adding Spatio-Temporal Condition for Visual Object Tracking.
CoRR, 2024
2nd Workshop on Maritime Computer Vision (MaCVi) 2024: Challenge Results.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, 2024
ReIDTracker_Sea: Multi-Object Tracking in Maritime Computer Vision.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, 2024
Technical Report for Argoverse Challenges on Unified Sensor-based Detection, Tracking, and Forecasting.
CoRR, 2023
ReIDTracker Sea: the technical report of BoaTrack and SeaDronesSee-MOT challenge at MaCVi of WACV24.
CoRR, 2023
1st Place Solution for CVPR2023 BURST Long Tail and Open World Challenges.
CoRR, 2023
ReIDTrack: Multi-Object Track and Segmentation Without Motion.
CoRR, 2023
Multi-Object Tracking by Self-supervised Learning Appearance Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Lane detection with Position Embedding.
CoRR, 2022