Tina: Think, Interaction, and Action Framework for Zero-Shot Vision Language Navigation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
Visual Graph Reasoning Network.
Proceedings of the IEEE International Conference on Acoustics, 2023
One-Stage Visual Grounding via Semantic-Aware Feature Filter.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021