Shuo Xin
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
OmniVLM: A Token-Compressed, Sub-Billion-Parameter Vision-Language Model for Efficient On-Device Inference.
CoRR, 2024
Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models.
CoRR, 2024
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024
A Robotic-centric Paradigm for 3D Human Tracking Under Complex Environments Using Multi-modal Adaptation.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024
Multi-modal 3D Human Tracking for Robots in Complex Environment with Siamese Point-Video Transformer.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024
SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object Tracking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024