Transforming Vision Transformer: Towards Efficient Multi-Task Asynchronous Learning.
CoRR, January, 2025
EVLM: An Efficient Vision-Language Model for Visual Understanding.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Transforming Vision Transformer: Towards Efficient Multi-Task Asynchronous Learner.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
AdaLog: Post-training Quantization for Vision Transformers with Adaptive Logarithm Quantizer.
Proceedings of the Computer Vision - ECCV 2024, 2024