Bin Xiao
Orcid: 0000-0001-6477-5911Affiliations:
- Microsoft Cloud+AI, Microsoft Research Asia, China
- South China University of Technology, School of Electronic and Information Engineering, China
According to our database1,
Bin Xiao
authored at least 35 papers
between 2014 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion.
CoRR, 2024
Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search.
CoRR, 2024
i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
CoRR, 2023
i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data.
CoRR, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
Multimodal Adaptive Distillation for Leveraging Unimodal Encoders for Vision-Language Tasks.
CoRR, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training.
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
2021
IEEE Trans. Pattern Anal. Mach. Intell., 2021
CoRR, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
2020
Bottom-Up Human Pose Estimation by Ranking Heatmap-Guided Adaptive Keypoint Estimates.
CoRR, 2020
HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
2018
Proceedings of the Computer Vision - ECCV 2018, 2018
2014
Proc. VLDB Endow., 2014