Ran Xu
Orcid: 0009-0004-4585-5261Affiliations:
- Salesforce Research, Salesforce AI Research,
According to our database1,
Ran Xu
authored at least 52 papers
between 2012 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs.
CoRR, 2024
xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations.
CoRR, 2024
MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens.
CoRR, 2024
Proceedings of the IEEE International Conference on Robotics and Automation, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-Modal Reasoning.
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
X-InstructBLIP: A Framework for aligning X-Modal instruction-aware representations to LLMs and Emergent Cross-modal Reasoning.
CoRR, 2023
CoRR, 2023
UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for 3D Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Mask-Free OVIS: Open-Vocabulary Instance Segmentation without Manual Mask Annotations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
ULIP: Learning Unified Representation of Language, Image and Point Cloud for 3D Understanding.
CoRR, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the 29th International Conference on Computational Linguistics, 2022
Proceedings of the 33rd British Machine Vision Conference 2022, 2022
2021
CoRR, 2021
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
2020
2019
2018
Pattern Recognit. Lett., 2018
2016
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2016
2015
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015
Jointly Modeling Deep Video and Compositional Text to Bridge Vision and Language in a Unified Framework.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015
2014
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014
2012
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012
Proceedings of the Articulated Motion and Deformable Objects, 2012