2024
MiniCPM-V: A GPT-4V Level MLLM on Your Phone.
CoRR, 2024

GUICourse: From General Vision Language Models to Versatile GUI Agents.
CoRR, 2024

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies.
CoRR, 2024

Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Reformulating Vision-Language Foundation Models and Datasets Towards Universal Multimodal Assistants.
CoRR, 2023