Anwen Hu
Orcid: 0000-0001-8839-4996
According to our database1,
Anwen Hu
authored at least 30 papers
between 2019 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding.
CoRR, 2024
mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models.
CoRR, 2024
TinyChart: Efficient Chart Understanding with Visual Token Merging and Program-of-Thoughts Learning.
CoRR, 2024
CoRR, 2024
VG-Annotator: Vision-Language Models as Query Annotators for Unsupervised Visual Grounding.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
mPLUG-OwI2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
Mach. Intell. Res., April, 2023
mPLUG-PaperOwl: Scientific Diagram Analysis with the Multimodal Large Language Model.
CoRR, 2023
mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration.
CoRR, 2023
UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model.
CoRR, 2023
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding.
CoRR, 2023
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks.
CoRR, 2023
CoRR, 2023
Learning Descriptive Image Captioning via Semipermeable Maximum Likelihood Estimation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
CoRR, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
2021
CoRR, 2021
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Document-Level Named Entity Recognition by Incorporating Global and Neighbor Features.
Proceedings of the Information Retrieval - 25th China Conference, 2019