Pretrained Image-Text Models are Secretly Video Captioners.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025
InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning.
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
GEM: Generating Engaging Multimodal Content.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
Knowledge from Large-Scale Protein Contact Prediction Models Can Be Transferred to the Data-Scarce RNA Contact Prediction Task.
Proceedings of the Pattern Recognition - 27th International Conference, 2024
Working Memory Identifies Reasoning Limits in Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
InfiMM: Advancing Multimodal Understanding with an Open-Sourced Visual Language Model.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Expedited Training of Visual Conditioned Language Generation via Redundancy Reduction.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
SimVLG: Simple and Efficient Pretraining of Visual Language Generative Models.
CoRR, 2023
Bootstrapping Vision-Language Learning with Decoupled Language Pre-training.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Contrastive Learning for Prompt-based Few-shot Language Learners.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Embedding Hallucination for Few-shot Language Fine-tuning.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
T-Cell Receptor-Peptide Interaction Prediction with Physical Model Augmented Pseudo-Labeling.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022
Label Hallucination for Few-Shot Classification.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
MetaPix: Domain transfer for semantic segmentation by meta pixel weighting.
Image Vis. Comput., 2021
DIRECT: RNA contact predictions by integrating structural patterns.
BMC Bioinform., 2019
RBind: computational network method to predict RNA binding sites.
Bioinform., 2018