VidGen-1M: A Large-Scale Dataset for Text-to-video Generation.
CoRR, 2024
EVALALIGN: Supervised Fine-Tuning Multimodal LLMs with Human-Aligned Data for Evaluating Text-to-Image Models.
CoRR, 2024
Guiding ChatGPT to Generate Salient Domain Summaries.
Proceedings of the International Joint Conference on Neural Networks, 2024
STRA: A Simple Token Replacement Strategy Alleviating Exposure Bias in Text Generation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
Contrastive Learning with High-Quality and Low-Quality Augmented Data for Query-Focused Summarization.
Proceedings of the IEEE International Conference on Acoustics, 2024
BVRCC: Bootstrapping Video Retrieval via Cross-Matching Correction.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2024, 2024
An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation.
Proceedings of the Computer Vision - ECCV 2024, 2024
Improving Copy-oriented Text Generation via EDU Copy Mechanism.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Diffusion Language Model with Query-Document Relevance for Query-Focused Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023