2024
G-DIG: Towards Gradient-based DIverse and hiGh-quality Instruction Data Selection for Machine Translation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Bilingual attention based neural machine translation.
Appl. Intell., February, 2023

BigVideo: A Large-scale Video Subtitle Translation Dataset for Multimodal Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Towards Robust Neural Machine Translation with Iterative Scheduled Data-Switch Training.
Proceedings of the 29th International Conference on Computational Linguistics, 2022