2025
Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction.
CoRR, February 2025

2023
Robust Learning for Multi-party Addressee Recognition with Discrete Addressee Codebook.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

2022
AliCHI: A Large-scale Multi-modal Dataset and Automated Evaluation Tool for Human-like Dialogue Systems.
CoRR, 2022