2025

Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction.

[DOI]

Ailin Huang

Boyong Wu

CoRR, February, 2025

2023

Robust Learning for Multi-party Addressee Recognition with Discrete Addressee Codebook.

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

2022

AliCHI: A Large-scale Multi-modal Dataset and Automated Evaluation Tool for Human-like Dialogue Systems.

[DOI]

CoRR, 2022