Think on your feet: Seamless Transition between Human-like Locomotion in Response to Changing Commands.
CoRR, February, 2025
When End-to-End is Overkill: Rethinking Cascaded Speech-to-Text Translation.
CoRR, February, 2025
A Unit-based System and Dataset for Expressive Direct Speech-to-Speech Translation.
CoRR, February, 2025
DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models.
CoRR, 2024
Adapting Humanoid Locomotion over Challenging Terrain via Two-Phase Training.
Proceedings of the Conference on Robot Learning, 6-9 November 2024, Munich, Germany, 2024
DiCLET-TTS: Diffusion Model Based Cross-Lingual Emotion Transfer for Text-to-Speech - A Study Between English and Mandarin.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
ChatDB: Augmenting LLMs with Databases as Their Symbolic Memory.
CoRR, 2023
Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
ViP3D: End-to-End Visual Trajectory Prediction via 3D Agent Queries.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Controllable and Lossless Non-Autoregressive End-to-End Text-to-Speech.
CoRR, 2022
Neural Dubber: Dubbing for Silent Videos According to Scripts.
CoRR, 2021
Neural Dubber: Dubbing for Videos According to Scripts.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
CVC: Contrastive Learning for Non-Parallel Voice Conversion.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
FastSpeech 2: Fast and High-Quality End-to-End Text to Speech.
Proceedings of the 9th International Conference on Learning Representations, 2021
Incorporation of a decision space diversity maintenance mechanism into MOEA/D for multi-modal multi-objective optimization.
Proceedings of the Genetic and Evolutionary Computation Conference Companion, 2018