2025

Think on your feet: Seamless Transition between Human-like Locomotion in Response to Changing Commands.

[DOI]

Huaxing Huang

Wenhao Cui

CoRR, February, 2025

When End-to-End is Overkill: Rethinking Cascaded Speech-to-Text Translation.

[DOI]

CoRR, February, 2025

A Unit-based System and Dataset for Expressive Direct Speech-to-Speech Translation.

[DOI]

CoRR, February, 2025

2024

DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models.

[DOI]

CoRR, 2024

Adapting Humanoid Locomotion over Challenging Terrain via Two-Phase Training.

[DOI]

Proceedings of the Conference on Robot Learning, 6-9 November 2024, Munich, Germany., 2024

2023

DiCLET-TTS: Diffusion Model Based Cross-Lingual Emotion Transfer for Text-to-Speech - A Study Between English and Mandarin.

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2023

ChatDB: Augmenting LLMs with Databases as Their Symbolic Memory.

[DOI]

CoRR, 2023

Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

ViP3D: End-to-End Visual Trajectory Prediction via 3D Agent Queries.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Controllable and Lossless Non-Autoregressive End-to-End Text-to-Speech.

[DOI]

CoRR, 2022

2021

Neural Dubber: Dubbing for Silent Videos According to Scripts.

[DOI]

CoRR, 2021

Neural Dubber: Dubbing for Videos According to Scripts.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

CVC: Contrastive Learning for Non-Parallel Voice Conversion.

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

FastSpeech 2: Fast and High-Quality End-to-End Text to Speech.

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

2018

Incorporation of a decision space diversity maintenance mechanism into MOEA/D for multi-modal multi-objective optimization.

[DOI]

Chenxu Hu

Hisao Ishibuchi

Proceedings of the Genetic and Evolutionary Computation Conference Companion, 2018