2024
UNIT-DSR: Dysarthric Speech Reconstruction System Using Speech Unit Normalization.
Proceedings of the IEEE International Conference on Acoustics, 2024
Exploiting Audio-Visual Features with Pretrained AV-HuBERT for Multi-Modal Dysarthric Speech Reconstruction.
Proceedings of the IEEE International Conference on Acoustics, 2024
2022
Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using β-VAE.
CoRR, 2022
Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using β-VAE.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
VCVTS: Multi-Speaker Video-to-Speech Synthesis Via Cross-Modal Knowledge Transfer from Voice Conversion.
Proceedings of the IEEE International Conference on Acoustics, 2022
Speaker Identity Preservation in Dysarthric Speech Reconstruction by Adversarial Speaker Adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
Speech Emotion Recognition Using Sequential Capsule Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Any-to-Many Voice Conversion With Location-Relative Sequence-to-Sequence Modeling.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Improved End-to-End Dysarthric Speech Recognition via Meta-learning Based Model Re-initialization.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021
Learning Explicit Prosody Models and Deep Speaker Embeddings for Atypical Voice Conversion.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Unsupervised Domain Adaptation for Dysarthric Speech Detection via Domain Adversarial Training and Mutual Information Minimization.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-Shot Voice Conversion.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
FCL-Taco2: Towards Fast, Controllable and Lightweight Text-to-Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
End-To-End Voice Conversion Via Cross-Modal Knowledge Distillation for Dysarthric Speech Reconstruction.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
End-To-End Accent Conversion Without Using Native Utterances.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2018
Learning soft mask with DNN and DNN-SVM for multi-speaker DOA estimation using an acoustic vector sensor.
J. Frankl. Inst., 2018
Joint Noise and Reverberation Adaptive Learning for Robust Speaker DOA Estimation with an Acoustic Vector Sensor.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
2017
Prediction of synergistic anti-cancer drug combinations based on drug target network and drug induced gene expression profiles.
Artif. Intell. Medicine, 2017
A deep convolutional encoder-decoder model for robust speech dereverberation.
Proceedings of the 22nd International Conference on Digital Signal Processing, 2017
Learning a robust DOA estimation model with acoustic vector sensor cues.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
2016
A robust DBN-vector based speaker verification system under channel mismatch conditions.
Proceedings of the 2016 IEEE International Conference on Digital Signal Processing, 2016
Accurate and robust device-free localization approach via sparse representation in presence of noise and outliers.
Proceedings of the 2016 IEEE International Conference on Digital Signal Processing, 2016
2014
Near-field source localization in complex indoor environment using uniform circular array.
Proceedings of the IEEE China Summit & International Conference on Signal and Information Processing, 2014