Time and Tokens: Benchmarking End-to-End Speech Dysfluency Detection.
CoRR, 2024
YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection.
CoRR, 2024
VoxGenesis: Unsupervised Discovery of Latent Speaker Manifold for Speech Synthesis.
CoRR, 2024
Towards Hierarchical Spoken Language Dysfluency Modeling.
CoRR, 2024
Stutter-Solver: End-To-End Multi-Lingual Dysfluency Detection.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024
SSDM: Scalable Speech Dysfluency Modeling.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Enhancing GAN-based Vocoders with Contrastive Learning Under Data-Limited Condition.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024
Towards Hierarchical Spoken Language Disfluency Modeling.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024
Unsupervised TTS Acoustic Modeling for TTS With Conditional Disentangled Sequential VAE.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Deep Speech Synthesis from MRI-Based Articulatory Representations.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Articulatory Representation Learning via Joint Factor Analysis and Neural Matrix Factorization.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023
Unconstrained Dysfluency Modeling for Dysfluent Speech Transcription and Detection.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Av-Data2Vec: Self-Supervised Learning of Audio-Visual Speech Representations with Contextualized Target Representations.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
UTTS: Unsupervised TTS with Conditional Disentangled Sequential Variational Auto-encoder.
CoRR, 2022
Towards Improved Zero-shot Voice Conversion with Conditional DSVAE.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Deep Neural Convolutive Matrix Factorization for Articulatory Representation Decomposition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Robust Disentangled Variational Speech Representation Learning for Zero-Shot Voice Conversion.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022
Detection and Evaluation of Human and Machine Generated Speech in Spoofing Attacks on Automatic Speaker Verification Systems.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Masked Proxy Loss for Text-Independent Speaker Verification.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Mask Proxy Loss for Text-Independent Speaker Recognition.
CoRR, 2020
Common mode current suppression for permanent magnet synchronous motor based on model predictive control.
Proceedings of the Thirteenth International Conference on Ecological Vehicles and Renewable Energies, 2018