Towards an Interpretable Representation of Speaker Identity via Perceptual Voice Qualities.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024
PerMod: Perceptually Grounded Voice Modification With Latent Diffusion Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023