Diffusion Synthesizer for Efficient Multilingual Speech to Speech Translation.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
Voice Toxicity Detection Using Multi-Task Learning.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024
Audiovisual Inputs for Learning Robust, Real-time Facial Animation with Lip Sync.
Proceedings of the 16th ACM SIGGRAPH Conference on Motion, Interaction and Games, 2023
Fast Facial Animation from Video.
Proceedings of the SIGGRAPH 2021: Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2021