TIPAA-SSL: Text Independent Phone-to-Audio Alignment based on Self-Supervised Learning and Knowledge Transfer.
CoRR, 2024
Flowchase: a Mobile Application for Pronunciation Training.
Proceedings of the 9th Workshop on Speech and Language Technology in Education, 2023
MUST&P-SRL: Multi-lingual and Unified Syllabification in Text and Phonetic Domains for Speech Representation Learning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023, 2023
Where Is My Mind (Looking at)? A Study of the EEG-Visual Attention Relationship.
Informatics, 2022
Where Is My Mind (looking at)? Predicting Visual Attention from Brain Activity.
CoRR, 2022
Controlling the emotional expressiveness of synthetic speech: a deep learning approach.
4OR, 2022
ICE-Talk 2: Interface for Controllable Expressive TTS with perceptual assessment tool.
Softw. Impacts, 2021
Analysis and Assessment of Controllability of an Expressive Deep Learning-Based TTS System.
Informatics, 2021
Modulated Fusion using Transformer for Linguistic-Acoustic Emotion Recognition.
CoRR, 2020
A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis.
CoRR, 2020
Laughter Synthesis: Combining Seq2seq Modeling with Transfer Learning.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
ICE-Talk: An Interface for a Controllable Expressive Talking Machine.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Neural Speech Synthesis with Style Intensity Interpolation: A Perceptual Analysis.
Proceedings of the Companion of the 2020 ACM/IEEE International Conference on Human-Robot Interaction, 2020
The Theory behind Controllable Expressive Speech Synthesis: a Cross-disciplinary Approach.
CoRR, 2019
Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis Through Audio Analysis.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Emotional Speech Datasets for English Speech Synthesis Purpose: A Review.
Proceedings of the Intelligent Systems and Applications, 2019
Exploring Transfer Learning for Low Resource Emotional TTS.
Proceedings of the Intelligent Systems and Applications, 2019
A Methodology for Controlling the Emotional Expressiveness in Synthetic Speech - a Deep Learning approach.
Proceedings of the 8th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2019
The Emotional Voices Database: Towards Controlling the Emotion Dimension in Voice Generation Systems.
CoRR, 2018
ASR-based Features for Emotion Recognition: A Transfer Learning Approach.
CoRR, 2018