Panagiota Karanasou

Orcid: 0000-0003-1939-4161

Affiliations:
  • University of Cambridge, UK
  • University of Paris-Sud, Orsay, France


According to our database1, Panagiota Karanasou authored at least 33 papers between 2010 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
A Comparative Analysis of Pretrained Language Models for Text-to-Speech.
Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023

Controllable Emphasis with zero data for text-to-speech.
Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023

eCat: An End-to-End Model for Multi-Speaker TTS & Many-to-Many Fine-Grained Prosody Transfer.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022
Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody.
CoRR, 2022

CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer.
CoRR, 2022

Cross-lingual Style Transfer with Conditional Prior VAE and Style Loss.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
Multi-Scale Spectrogram Modelling for Neural Text-to-Speech.
Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021

A Learned Conditional Prior for the VAE Acoustic Space of a TTS System.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech.
Proceedings of the IEEE International Conference on Acoustics, 2021

Camp: A Two-Stage Approach to Modelling Prosody in Context.
Proceedings of the IEEE International Conference on Acoustics, 2021

2019
Cross-lingual Transfer Learning for Japanese Named Entity Recognition.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

2018
Improving Interpretability and Regularization in Deep Learning.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Selecting Machine-Translated Data for Quick Bootstrapping of a Natural Language Understanding System.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

2017
I-Vectors and Structured Neural Networks for Rapid Adaptation of Acoustic Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

2016
Stimulated Deep Neural Network for Speech Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Selection of Multi-Genre Broadcast Data for the Training of Automatic Speech Recognition Systems.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Combining i-vector representation and structured neural networks for rapid adaptation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Improved DNN-based segmentation for multi-genre broadcast audio.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
I-vector estimation using informative priors for adaptation of deep neural networks.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

An investigation into speaker informed DNN front-end for LVCSR.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Cambridge university transcription systems for the multi-genre broadcast challenge.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

The development of the cambridge university alignment systems for the multi-genre broadcast challenge.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Speaker diarisation and longitudinal linking in multi-genre broadcast data.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Adaptation of deep neural network acoustic models using factorised i-vectors.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

2013
Phonemic variability and confusability in pronunciation modeling for automatic speech recognition. (Variabilité et confusabilité phonémique pour les modèles de prononciations au sein d'un système de reconnaissance automatique de la parole).
PhD thesis, 2013

Discriminative training of a phoneme confusion model for a dynamic lexicon in ASR.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

2012
Discriminatively trained phoneme confusion model for keyword spotting.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2011
Pronunciation variants generation using SMT-inspired approaches.
Proceedings of the IEEE International Conference on Acoustics, 2011

Measuring the Confusability of Pronunciations in Speech Recognition.
Proceedings of the Finite-State Methods and Natural Language Processing, 2011

Automatic Generation of a Pronunciation Dictionary with Rich Variation Coverage Using SMT Methods.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2011

2010
Comparing SMT Methods for Automatic Generation of Pronunciation Variants.
Proceedings of the Advances in Natural Language Processing, 2010


  Loading...