We stand with Ukraine

Panagiota Karanasou

ORCID: 0000-0003-1939-4161

Affiliations:

University of Cambridge, UK
University of Paris-Sud, Orsay, France

According to our database, Panagiota Karanasou authored at least 34 papers between 2010 and 2024.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Links

Online presence:

on orcid.org
on www3.eng.cam.ac.uk

Bibliography

2024

Speak & Improve Challenge 2025: Tasks and Baseline Systems.

[…], […], […], […], Penny Karanasou, Mark J. F. Gales, […]

CoRR, 2024

2023

A Comparative Analysis of Pretrained Language Models for Text-to-Speech.

Marcel Granero Moya, Penny Karanasou, […], Bastian Schnell, […], […], […]

Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023

Controllable Emphasis with zero data for text-to-speech.

[…], […], Ekaterina Peterova, Alessandro Lombardi, […], Arent van Korlaar, […], […], […], Mateusz Lajszczak, Penny Karanasou, Antonio Bonafonte, […], […]

Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023

eCat: An End-to-End Model for Multi-Speaker TTS & Many-to-Many Fine-Grained Prosody Transfer.

[…], […], Bastian Schnell, Penny Karanasou, Marcel Granero Moya, […], […], […], […], […]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022

Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody.

[…], […], Mateusz Lajszczak, […], […], […], […], Penny Karanasou

CoRR, 2022

CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer.

[…], Penny Karanasou, Mateusz Lajszczak, […], […], […], […], Arent van Korlaar, […], […]

CoRR, 2022

Cross-lingual Style Transfer with Conditional Prior VAE and Style Loss.

Dino Rattcliffe, […], Alex Mansbridge, Penny Karanasou, […], […]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody.

[…], Syed Ammar Abbas, Mateusz Lajszczak, […], […], […], […], Penny Karanasou

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer.

[…], Penny Karanasou, Mateusz Lajszczak, Syed Ammar Abbas, […], […], […], Arent van Korlaar, […], […]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021

Multi-Scale Spectrogram Modelling for Neural Text-to-Speech.

[…], Bajibabu Bollepalli, […], […], Penny Karanasou, […], […], […], […]

Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021

A Learned Conditional Prior for the VAE Acoustic Space of a TTS System.

Penny Karanasou, […], […], […], […], […], Jaime Lorenzo-Trueba, […]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech.

[…], […], […], […], […], Penny Karanasou, […]

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

Camp: A Two-Stage Approach to Modelling Prosody in Context.

[…], […], […], Jaime Lorenzo-Trueba, […], […], […], Penny Karanasou, […]

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

2019

Cross-lingual Transfer Learning for Japanese Named Entity Recognition.

[…], Penny Karanasou, […], Dietrich Klakow

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

2018

Improving Interpretability and Regularization in Deep Learning.

[…], Mark J. F. Gales, […], Penny Karanasou, […]

IEEE ACM Trans. Audio Speech Lang. Process., 2018

Selecting Machine-Translated Data for Quick Bootstrapping of a Natural Language Understanding System.

[…], Penny Karanasou, Rajen Chatterjee

Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

2017

I-Vectors and Structured Neural Networks for Rapid Adaptation of Acoustic Models.

Penny Karanasou, […], Mark J. F. Gales, Philip C. Woodland

IEEE ACM Trans. Audio Speech Lang. Process., 2017

2016

Stimulated Deep Neural Network for Speech Recognition.

[…], Penny Karanasou, Mark J. F. Gales, […]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Selection of Multi-Genre Broadcast Data for the Training of Automatic Speech Recognition Systems.

Pierre Lanchantin, Mark J. F. Gales, Penny Karanasou, […], […], […], Philip C. Woodland, […]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Combining i-vector representation and structured neural networks for rapid adaptation.

[…], Penny Karanasou, Mark J. F. Gales

Proceedings of the 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, 2016

Improved DNN-based segmentation for multi-genre broadcast audio.

[…], […], Philip C. Woodland, Mark J. F. Gales, Panagiota Karanasou, Pierre Lanchantin, […], […]

Proceedings of the 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, 2016

2015

I-vector estimation using informative priors for adaptation of deep neural networks.

Penny Karanasou, Mark J. F. Gales, Philip C. Woodland

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

An investigation into speaker informed DNN front-end for LVCSR.

[…], Penny Karanasou, […]

Proceedings of the 2015 IEEE International Conference on Acoustics, Speech and Signal Processing, 2015

Cambridge university transcription systems for the multi-genre broadcast challenge.

Philip C. Woodland, […], […], […], Mark J. F. Gales, Penny Karanasou, Pierre Lanchantin, […]

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

The development of the Cambridge University alignment systems for the multi-genre broadcast challenge.

Pierre Lanchantin, Mark J. F. Gales, Penny Karanasou, […], […], […], Philip C. Woodland, […]

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Speaker diarisation and longitudinal linking in multi-genre broadcast data.

Penny Karanasou, Mark J. F. Gales, Pierre Lanchantin, […], […], […], Philip C. Woodland, […]

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014

Adaptation of deep neural network acoustic models using factorised i-vectors.

Penny Karanasou, […], Mark J. F. Gales, Philip C. Woodland

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

2013

Phonemic variability and confusability in pronunciation modeling for automatic speech recognition. (Variabilité et confusabilité phonémique pour les modèles de prononciations au sein d'un système de reconnaissance automatique de la parole).

Panagiota Karanasou

PhD thesis, 2013

Discriminative training of a phoneme confusion model for a dynamic lexicon in ASR.

Penny Karanasou, […], Thomas Lavergne, […]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

2012

Discriminatively trained phoneme confusion model for keyword spotting.

Panagiota Karanasou, […], Dimitra Vergyri, […], […]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2011

Pronunciation variants generation using SMT-inspired approaches.

Panagiota Karanasou, […]

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2011

Measuring the Confusability of Pronunciations in Speech Recognition.

Panagiota Karanasou, […], […]

Proceedings of the Finite-State Methods and Natural Language Processing, 2011

Automatic Generation of a Pronunciation Dictionary with Rich Variation Coverage Using SMT Methods.

Panagiota Karanasou, […]

Proceedings of the Computational Linguistics and Intelligent Text Processing, 2011

2010

Comparing SMT Methods for Automatic Generation of Pronunciation Variants.

Panagiota Karanasou, […]

Proceedings of the Advances in Natural Language Processing, 2010
