Pirros Tsiakoulis

According to our database, Pirros Tsiakoulis authored at least 67 papers between 2004 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.




Improved Text Emotion Prediction Using Combined Valence and Arousal Ordinal Classification.
CoRR, 2024

Improved Text Emotion Prediction Using Combined Valence and Arousal Ordinal Classification.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2024

Low-Resource Cross-Domain Singing Voice Synthesis via Reduced Self-Supervised Speech Representations.
Proceedings of the IEEE International Conference on Acoustics, 2024

Controllable speech synthesis by learning discrete phoneme-level prosodic representations.
Speech Commun., 2023

Generating Multilingual Gender-Ambiguous Text-to-Speech Voices.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Investigating Content-Aware Neural Text-to-Speech MOS Prediction Using Prosodic and Linguistic Features.
Proceedings of the IEEE International Conference on Acoustics, 2023

Predicting phoneme-level prosody latents using AR and flow-based Prior Networks for expressive speech synthesis.
CoRR, 2022

Learning utterance-level representations through token-level acoustic latents prediction for Expressive Speech Synthesis.
CoRR, 2022

Generating Gender-Ambiguous Text-to-Speech Voices.
CoRR, 2022

Fine-grained Noise Control for Multispeaker Speech Synthesis.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Self supervised learning for robust voice cloning.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Karaoker: Alignment-free singing voice synthesis with speech training data.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Rapping-Singing Voice Synthesis based on Phoneme-level Prosody Control.
Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021

Improved Prosodic Clustering for Multispeaker and Speaker-Independent Phoneme-Level Prosody Control.
Proceedings of the Speech and Computer - 23rd International Conference, 2021

Cross-Lingual Low Resource Speaker Adaptation Using Phonological Features.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Prosodic Clustering for Phoneme-Level Prosody Control in End-to-End Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2021

High Quality Streaming Speech Synthesis with Low, Sentence-Length-Independent Latency.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Video-realistic expressive audio-visual speech synthesis for the Greek language.
Speech Commun., 2017

Affective word ratings for concatenative text-to-speech synthesis.
Proceedings of the 20th Pan-Hellenic Conference on Informatics, 2016

Expressive Speech Synthesis for Storytelling: The INNOETICS' Entry to the Blizzard Challenge 2016.
Proceedings of the Blizzard Challenge 2016, Cupertino, CA, USA, September 16, 2016, 2016

A framework towards expressive speech analysis and synthesis with preliminary results.
J. Multimodal User Interfaces, 2015

Distributed dialogue policies for multi-domain statistical dialogue management.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Improving multiple-crowd-sourced transcriptions using a speech recogniser.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

The use of discriminative belief tracking in POMDP-based dialogue systems.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

An Overview of the ILSP Unit Selection Text-to-Speech Synthesis System.
Proceedings of the Artificial Intelligence: Methods and Applications, 2014

Using Audio Books for Training a Text-to-Speech System.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Evaluation of Statistical POMDP-Based Dialogue Systems in Noisy Environments.
Proceedings of the Situated Dialog in Speech-Based Human-Computer Interaction, 2014

Dialogue context sensitive speech synthesis using factorized decision trees.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Inverse reinforcement learning for micro-turn management.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Incremental on-line adaptation of POMDP-based dialogue managers to extended domains.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Dialogue context sensitive HMM-based speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2014

Open Web-Based Text-to-Speech Services for the Citizens.
Proceedings of the HCI International 2014 - Posters' Extended Abstracts, 2014

Towards expressive speech synthesis: Analysis and modeling of expressive speech.
Proceedings of the 5th IEEE Conference on Cognitive Infocommunications, 2014

The ILSP / INNOETICS Text-to-Speech System for the Blizzard Challenge 2014.
Proceedings of the Blizzard Challenge 2014, Singapore, Singapore, September 19, 2014, 2014

Demonstration of the PARLANCE system: a data-driven incremental, spoken dialogue system for interactive search.
Proceedings of the SIGDIAL 2013 Conference, 2013

POMDP-based dialogue manager adaptation to extended domains.
Proceedings of the SIGDIAL 2013 Conference, 2013

Instantaneous frequency and bandwidth estimation using filterbank arrays.
Proceedings of the IEEE International Conference on Acoustics, 2013

On-line policy optimisation of Bayesian spoken dialogue systems via human interaction.
Proceedings of the IEEE International Conference on Acoustics, 2013

Continuous ASR for flexible incremental dialogue.
Proceedings of the IEEE International Conference on Acoustics, 2013

The ILSP / INNOETICS Text-to-Speech System for the Blizzard Challenge 2013.
Proceedings of the Blizzard Challenge 2013, 2013

IPLR: an online resource for Greek word-level and sublexical information.
Lang. Resour. Evaluation, 2012

N-best error simulation for training spoken dialogue systems.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Discriminative spoken language understanding using word confusion networks.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Policy optimisation of POMDP-based dialogue systems without state space compression.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

The Effect of Cognitive Load on a Statistical Dialogue System.
Proceedings of the SIGDIAL 2012 Conference, 2012

The ILSP Text-to-Speech System for the Blizzard Challenge 2012.
Proceedings of the Blizzard Challenge 2012, Portland, OR, USA, September 14, 2012, 2012

The ILSP Text-to-Speech System for the Blizzard Challenge 2011.
Proceedings of the Blizzard Challenge 2011, Turin, Italy, September 2, 2011, 2011

A unit selection text-to-speech synthesis system optimized for use with screen readers.
IEEE Trans. Consumer Electron., 2010

Spectral Moment Features Augmented by Low Order Cepstral Coefficients for Robust ASR.
IEEE Signal Process. Lett., 2010

One-Class Classification for Spectral Join Cost Calculation in Unit Selection Speech Synthesis.
IEEE Signal Process. Lett., 2010

On the effect of fundamental frequency on amplitude and frequency modulation patterns in speech resonances.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

The ILSP Text-to-Speech System for the Blizzard Challenge 2010.
Proceedings of the Blizzard Challenge 2010, Kansai Science City, Japan, September 25, 2010, 2010

Embedded unit selection text-to-speech synthesis for mobile devices.
IEEE Trans. Consumer Electron., 2009

User Interaction Design for a Home-Based Telecare System.
Proceedings of the HCI and Usability for e-Inclusion, 2009

Enhancing Accessibility of Web Content for the Print-Impaired and Blind People.
Proceedings of the HCI and Usability for e-Inclusion, 2009

Corpus Design for a Unit Selection TtS System with Application to Bulgarian.
Proceedings of the Human Language Technology. Challenges for Computer Science and Linguistics, 2009

Statistical analysis of amplitude modulation in speech signals using an AM-FM model.
Proceedings of the IEEE International Conference on Acoustics, 2009

Short-time instantaneous frequency and bandwidth features for speech recognition.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

HMM-Based Speech Synthesis for the Greek Language.
Proceedings of the Text, Speech and Dialogue, 11th International Conference, 2008

A statistical method for database reduction for embedded unit selection speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2008

All Greek to me! An automatic Greeklish to Greek transliteration system.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Formant estimation of speech signals using subspace-based spectral analysis.
Proceedings of the 14th European Signal Processing Conference, 2006

On the use of a decimative spectral estimation method based on eigenanalysis and SVD for formant and bandwidth tracking of speech signals.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Rule-based grapheme-to-phoneme method for the Greek.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Bypassing Greeklish!
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004
