Yannis Stylianou

According to our database1, Yannis Stylianou authored at least 192 papers between 1993 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Awards

IEEE Fellow

IEEE Fellow 2017, "For contributions to speech analysis and communication".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Memory Efficient Neural Speech Synthesis Based on FastSpeech2 Using Attention Free Transformer.
Proceedings of the 32nd European Signal Processing Conference, 2024

2023
Cumulant GAN.
IEEE Trans. Neural Networks Learn. Syst., November, 2023

2022
End-to-End Neural Based Modification of Noisy Speech for Speech-in-Noise Intelligibility Improvement.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Vocal effort modeling in neural TTS for improving the intelligibility of synthetic speech in noise.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
Combining speakers of multiple languages to improve quality of neural voices.
Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021

Assessing Speaker Interpolation in Neural Text-to-Speech.
Proceedings of the Speech and Computer - 23rd International Conference, 2021

A Universal Multi-Speaker Multi-Style Text-to-Speech via Disentangled Representation Learning Based on Rényi Divergence Minimization.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Audiovisual Speech Synthesis using Tacotron2.
Proceedings of the ICMI '21: International Conference on Multimodal Interaction, 2021

2020
Evaluating the Intelligibility Benefits of Neural Speech Enrichment for Listeners with Normal Hearing and Hearing Impairment using the Greek Harvard Corpus.
CoRR, 2020

Audiovisual Speech Synthesis using Tacotron2.
CoRR, 2020

A fully recurrent feature extraction for single channel speech enhancement.
CoRR, 2020

Enhancing Speech Intelligibility in Text-To-Speech Synthesis Using Speaking Style Conversion.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Speaker Conditional WaveRNN: Towards Universal Neural Vocoder for Unseen Speaker and Recording Conditions.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019
Enriched communication across the lifespan.
Proces. del Leng. Natural, 2019

Neural Text-to-Speech Adaptation from Low Quality Public Recordings.
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

A Non-Causal FFTNet Architecture for Speech Enhancement.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Non-Parallel Voice Conversion Using Weighted Generative Adversarial Networks.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Speech Enhancement for Noise-Robust Speech Synthesis Using Wasserstein GAN.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

An Unsupervised Learning Approach to Neural-net-supported Wpe Dereverberation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Connections between Reassigned Spectrum and Least Squares Estimation for Sinusoidal Models.
Proceedings of the 27th European Signal Processing Conference, 2019

Training Generative Adversarial Networks With Weights.
Proceedings of the 27th European Signal Processing Conference, 2019

2018
Automatic acoustic detection of birds through deep learning: the first Bird Audio Detection challenge.
CoRR, 2018

A study of time-frequency features for CNN-based automatic heart sound classification for pathology detection.
Comput. Biol. Medicine, 2018

Prediction of Dialogue Success with Spectral and Rhythm Acoustic Features Using DNNS and SVMS.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Spoken Dialogue for Information Navigation.
Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue, 2018

Speech Intelligibility Enhancement Based on a Non-causal Wavenet-like Model.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

A Case Study on the Importance of Belief State Representation for Dialogue Policy Management.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Weighting Time-Frequency Representation of Speech Using Auditory Saliency for Automatic Speech Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Adaptation of an Expressive Single Speaker Deep Neural Network Speech Synthesis System.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Towards Scalable Information-Seeking Multi-Domain Dialogue.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

ON the Use of Wavenet as a Statistical Vocoder.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

On Finding the Relevant User Reviews for Advancing Conversational Faceted Search.
Proceedings of 4th Workshop on Sentic Computing, 2018

2017
LD-SDS: Towards an Expressive Spoken Dialogue System based on Linked-Data.
CoRR, 2017

Domain Complexity and Policy Learning in Task-Oriented Dialogue Systems.
Proceedings of the Advanced Social Interaction with Agents, 2017

Single-Model Multi-domain Dialogue Management with Deep Learning.
Proceedings of the Advanced Social Interaction with Agents, 2017

On the Quality and Intelligibility of Noisy Speech Processed for Near-End Listening Enhancement.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Improved Automatic Speech Recognition Using Subband Temporal Envelope Features and Time-Delay Neural Network Denoising Autoencoder.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Adaptive gain control and time warp for enhanced speech intelligibility under reverberation.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Expressive visual text to speech and expression adaptation using deep neural networks.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Predicting dialogue success, naturalness, and length with acoustic features.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Effective emotion recognition in movie audio tracks.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Near and Far Field Speech-in-Noise Intelligibility Improvements Based on a Time-Frequency Energy Reallocation Approach.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Global Variance in Speech Synthesis With Linear Dynamical Models.
IEEE Signal Process. Lett., 2016

Adaptive Gain Control for Enhanced Speech Intelligibility Under Reverberation.
IEEE Signal Process. Lett., 2016

Voice Activity Detection: Merging Source and Filter-based Information.
IEEE Signal Process. Lett., 2016

Advances in phase-aware signal processing in speech communication.
Speech Commun., 2016

Bird detection in audio: A survey and a challenge.
Proceedings of the 26th IEEE International Workshop on Machine Learning for Signal Processing, 2016

Effectiveness of Near-End Speech Enhancement Under Equal-Loudness and Equal-Level Constraints.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Generalizing Steady State Suppression for Enhanced Intelligibility Under Reverberation.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Automated Pause Insertion for Improved Intelligibility Under Reverberation.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Modulation Enhancement of Temporal Envelopes for Increasing Speech Intelligibility in Noise.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Multi-stream spectral representation for statistical parametric speech synthesis.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Iterative estimation of phase using complex cepstrum representation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

High-resolution sinusoidal modeling of unvoiced speech.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Initial investigation of speech synthesis based on complex-valued neural networks.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Learning Domain-Independent Dialogue Policies via Ontology Parameterisation.
Proceedings of the SIGDIAL 2015 Conference, 2015

A fast algorithm for improved intelligibility of speech-in-noise based on frequency and time domain energy reallocation.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Towards a linear dynamical model based speech synthesizer.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

A maximum likelihood approach to the detection of moments of maximum excitation and its application to high-quality speech parameterization.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Intelligibility enhancement of casual speech for reverberant environments inspired by clear speech properties.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Fusion of multiple parameterisations for DNN-based sinusoidal speech synthesis with multi-task learning.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Fast and accurate phase unwrapping.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Methods for applying dynamic sinusoidal models to statistical parametric speech synthesis.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Improved face-to-face communication using noise reduction and speech intelligibility enhancement.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Robust excitation-based features for Automatic Speech Recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Enhancing the intelligibility of statistically generated synthetic speech by means of noise-independent modifications.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Fast Inter-Harmonic Reconstruction for Spectral Envelope Estimation in High-Pitched Voices.
IEEE Signal Process. Lett., 2014

Maximum Voiced Frequency Estimation: Exploiting Amplitude and Phase Spectra.
IEEE Signal Process. Lett., 2014

Approaching speech intelligibility enhancement with inspiration from Lombard and Clear speaking styles.
Comput. Speech Lang., 2014

Introduction to the Special Issue on The listening talker: context-dependent speech production and perception.
Comput. Speech Lang., 2014

On spectral and time domain energy reallocation for speech-in-noise intelligibility enhancement.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Emotional speech classification using adaptive sinusoidal modelling.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Phase importance in speech processing applications.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

The importance of phase on voice quality assessment.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

An investigation of the application of dynamic sinusoidal models to statistical parametric speech synthesis.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Cluster adaptive training of average voice models.
Proceedings of the IEEE International Conference on Acoustics, 2014

Linear dynamical models in speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2014

Complex cepstrum factorization for statistical parametric synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2014

Simple and artefact-free spectral modifications for enhancing the intelligibility of casual speech.
Proceedings of the IEEE International Conference on Acoustics, 2014

Robust full-band adaptive Sinusoidal analysis and synthesis of speech.
Proceedings of the IEEE International Conference on Acoustics, 2014

Pitch modifications of speech based on an adaptive Harmonic Model.
Proceedings of the IEEE International Conference on Acoustics, 2014

A fixed dimension and perceptually based dynamic sinusoidal model of speech.
Proceedings of the IEEE International Conference on Acoustics, 2014

Analysis of emotional speech using an adaptive sinusoidal model.
Proceedings of the 22nd European Signal Processing Conference, 2014

2013
Analysis and Synthesis of Speech Using an Adaptive Full-Band Harmonic Model.
IEEE Trans. Speech Audio Process., 2013

Evaluating the intelligibility benefit of speech modifications in known noise conditions.
Speech Commun., 2013

Evaluating how well filtered white noise models the residual from sinusoidal modeling of musical instrument sounds.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Photo-realistic expressive text to talking head synthesis.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Combining perceptually-motivated spectral shaping with loudness and duration modification for intelligibility enhancement of HMM-based synthetic speech in noise.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Minimum mean squared error based warped complex cepstrum analysis for statistical parametric speech synthesis.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Increasing speech intelligibility via spectral shaping with frequency warping and dynamic range compression plus transient enhancement.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Linking loudness increases in normal and lombard speech to decreasing vowel formant separation.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Assessing the intelligibility impact of vowel space expansion via clear speech-inspired frequency warping.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Statistical synthesizer with embedded prosodic and spectral modifications to generate highly intelligible speech in noise.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Improving intelligibility in noise of HMM-generated speech via noise-dependent and -independent methods.
Proceedings of the IEEE International Conference on Acoustics, 2013

Automatic classification of systolic heart murmurs.
Proceedings of the IEEE International Conference on Acoustics, 2013

Time-scale modifications based on a full-band adaptive harmonic model.
Proceedings of the IEEE International Conference on Acoustics, 2013

Adaptive sinusoidal modeling of percussive musical instrument sounds.
Proceedings of the 21st European Signal Processing Conference, 2013

2012
Automatic glottal segmentation using local-based active contours and application to glottovibrography.
Speech Commun., 2012

Speech-in-noise intelligibility improvement based on spectral shaping and dynamic range compression.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Can modified casual speech reach the intelligibility of clear speech?
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

On the Modeling of Voiceless Stop Sounds of Speech using Adaptive Quasi-Harmonic Models.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Unsupervised Acoustic Analyses of Normal and Lombard Speech, with Spectral Envelope Transformation to Improve Intelligibility.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Implementation of Simple Spectral Techniques to Enhance the Intelligibility of Speech using a Harmonic Model.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

A full-band adaptive harmonic representation of speech.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

An extension of the adaptive Quasi-Harmonic Model.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Speech-in-noise intelligibility improvement based on power recovery and dynamic range compression.
Proceedings of the 20th European Signal Processing Conference, 2012

2011
Adaptive AM-FM Signal Decomposition With Application to Speech Analysis.
IEEE Trans. Speech Audio Process., 2011

Voice Pathology Detection and Discrimination Based on Modulation Spectral Features.
IEEE Trans. Speech Audio Process., 2011

Scale Transform in Rhythmic Similarity of Music.
IEEE Trans. Speech Audio Process., 2011

Discrimination of speech from nonspeeech in broadcast news based on modulation frequency features.
Speech Commun., 2011

Tremor in speakers with spasmodic dysphonia.
Proceedings of the 7th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, 2011

A Hybrid Quasi-Harmonic/CELP Wideband Speech Coding Scheme for Unit Selection TTS Synthesis.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Glottal inverse filtering using stabilised weighted linear prediction.
Proceedings of the IEEE International Conference on Acoustics, 2011

ON the recovery of time-varying spectral envelope information from AQHM-derived spectra.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Introduction to the Special Section on Voice Transformation.
IEEE Trans. Speech Audio Process., 2010

Three Dimensions of Pitched Instrument Onset Detection.
IEEE Trans. Speech Audio Process., 2010

Auditory Spectrum-Based Pitched Instrument Onset Detection.
IEEE Trans. Speech Audio Process., 2010

Reply to "Comments on 'Iterative Estimation of Sinusoidal Signal Parameters'".
IEEE Signal Process. Lett., 2010

Iterative Estimation of Sinusoidal Signal Parameters.
IEEE Signal Process. Lett., 2010

Parataxis: Morphological Similarity in Traditional Music.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Fast least-squares solution for sinusoidal, harmonic and quasi-harmonic models.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

A factorial sparse coder model for single channel source separation.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Analysis/synthesis of speech based on an adaptive quasi-harmonic plus noise model.
Proceedings of the IEEE International Conference on Acoustics, 2010

On the robustness of the Quasi-Harmonic model of speech.
Proceedings of the IEEE International Conference on Acoustics, 2010

Dysphonia detection based on modulation spectral features and cepstral coefficients.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Wrapped Gaussian Mixture Models for Modeling and High-Rate Quantization of Phase Data of Speech.
IEEE Trans. Speech Audio Process., 2009

Spectral jitter modeling and estimation.
Biomed. Signal Process. Control., 2009

A novel method for the extraction of vocal tremor.
Proceedings of the Sixth International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, 2009

Modulation spectral features for objective voice quality assessment: the breathiness case.
Proceedings of the Sixth International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, 2009

Rhythmic Similarity in Traditional Turkish Music.
Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009

Pitched Instrument Onset Detection based on Auditory Spectra.
Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009

AM-FM estimation for speech based on a time-varying sinusoidal model.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Normalized modulation spectral features for cross-database voice pathology detection.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Voice Transformation: A survey.
Proceedings of the IEEE International Conference on Acoustics, 2009

Chirp rate estimation of speech based on a time-varying quasi-harmonic model.
Proceedings of the IEEE International Conference on Acoustics, 2009

A scale transform based method for rhythmic similarity of music.
Proceedings of the IEEE International Conference on Acoustics, 2009

Video and audio based detection of filled hesitation pauses in classroom lectures.
Proceedings of the 17th European Signal Processing Conference, 2009

Evaluation of modulation frequency features for speaker verification and identification.
Proceedings of the 17th European Signal Processing Conference, 2009

2008
Musical Genre Classification Using Nonnegative Matrix Factorization-Based Features.
IEEE Trans. Speech Audio Process., 2008

Beat Tracking using Group Delay Based Onset Detection.
Proceedings of the ISMIR 2008, 2008

On the properties of a time-varying quasi-harmonic model of speech.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Dimensionality reduction of modulation frequency features for speech discrimination.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Singing voice detection using modulation frequency feature.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2008

Improving the modeling of the noise part in the harmonic plus noise model of speech.
Proceedings of the IEEE International Conference on Acoustics, 2008

Rhythmic similarity of music based on dynamic periodicity warping.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Conditional Vector Quantization for Speech Coding.
IEEE Trans. Speech Audio Process., 2007

A mathematical model for accurate measurement of jitter.
Proceedings of the Fifth International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, 2007

Speech-nonspeech discrimination using the information bottleneck method and spectro-temporal modulation index.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Bit-erasure channel decoding for GMM-based multiple description coding.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

The harmonic model codec (HMC) framework for voIP.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Conditional Vector Quantization for Voice Conversion.
Proceedings of the IEEE International Conference on Acoustics, 2007

A Statistical Approach to Musical Genre Classification using Non-Negative Matrix Factorization.
Proceedings of the IEEE International Conference on Acoustics, 2007

Towards a Voice Conversion System Based on Frame Selection.
Proceedings of the IEEE International Conference on Acoustics, 2007

Stochastic Modeling and Quantization of Harmonic Phases in Speech using Wrapped Gaussian Mixture Models.
Proceedings of the IEEE International Conference on Acoustics, 2007

Speech - Nonspeech discrimination based on speech-relevant spectrogram modulations.
Proceedings of the 15th European Signal Processing Conference, 2007

2006
Fast Analysis/Synthesis of Harmonic Signals.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
On the Detection of Discontinuities in Concatenative Speech Synthesis.
Proceedings of the Progress in Nonlinear Speech Processing, 2005

Extraction of Speech-Relevant Information from Modulation Spectrograms.
Proceedings of the Progress in Nonlinear Speech Processing, 2005

Discontinuity detection in concatenated speech synthesis based on nonlinear speech analysis.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Coding with Side Information Techniques for LSF Reconstruction in Voice Over IP.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Biometrics: Different Approaches for Using Gaussian Mixture Models in Handwriting.
Proceedings of the Communications and Multimedia Security, 2005

2004
Modeling Speech Based on Harmonic Plus Noise Models.
Proceedings of the Nonlinear Speech Modeling and Applications, 2004

Nonlinear Speech Features for the Objective Detection of Discontinuities in Concatenative Speech Synthesis.
Proceedings of the Nonlinear Speech Modeling and Applications, 2004

Combined estimation/coding of highband spectral envelopes for speech spectrum expansion.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2001
Removing linear phase mismatches in concatenative speech synthesis.
IEEE Trans. Speech Audio Process., 2001

Applying the harmonic plus noise model in concatenative speech synthesis.
IEEE Trans. Speech Audio Process., 2001

Perceptual and objective detection of discontinuities in concatenative speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2001

2000
A simple and fast way of generating a harmonic signal.
IEEE Signal Process. Lett., 2000

Corpus-based techniques in the AT&t nextgen synthesis system.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Multimodal Speech Synthesis.
Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000

On the implementation of the harmonic plus noise model for concatenative speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2000

Stochastic modeling of spectral adjustment for high quality pitch modification.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
Single complex sinusoid and ARHE model based pitch extractors.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Synchronization of speech frames based on phase data with application to concatenative speech synthesis.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Detection of non-stationarity in speech signals and its application to time-scaling.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Assessment and correction of voice quality variabilities in large speech databases for concatenative speech synthesis.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998
Continuous probabilistic transform for voice conversion.
IEEE Trans. Speech Audio Process., 1998

Removing phase mismatches in concatenative speech synthesis.
Proceedings of the Third ESCA/COCOSDA Workshop on Speech Synthesis, 1998

Concatenative speech synthesis using a harmonic plus noise model.
Proceedings of the Third ESCA/COCOSDA Workshop on Speech Synthesis, 1998

Real time voice alteration based on linear prediction.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Exploration of acoustic correlates in speaker selection for concatenative synthesis.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

TD-PSOLA versus harmonic plus noise model in diphone based speech synthesis.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

A system for voice conversion based on probabilistic classification and a harmonic plus noise model.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Quantization of the spectral envelope for sinusoidal coders.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
Diphone concatenation using a harmonic plus noise model of speech.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

1996
Efficient Decomposition of Speech Signals Into a Deterministic and a Stochastic Part.
Proceedings of the Fourth International Symposium on Signal Processing and Its Applications, 1996

Decomposition of speech signals into a deterministic and a stochastic part.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

On the transformation of the speech spectrum for voice conversion.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Decomposition of speech signals into a periodic and non-periodic part based on sinusoidal models.
Proceedings of Third International Conference on Electronics, Circuits, and Systems, 1996

1995
High-quality speech modification based on a harmonic + noise model.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Statistical methods for voice quality transformation.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

1993
HNS: Speech modification based on a harmonic+noise model.
Proceedings of the IEEE International Conference on Acoustics, 1993


  Loading...