Yannis Stylianou

Anna Sfakianaki

Theognosia Chimona

CoRR, 2020

Audiovisual Speech Synthesis using Tacotron2.

[BibT_eX]

[DOI]

Ahmed Hussen Abdelaziz

Anushree Prasanna Kumar

CoRR, 2020

A fully recurrent feature extraction for single channel speech enhancement.

[BibT_eX]

[DOI]

Santelli Claudio

CoRR, 2020

Enhancing Speech Intelligibility in Text-To-Speech Synthesis Using Speaking Style Conversion.

[BibT_eX]

[DOI]

Dipjyoti Paul

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Speaker Conditional WaveRNN: Towards Universal Neural Vocoder for Unseen Speaker and Recording Conditions.

[BibT_eX]

[DOI]

Dipjyoti Paul

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019

Enriched communication across the lifespan.

[BibT_eX]

[DOI]

Proces. del Leng. Natural, 2019

Neural Text-to-Speech Adaptation from Low Quality Public Recordings.

[BibT_eX]

[DOI]

Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

A Non-Causal FFTNet Architecture for Speech Enhancement.

[BibT_eX]

[DOI]

Nagaraj Adiga

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Non-Parallel Voice Conversion Using Weighted Generative Adversarial Networks.

[BibT_eX]

[DOI]

Dipjyoti Paul

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Speech Enhancement for Noise-Robust Speech Synthesis Using Wasserstein GAN.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

An Unsupervised Learning Approach to Neural-net-supported Wpe Dereverberation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Connections between Reassigned Spectrum and Least Squares Estimation for Sinusoidal Models.

[BibT_eX]

[DOI]

Proceedings of the 27th European Signal Processing Conference, 2019

Training Generative Adversarial Networks With Weights.

[BibT_eX]

[DOI]

Proceedings of the 27th European Signal Processing Conference, 2019

2018

Automatic acoustic detection of birds through deep learning: the first Bird Audio Detection challenge.

[BibT_eX]

[DOI]

CoRR, 2018

A study of time-frequency features for CNN-based automatic heart sound classification for pathology detection.

[BibT_eX]

[DOI]

Baris Bozkurt

Ioannis Germanakis

Comput. Biol. Medicine, 2018

Prediction of Dialogue Success with Spectral and Rhythm Acoustic Features Using DNNS and SVMS.

[BibT_eX]

[DOI]

Athanasios Lykartsis

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Spoken Dialogue for Information Navigation.

[BibT_eX]

[DOI]

Panagiotis Papadakos

Yannis Tzitzikas

Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue, 2018

Speech Intelligibility Enhancement Based on a Non-causal Wavenet-like Model.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

A Case Study on the Importance of Belief State Representation for Dialogue Policy Management.

[BibT_eX]

[DOI]

Michail Lagoudakis

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Weighting Time-Frequency Representation of Speech Using Auditory Saliency for Automatic Speech Recognition.

[BibT_eX]

[DOI]

Cong-Thanh Do

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Adaptation of an Expressive Single Speaker Deep Neural Network Speech Synthesis System.

[BibT_eX]

[DOI]

Jonathan Parker

Roberto Cipolla

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Towards Scalable Information-Seeking Multi-Domain Dialogue.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

ON the Use of Wavenet as a Statistical Vocoder.

[BibT_eX]

[DOI]

Nagaraj Adiga

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

On Finding the Relevant User Reviews for Advancing Conversational Faceted Search.

[BibT_eX]

[DOI]

Eleftherios Dimitrakis

Konstantinos Sgontzos

Panagiotis Papadakos

Yannis Marketakis

Yannis Tzitzikas

Proceedings of 4th Workshop on Sentic Computing, 2018

2017

LD-SDS: Towards an Expressive Spoken Dialogue System based on Linked-Data.

[BibT_eX]

[DOI]

CoRR, 2017

Domain Complexity and Policy Learning in Task-Oriented Dialogue Systems.

[BibT_eX]

[DOI]

Stefan Ultes

Proceedings of the Advanced Social Interaction with Agents, 2017

Single-Model Multi-domain Dialogue Management with Deep Learning.

[BibT_eX]

[DOI]

Proceedings of the Advanced Social Interaction with Agents, 2017

On the Quality and Intelligibility of Noisy Speech Processed for Near-End Listening Enhancement.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Improved Automatic Speech Recognition Using Subband Temporal Envelope Features and Time-Delay Neural Network Denoising Autoencoder.

[BibT_eX]

[DOI]

Cong-Thanh Do

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Adaptive gain control and time warp for enhanced speech intelligibility under reverberation.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Expressive visual text to speech and expression adaptation using deep neural networks.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Predicting dialogue success, naturalness, and length with acoustic features.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Effective emotion recognition in movie audio tracks.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016

Near and Far Field Speech-in-Noise Intelligibility Improvements Based on a Time-Frequency Energy Reallocation Approach.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2016

Global Variance in Speech Synthesis With Linear Dynamical Models.

[BibT_eX]

[DOI]

Vassilios Digalakis

IEEE Signal Process. Lett., 2016

Adaptive Gain Control for Enhanced Speech Intelligibility Under Reverberation.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2016

Voice Activity Detection: Merging Source and Filter-based Information.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2016

Advances in phase-aware signal processing in speech communication.

[BibT_eX]

[DOI]

Pejman Mowlaee

Rahim Saeidi

Speech Commun., 2016

Bird detection in audio: A survey and a challenge.

[BibT_eX]

[DOI]

Proceedings of the 26th IEEE International Workshop on Machine Learning for Signal Processing, 2016

Effectiveness of Near-End Speech Enhancement Under Equal-Loudness and Equal-Level Constraints.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Generalizing Steady State Suppression for Enhanced Intelligibility Under Reverberation.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Automated Pause Insertion for Improved Intelligibility Under Reverberation.

[BibT_eX]

[DOI]

Norbert Braunschweiler

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Modulation Enhancement of Temporal Envelopes for Increasing Speech Intelligibility in Noise.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Multi-stream spectral representation for statistical parametric speech synthesis.

[BibT_eX]

[DOI]

Kayoko Yanagisawa

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Iterative estimation of phase using complex cepstrum representation.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

High-resolution sinusoidal modeling of unvoiced speech.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Initial investigation of speech synthesis based on complex-valued neural networks.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Learning Domain-Independent Dialogue Policies via Ontology Parameterisation.

[BibT_eX]

[DOI]

Proceedings of the SIGDIAL 2015 Conference, 2015

A fast algorithm for improved intelligibility of speech-in-noise based on frequency and time domain energy reallocation.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Towards a linear dynamical model based speech synthesizer.

[BibT_eX]

[DOI]

Vassilios Tsiaras

Vassilios Digalakis

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

A maximum likelihood approach to the detection of moments of maximum excitation and its application to high-quality speech parameterization.

[BibT_eX]

[DOI]

Masami Akamine

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Intelligibility enhancement of casual speech for reverberant environments inspired by clear speech properties.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Fusion of multiple parameterisations for DNN-based sinusoidal speech synthesis with multi-task learning.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Fast and accurate phase unwrapping.

[BibT_eX]

[DOI]

Thomas Drugman

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Methods for applying dynamic sinusoidal models to statistical parametric speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Improved face-to-face communication using noise reduction and speech intelligibility enhancement.

[BibT_eX]

[DOI]

Anthony Griffin

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Robust excitation-based features for Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014

Enhancing the intelligibility of statistically generated synthetic speech by means of noise-independent modifications.

[BibT_eX]

[DOI]

Daniel Erro

IEEE ACM Trans. Audio Speech Lang. Process., 2014

Fast Inter-Harmonic Reconstruction for Spectral Envelope Estimation in High-Pitched Voices.

[BibT_eX]

[DOI]

Thomas Drugman

IEEE Signal Process. Lett., 2014

Maximum Voiced Frequency Estimation: Exploiting Amplitude and Phase Spectra.

[BibT_eX]

[DOI]

Thomas Drugman

IEEE Signal Process. Lett., 2014

Approaching speech intelligibility enhancement with inspiration from Lombard and Clear speaking styles.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2014

Introduction to the Special Issue on The listening talker: context-dependent speech production and perception.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2014

On spectral and time domain energy reallocation for speech-in-noise intelligibility enhancement.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Emotional speech classification using adaptive sinusoidal modelling.

[BibT_eX]

[DOI]

Theodora Yakoumaki

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Phase importance in speech processing applications.

[BibT_eX]

[DOI]

Pejman Mowlaee

Rahim Saeidi

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

The importance of phase on voice quality assessment.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

An investigation of the application of dynamic sinusoidal models to statistical parametric speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Cluster adaptive training of average voice models.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Linear dynamical models in speech synthesis.

[BibT_eX]

[DOI]

Vassilios Tsiaras

Vassilios Digalakis

Proceedings of the IEEE International Conference on Acoustics, 2014

Complex cepstrum factorization for statistical parametric synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Simple and artefact-free spectral modifications for enhancing the intelligibility of casual speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Robust full-band adaptive Sinusoidal analysis and synthesis of speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Pitch modifications of speech based on an adaptive Harmonic Model.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

A fixed dimension and perceptually based dynamic sinusoidal model of speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Analysis of emotional speech using an adaptive sinusoidal model.

[BibT_eX]

[DOI]

Theodora Yakoumaki

Proceedings of the 22nd European Signal Processing Conference, 2014

2013

Analysis and Synthesis of Speech Using an Adaptive Full-Band Harmonic Model.

[BibT_eX]

[DOI]

Gilles Degottex

Cassia Valentini-Botinhao

IEEE Trans. Speech Audio Process., 2013

Evaluating the intelligibility benefit of speech modifications in known noise conditions.

[BibT_eX]

[DOI]

Martin Cooke

Catherine Mayo

Bastian Sauert

Yan Tang

Speech Commun., 2013

Evaluating how well filtered white noise models the residual from sinusoidal modeling of musical instrument sounds.

[BibT_eX]

[DOI]

Marcelo F. Caetano

Gilles Degottex

Cassia Valentini-Botinhao

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Photo-realistic expressive text to talking head synthesis.

[BibT_eX]

[DOI]

Vincent Wan

Robert Anderson

Art Blokland

Norbert Braunschweiler

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Combining perceptually-motivated spectral shaping with loudness and duration modification for intelligibility enhancement of HMM-based synthetic speech in noise.

[BibT_eX]

[DOI]

Junichi Yamagishi

Simon King

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Minimum mean squared error based warped complex cepstrum analysis for statistical parametric speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Increasing speech intelligibility via spectral shaping with frequency warping and dynamic range compression plus transient enhancement.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Linking loudness increases in normal and lombard speech to decreasing vowel formant separation.

[BibT_eX]

[DOI]

Catherine Mayo

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Assessing the intelligibility impact of vowel space expansion via clear speech-inspired frequency warping.

[BibT_eX]

[DOI]

Cassia Valentini-Botinhao

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Statistical synthesizer with embedded prosodic and spectral modifications to generate highly intelligible speech in noise.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Improving intelligibility in noise of HMM-generated speech via noise-dependent and -independent methods.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Automatic classification of systolic heart murmurs.

[BibT_eX]

[DOI]

Ioannis Germanakis

Proceedings of the IEEE International Conference on Acoustics, 2013

Time-scale modifications based on a full-band adaptive harmonic model.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Adaptive sinusoidal modeling of percussive musical instrument sounds.

[BibT_eX]

[DOI]

Marcelo F. Caetano

Proceedings of the 21st European Signal Processing Conference, 2013

2012

Automatic glottal segmentation using local-based active contours and application to glottovibrography.

[BibT_eX]

[DOI]

Sevasti-Zoi Karakozoglou

Nathalie Henrich

Christophe d'Alessandro

Speech Commun., 2012

Speech-in-noise intelligibility improvement based on spectral shaping and dynamic range compression.

[BibT_eX]

[DOI]

Varvara Kandia

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Can modified casual speech reach the intelligibility of clear speech?

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

On the Modeling of Voiceless Stop Sounds of Speech using Adaptive Quasi-Harmonic Models.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Unsupervised Acoustic Analyses of Normal and Lombard Speech, with Spectral Envelope Transformation to Improve Intelligibility.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Implementation of Simple Spectral Techniques to Enhance the Intelligibility of Speech using a Harmonic Model.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

A full-band adaptive harmonic representation of speech.

[BibT_eX]

[DOI]

Gilles Degottex

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

An extension of the adaptive Quasi-Harmonic Model.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Speech-in-noise intelligibility improvement based on power recovery and dynamic range compression.

[BibT_eX]

[DOI]

Varvara Kandia

Proceedings of the 20th European Signal Processing Conference, 2012

2011

Adaptive AM-FM Signal Decomposition With Application to Speech Analysis.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2011

Voice Pathology Detection and Discrimination Based on Modulation Spectral Features.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2011

Scale Transform in Rhythmic Similarity of Music.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2011

Discrimination of speech from nonspeeech in broadcast news based on modulation frequency features.

[BibT_eX]

[DOI]

Speech Commun., 2011

Tremor in speakers with spasmodic dysphonia.

[BibT_eX]

[DOI]

Philippe H. Dejonckere

Proceedings of the 7th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, 2011

A Hybrid Quasi-Harmonic/CELP Wideband Speech Coding Scheme for Unit Selection TTS Synthesis.

[BibT_eX]

[DOI]

Chang-Heon Lee

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Glottal inverse filtering using stabilised weighted linear prediction.

[BibT_eX]

[DOI]

Paavo Alku

Proceedings of the IEEE International Conference on Acoustics, 2011

ON the recovery of time-varying spectral envelope information from AQHM-derived spectra.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

Introduction to the Special Section on Voice Transformation.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2010

Three Dimensions of Pitched Instrument Onset Detection.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2010

Auditory Spectrum-Based Pitched Instrument Onset Detection.

[BibT_eX]

[DOI]

Emmanouil Benetos

IEEE Trans. Speech Audio Process., 2010

Reply to "Comments on 'Iterative Estimation of Sinusoidal Signal Parameters'".

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2010

Iterative Estimation of Sinusoidal Signal Parameters.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2010

Parataxis: Morphological Similarity in Traditional Music.

[BibT_eX]

[DOI]

Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Fast least-squares solution for sinusoidal, harmonic and quasi-harmonic models.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

A factorial sparse coder model for single channel source separation.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Analysis/synthesis of speech based on an adaptive quasi-harmonic plus noise model.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

On the robustness of the Quasi-Harmonic model of speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Dysphonia detection based on modulation spectral features and cepstral coefficients.

[BibT_eX]

[DOI]

Juan Ignacio Godino-Llorente

Julián D. Arias-Londoño

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Wrapped Gaussian Mixture Models for Modeling and High-Rate Quantization of Phase Data of Speech.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2009

Spectral jitter modeling and estimation.

[BibT_eX]

[DOI]

Miltiadis Vasilakis

Biomed. Signal Process. Control., 2009

A novel method for the extraction of vocal tremor.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, 2009

Modulation spectral features for objective voice quality assessment: the breathiness case.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, 2009

Rhythmic Similarity in Traditional Turkish Music.

[BibT_eX]

[DOI]

Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009

Pitched Instrument Onset Detection based on Auditory Spectra.

[BibT_eX]

[DOI]

Emmanouil Benetos

Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009

AM-FM estimation for speech based on a time-varying sinusoidal model.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Normalized modulation spectral features for cross-database voice pathology detection.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Voice Transformation: A survey.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

Chirp rate estimation of speech based on a time-varying quasi-harmonic model.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

A scale transform based method for rhythmic similarity of music.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

Video and audio based detection of filled hesitation pauses in classroom lectures.

[BibT_eX]

[DOI]

Vassilios Tsiaras

Costas Panagiotakis

Proceedings of the 17th European Signal Processing Conference, 2009

Evaluation of modulation frequency features for speaker verification and identification.

[BibT_eX]

[DOI]

Proceedings of the 17th European Signal Processing Conference, 2009

2008

Musical Genre Classification Using Nonnegative Matrix Factorization-Based Features.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2008

Beat Tracking using Group Delay Based Onset Detection.

[BibT_eX]

[DOI]

Proceedings of the ISMIR 2008, 2008

On the properties of a time-varying quasi-harmonic model of speech.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Dimensionality reduction of modulation frequency features for speech discrimination.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Singing voice detection using modulation frequency feature.

[BibT_eX]

[DOI]

Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2008

Improving the modeling of the noise part in the harmonic plus noise model of speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

Rhythmic similarity of music based on dynamic periodicity warping.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

2007

Conditional Vector Quantization for Speech Coding.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2007

A mathematical model for accurate measurement of jitter.

[BibT_eX]

[DOI]

Miltiadis Vasilakis

Proceedings of the Fifth International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, 2007

Speech-nonspeech discrimination using the information bottleneck method and spectro-temporal modulation index.

[BibT_eX]

[DOI]

Michael Wohlmayr

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Bit-erasure channel decoding for GMM-based multiple description coding.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

The harmonic model codec (HMC) framework for voIP.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Conditional Vector Quantization for Voice Conversion.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

A Statistical Approach to Musical Genre Classification using Non-Negative Matrix Factorization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

Towards a Voice Conversion System Based on Frame Selection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

Stochastic Modeling and Quantization of Harmonic Phases in Speech using Wrapped Gaussian Mixture Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

Speech - Nonspeech discrimination based on speech-relevant spectrogram modulations.

[BibT_eX]

[DOI]

Michael Wohlmayr

Proceedings of the 15th European Signal Processing Conference, 2007

2006

Fast Analysis/Synthesis of Harmonic Signals.

[BibT_eX]

[DOI]

Miltiadis Vasilakis

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005

On the Detection of Discontinuities in Concatenative Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Progress in Nonlinear Speech Processing, 2005

Extraction of Speech-Relevant Information from Modulation Spectrograms.

[BibT_eX]

[DOI]

Michael Wohlmayr

Proceedings of the Progress in Nonlinear Speech Processing, 2005

Discontinuity detection in concatenated speech synthesis based on nonlinear speech analysis.

[BibT_eX]

[DOI]

Esther Klabbers

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Coding with Side Information Techniques for LSF Reconstruction in Voice Over IP.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Biometrics: Different Approaches for Using Gaussian Mixture Models in Handwriting.

[BibT_eX]

[DOI]

Sascha Schimke

Athanasios Valsamakis

Claus Vielhauer

Proceedings of the Communications and Multimedia Security, 2005

2004

Modeling Speech Based on Harmonic Plus Noise Models.

[BibT_eX]

[DOI]

Proceedings of the Nonlinear Speech Modeling and Applications, 2004

Nonlinear Speech Features for the Objective Detection of Discontinuities in Concatenative Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Nonlinear Speech Modeling and Applications, 2004

Combined estimation/coding of highband spectral envelopes for speech spectrum expansion.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2001

Removing linear phase mismatches in concatenative speech synthesis.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2001

Applying the harmonic plus noise model in concatenative speech synthesis.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2001

Perceptual and objective detection of discontinuities in concatenative speech synthesis.

[BibT_eX]

[DOI]

Ann K. Syrdal

Proceedings of the IEEE International Conference on Acoustics, 2001

2000

A simple and fast way of generating a harmonic signal.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2000

Corpus-based techniques in the AT&t nextgen synthesis system.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Multimodal Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000

On the implementation of the harmonic plus noise model for concatenative speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2000

Stochastic modeling of spectral adjustment for high quality pitch modification.

[BibT_eX]

[DOI]

Alexander Kain

Proceedings of the IEEE International Conference on Acoustics, 2000

1999

Single complex sinusoid and ARHE model based pitch extractors.

[BibT_eX]

[DOI]

Ilija Zeljkovic

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Synchronization of speech frames based on phase data with application to concatenative speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Detection of non-stationarity in speech signals and its application to time-scaling.

[BibT_eX]

[DOI]

David A. Kapilow

Juergen Schroeter

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Assessment and correction of voice quality variabilities in large speech databases for concatenative speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998

Continuous probabilistic transform for voice conversion.

[BibT_eX]

[DOI]

Olivier Cappé

Eric Moulines

IEEE Trans. Speech Audio Process., 1998

Removing phase mismatches in concatenative speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the Third ESCA/COCOSDA Workshop on Speech Synthesis, 1998

Concatenative speech synthesis using a harmonic plus noise model.

[BibT_eX]

[DOI]

Proceedings of the Third ESCA/COCOSDA Workshop on Speech Synthesis, 1998

Real time voice alteration based on linear prediction.

[BibT_eX]

[DOI]

Ping-Fai Yang

Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Exploration of acoustic correlates in speaker selection for concatenative synthesis.

[BibT_eX]

[DOI]

Ann K. Syrdal

Alistair Conkie

TD-PSOLA versus harmonic plus noise model in diphone based speech synthesis.

[BibT_eX]

[DOI]

Ann K. Syrdal