Paavo Alku
Orcid: 0000-0002-8173-9418
According to our database, Paavo Alku authored at least 255 papers between 1988 and 2024.
Bibliography
2024
Exploring the Impact of Fine-Tuning the Wav2vec2 Model in Database-Independent Detection of Dysarthric Speech.
IEEE J. Biomed. Health Informatics, August, 2024
Automatic classification of the severity level of Parkinson's disease: A comparison of speaking tasks, features, and classifiers.
Comput. Speech Lang., January, 2024
Investigation of self-supervised pre-trained models for classification of voice quality from speech and neck surface accelerometer signals.
Comput. Speech Lang., January, 2024
Comput. Speech Lang., January, 2024
Automatic classification of neurological voice disorders using wavelet scattering features.
Speech Commun., 2024
Pre-trained models for detection and severity level classification of dysarthria from speech.
Speech Commun., 2024
Speech Commun., 2024
2023
Speech Commun., November, 2023
Comput. Speech Lang., June, 2023
Exemplar-Based Sparse Representations for Detection of Parkinson's Disease From Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Automatic Assessment of Parkinson's Disease Using Speech Representations of Phonation and Articulation.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Analysis of Instantaneous Frequency Components of Speech Signals for Epoch Extraction.
Comput. Speech Lang., 2023
Classification of Vocal Intensity Category from Speech using the Wav2vec2 and Whisper Embeddings.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Severity Classification of Parkinson's Disease from Speech using Single Frequency Filtering-based Features.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
2022
IEEE Signal Process. Lett., 2022
Speech Commun., 2022
Speech Commun., 2022
Convolutional Neural Networks for Classification of Voice Qualities from Speech and Neck Surface Accelerometer Signals.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Comparing 1-dimensional and 2-dimensional spectral feature representations in voice pathology detection using machine learning and deep learning classifiers.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
2021
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Proc. IEEE, 2021
Comput. Speech Lang., 2021
Automatic assessment of intelligibility in speakers with dysarthria from coded telephone speech using glottal features.
Comput. Speech Lang., 2021
Glottal features for classification of phonation type from speech and neck surface accelerometer signals.
Comput. Speech Lang., 2021
A Comparison of Cepstral Features in the Detection of Pathological Voices by Varying the Input and Filterbank of the Cepstrum Computation.
IEEE Access, 2021
Formant Tracking Using Quasi-Closed Phase Forward-Backward Linear Prediction Analysis and Deep Neural Networks.
IEEE Access, 2021
Spectral modification for recognition of children's speech under mismatched conditions.
Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021
2020
Time-Varying Quasi-Closed-Phase Analysis for Accurate Formant Tracking in Speech Signals.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Speech Commun., 2020
Speech Commun., 2020
Duration of the rhotic approximant /ɹ/ in spastic dysarthria of different severity levels.
Speech Commun., 2020
IEEE J. Sel. Top. Signal Process., 2020
Excitation Features of Speech for Emotion Recognition Using Neutral Speech as Reference.
Circuits Syst. Signal Process., 2020
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech.
Comput. Speech Lang., 2020
IEEE Access, 2020
IEEE Access, 2020
IEEE Access, 2020
Parkinson's Disease Detection from Speech Using Single Frequency Filtering Cepstral Coefficients.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
GlotNet - A Raw Waveform Model for the Glottal Excitation in Statistical Parametric Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2019
Estimation of the glottal source from coded telephone speech using deep neural networks.
Speech Commun., 2019
Speech Commun., 2019
Analysis of phonation onsets in vowel production, using information from glottal area and flow estimate.
Speech Commun., 2019
Normal-to-Lombard adaptation of speech synthesis using long short-term memory recurrent neural networks.
Speech Commun., 2019
Speech Commun., 2019
Vocal effort compensation for MFCC feature extraction in a shouted versus normal speaker recognition task.
Comput. Speech Lang., 2019
Vocal Effort Based Speaking Style Conversion Using Vocoder Features and Parallel Learning.
IEEE Access, 2019
Augmented CycleGANs for Continuous Scale Normal-to-Lombard Speaking Style Conversion.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Mel-Frequency Cepstral Coefficients of Voice Source Waveforms for Classification of Phonation Types in Speech.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Lombard Speech Synthesis Using Transfer Learning in a Tacotron Text-to-Speech System.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Cycle-consistent Adversarial Networks for Non-parallel Vocal Effort Based Speaking Style Conversion.
Proceedings of the IEEE International Conference on Acoustics, 2019
Waveform Generation for Text-to-speech Synthesis Using Pitch-synchronous Multi-scale Generative Adversarial Networks.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
2018
A Comparison Between STRAIGHT, Glottal, and Sinusoidal Vocoding in Statistical Parametric Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2018
Speaker recognition from whispered speech: A tutorial survey and an application of time-varying linear prediction.
Speech Commun., 2018
Parameterization of a computational physical model for glottal flow using inverse filtering and high-speed videoendoscopy.
Speech Commun., 2018
Estimation of the glottal flow from speech pressure signals: Evaluation of three variants of iterative adaptive inverse filtering using computational physical modelling of voice production.
Speech Commun., 2018
Comparison of spectral tilt measures for sentence prominence in speech - Effects of dimensionality and adverse noise conditions.
Speech Commun., 2018
Speaking style adaptation in Text-To-Speech synthesis using Sequence-to-sequence models with attention.
CoRR, 2018
Dysarthric Speech Classification Using Glottal Features Computed from Non-words, Words and Sentences.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Time-regularized Linear Prediction for Noise-robust Extraction of the Spectral Envelope of Speech.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
Intelligibility Enhancement of Telephone Speech Using Gaussian Process Regression for Normal-to-Lombard Spectral Tilt Conversion.
IEEE ACM Trans. Audio Speech Lang. Process., 2017
The Linear Predictive Modeling of Speech From Higher-Lag Autocorrelation Coefficients Applied to Noise-Robust Speaker Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2017
Quadratic Programming Approach to Glottal Inverse Filtering by Joint Norm-1 and Norm-2 Optimization.
IEEE ACM Trans. Audio Speech Lang. Process., 2017
IEEE Signal Process. Lett., 2017
Comparison of parametrization methods of electroglottographic and inverse filtered acoustic speech pressure signals in distinguishing between phonation types.
Biomed. Signal Process. Control., 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Speaking Style Conversion from Normal to Lombard Speech Using a Glottal Vocoder and Bayesian GMMs.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Evaluation of Spectral Tilt Measures for Sentence Prominence Under Different Noise Conditions.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Reducing Mismatch in Training of DNN-Based Glottal Excitation Models in a Statistical Parametric Text-to-Speech System.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Generative Adversarial Network-Based Glottal Waveform Model for Statistical Parametric Speech Synthesis.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Effects of Training Data Variety in Generating Glottal Pulses from Acoustic Features with DNNs.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Normal-to-shouted speech spectral mapping for speaker recognition under vocal effort mismatch.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Non-parallel voice conversion using i-vector PLDA: towards unifying speaker verification and transformation.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
2016
Feature Extraction Using Power-Law Adjusted Linear Prediction With Application to Speaker Recognition Under Severe Vocal Effort Mismatch.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Phase perception of the glottal excitation and its relevance in statistical parametric speech synthesis.
Speech Commun., 2016
Phase modification for increasing the intelligibility of telephone speech in near-end noise conditions - evaluation of two methods.
Speech Commun., 2016
Previous exposure to intact speech increases intelligibility of its digitally degraded counterpart as a function of stimulus complexity.
NeuroImage, 2016
Comparing human and automatic speech recognition in a perceptual restoration experiment.
Comput. Speech Lang., 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Using Text and Acoustic Features in Predicting Glottal Excitation Waveforms for Parametric Speech Synthesis with Recurrent Neural Networks.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Majorisation-Minimisation Based Optimisation of the Composite Autoregressive System with Application to Glottal Inverse Filtering.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
The Use of Read versus Conversational Lombard Speech in Spectral Tilt Modeling for Intelligibility Enhancement in Near-End Noise Conditions.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Intelligibility Enhancement at the Receiving End of the Speech Transmission System - Effects of Far-End Noise Reduction.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Time-Varying Quasi-Closed-Phase Weighted Linear Prediction Analysis of Speech for Accurate Formant Detection and Tracking.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
High-pitched excitation generation for glottal vocoding in statistical parametric speech synthesis using a deep neural network.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Quasi closed phase analysis of speech signals using time varying weighted linear prediction for accurate formant tracking.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
A subjective listening test of six different artificial bandwidth extension approaches in English, Chinese, German, and Korean.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
Vowel Enhancement in Early Stage Spanish Esophageal Speech Using Natural Glottal Flow Pulse and Vocal Tract Frequency Warping.
Proceedings of the 6th Workshop on Speech and Language Processing for Assistive Technologies, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Accounting for uncertainty of i-vectors in speaker recognition using uncertainty propagation and modified imputation.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Speech quality evaluation of artificial bandwidth extension: comparing subjective judgments and instrumental predictions.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Comparison of Gaussian process regression and Gaussian mixture models in spectral tilt modelling for intelligibility enhancement of telephone speech.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
AM-FM based filter bank analysis for estimation of spectro-temporal envelopes and its application for speaker recognition in noisy reverberant environments.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 18th International Congress of Phonetic Sciences, 2015
Non-native production training with an acoustic model and orthographic or transcription cues.
Proceedings of the 18th International Congress of Phonetic Sciences, 2015
Does interest in language learning affect the non-native phoneme production in elderly learners?
Proceedings of the 18th International Congress of Phonetic Sciences, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
2014
Quasi Closed Phase Glottal Inverse Filtering Analysis With Weighted Linear Prediction.
IEEE ACM Trans. Audio Speech Lang. Process., 2014
IEEE Signal Process. Lett., 2014
Synthesis and perception of breathy, normal, and Lombard speech in the presence of noise.
Comput. Speech Lang., 2014
An adaptive post-filtering method producing an artificial Lombard-like effect for intelligibility enhancement of narrowband telephone speech.
Comput. Speech Lang., 2014
Comput. Speech Lang., 2014
Biomed. Signal Process. Control., 2014
Spectral tilt modelling with extrapolated GMMs for intelligibility enhancement of narrowband telephone speech.
Proceedings of the 14th International Workshop on Acoustic Signal Enhancement, 2014
Deep neural network based trainable voice source model for synthesis of speech with varying vocal effort.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Subjective voice quality evaluation of artificial bandwidth extension: comparing different audio bandwidths and speech codecs.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Filtering and subspace selection for spectral features in detecting speech under physical stress.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Enhancement of speech intelligibility in near-end noise conditions with phase modification.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Spectral tilt modelling with GMMs for intelligibility enhancement of narrowband telephone speech.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Multi-scale modulation filtering in automatic detection of emotions in telephone speech.
Proceedings of the IEEE International Conference on Acoustics, 2014
Comparison of post-processing methods for intelligibility enhancement of narrowband speech in a mobile phone framework.
Proceedings of the IEEE International Conference on Acoustics, 2014
Voice source modelling using deep neural networks for statistical parametric speech synthesis.
Proceedings of the 22nd European Signal Processing Conference, 2014
2013
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013
Robust spectral representation using group delay function and stabilized weighted linear prediction for additive noise degradations.
Proceedings of the 7th Conference on Speech Technology and Human-Computer Dialogue, 2013
Lombard modified text-to-speech synthesis for improved intelligibility: submission for the hurricane challenge 2013.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Extended weighted linear prediction using the autocorrelation snapshot - a robust speech analysis method and its application to recognition of vocal emotions.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Frequency-adaptive post-filtering for intelligibility enhancement of narrowband telephone speech.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Comparison of spectrum estimators in speaker verification: mismatch conditions induced by vocal effort.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Robust formant detection using group delay function and stabilized weighted linear prediction.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Automatic detection of anger in telephone speech with robust autoregressive modulation filtering.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
Bandwidth Extension of Telephone Speech to Low Frequencies Using Sinusoidal Synthesis and a Gaussian Mixture Model.
IEEE Trans. Speech Audio Process., 2012
IEEE Signal Process. Lett., 2012
IEEE Signal Process. Lett., 2012
Cortical processing of degraded speech sounds: Effects of distortion type and continuity.
NeuroImage, 2012
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Utilization of the Lombard effect in post-filtering for intelligibility enhancement of telephone speech.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Utilizing Markov Chain Monte Carlo (MCMC) Method for Improved Glottal Inverse Filtering.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Improved formant frequency estimation from high-pitched vowels by downgrading the contribution of the glottal source with weighted linear prediction.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
On measuring the intelligibility of synthetic speech in noise - Do we need a realistic noise environment?
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Conversational evaluation of artificial bandwidth extension of telephone speech using a mobile handset.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Comparing spectrum estimators in speaker verification under additive noise degradation.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Comparison of post-filtering methods for intelligibility enhancement of telephone speech.
Proceedings of the 20th European Signal Processing Conference, 2012
Proceedings of the Blizzard Challenge 2012, Portland, OR, USA, September 14, 2012, 2012
2011
IEEE Trans. Speech Audio Process., 2011
Bandwidth Extension of Telephone Speech Using a Neural Network and a Filter Bank Implementation for Highband Mel Spectrum.
IEEE Trans. Speech Audio Process., 2011
Cortical encoding of aperiodic and periodic speech sounds: Evidence for distinct neural populations.
NeuroImage, 2011
Proceedings of the 7th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Low-Frequency Bandwidth Extension of Telephone Speech Using Sinusoidal Synthesis and Gaussian Mixture Model.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Noise Robust Feature Extraction Based on Extended Weighted Linear Prediction in LVCSR.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Utilizing glottal source pulse library for generating improved excitation signal for HMM-based speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2011
Speech bandwidth extension using Gaussian mixture model-based estimation of the highband mel spectrum.
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
The GlottHMM Speech Synthesis Entry for Blizzard Challenge 2011: Utilizing Source Unit Selection in HMM-Based Speech Synthesis for Improved Excitation Generation.
Proceedings of the Blizzard Challenge 2011, Turin, Italy, September 2, 2011, 2011
2010
Temporally Weighted Linear Prediction Features for Tackling Additive Noise in Speaker Verification.
IEEE Signal Process. Lett., 2010
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010
Temporally Weighted Linear Prediction Features for Speaker Verification in Additive Noise.
Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Extended weighted linear prediction (XLP) analysis of speech and its application to speaker verification in adverse conditions.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Bandwidth extension of telephone speech using a filter bank implementation for highband MEL spectrum.
Proceedings of the 18th European Signal Processing Conference, 2010
Proceedings of the Blizzard Challenge 2010, Kansai Science City, Japan, September 25, 2010, 2010
2009
Development, evaluation and implementation of an artificial bandwidth extension method of telephone speech in mobile terminal.
IEEE Trans. Consumer Electron., 2009
Proceedings of the Sixth International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, 2009
New method for delexicalization and its application to prosodic tagging for text-to-speech synthesis.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
On separating glottal source and vocal tract information in telephony speaker verification.
Proceedings of the IEEE International Conference on Acoustics, 2009
2008
IEEE Trans. Speech Audio Process., 2008
Signal Process., 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
2007
IEEE Trans. Speech Audio Process., 2007
Proceedings of the Fifth International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, 2007
The effect of highband harmonic structure in the artificial bandwidth expansion of telephone speech.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Stabilised weighted linear prediction - a robust all-pole method for speech processing.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
2006
Emotions in Vowel Segments of Continuous Speech: Analysis of the Glottal Flow Using the Normalised Amplitude Quotient.
Phonetica, 2006
Quality improvement of telephone speech by artificial bandwidth expansion - listening tests in three languages.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
2005
Assessment of glottal inverse filtering by using aeroelastic modelling of phonation and FE modelling of vocal tract.
Proceedings of the Fourth International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, 2005
Subglottal pressure and NAQ variation in voice production of classically trained baritone singers.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Artificial Bandwidth Expansion Method to Improve Intelligibility and Quality of AMR-Coded Narrowband Speech.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
2004
IEEE Trans. Speech Audio Process., 2004
Linear predictive method for improved spectral modeling of lower frequencies of speech with small prediction orders.
IEEE Trans. Speech Audio Process., 2004
Analysis of the voice source in different phonation types: simultaneous high-speed imaging of the vocal fold vibration and glottal inverse filtering.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Evaluation of an inverse filtering technique using physical modeling of voice production.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Emotions in Short Vowel Segments: Effects of the Glottal Flow as Reflected by the Normalized Amplitude Quotient.
Proceedings of the Affective Dialogue Systems, Tutorial and Research Workshop, 2004
2003
IEEE Signal Process. Lett., 2003
Signal Process., 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
2002
Time-domain parameterization of the closing phase of glottal airflow waveform from voices over a large intensity range.
IEEE Trans. Speech Audio Process., 2002
Measuring the effect of fundamental frequency raising as a strategy for increasing vocal intensity in soft, normal and loud phonation.
Speech Commun., 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
A time domain reformulation of linear prediction equivalent to the LSP decomposition.
Proceedings of the IEEE International Conference on Acoustics, 2002
2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
The use of fundamental frequency raising as a strategy for increasing vocal intensity in soft, normal, and loud phonation.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
2000
Proceedings of the IEEE International Symposium on Circuits and Systems, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
MEG-measurements of brain activity reveal the link between human speech production and perception.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Analysis of voice production in breathy, normal and pressed phonation by comparing inverse filtering and videokymography.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
All-pole spectral modelling of voiced speech with a highly compressed set of parameters.
Proceedings of the 10th European Signal Processing Conference, 2000
1999
On the linearity of the relationship between the sound pressure level and the negative peak amplitude of the differentiated glottal flow in vowel production.
Speech Commun., 1999
A new predictive method for all-pole modelling of speech spectra with a compressed set of parameters.
Proceedings of the 1999 International Symposium on Circuits and Systems, ISCAS 1999, Orlando, Florida, USA, May 30, 1999
1998
Separated Linear Prediction - A new all-pole modelling technique for speech analysis.
Speech Commun., 1998
Estimation of amplitude features of the glottal flow by inverse filtering speech pressure signals.
Speech Commun., 1998
Analyzing the effect of secondary excitations of the vocal tract on vocal intensity in different loudness conditions.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 9th European Signal Processing Conference, 1998
1997
Speech Commun., 1997
1996
Amplitude domain quotient for characterization of the glottal volume velocity waveform estimated by inverse filtering.
Speech Commun., 1996
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
1994
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
1992
Speech Commun., 1992
Inverse filtering of the glottal waveform using the Itakura-Saito distortion measure.
Proceedings of the Second International Conference on Spoken Language Processing, 1992
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992
1990
A comparison of EGG and a new automatic inverse filtering method in phonation change from breathy to normal.
Proceedings of the First International Conference on Spoken Language Processing, 1990
Proceedings of the First International Conference on Spoken Language Processing, 1990
1989
Proceedings of the First European Conference on Speech Communication and Technology, 1989
Proceedings of the First European Conference on Speech Communication and Technology, 1989
1988
Proceedings of the IEEE International Conference on Acoustics, 1988