Abeer Alwan

Proceedings of the 9th Workshop on Speech and Language Technology in Education, 2023

Non-uniform Speaker Disentanglement For Depression Detection From Raw Speech Signals.

Jinhan Wang

Vijay Ravi

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Developmental Articulatory and Acoustic Features for Six to Ten Year Old Children.

Vishwas M. Shetty

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

An Equitable Framework for Automatically Assessing Children's Oral Narrative Language Abilities.

Alexander Johnson

Hariram Veeramani

Natarajan Balaji Shankar

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

FusedF0: Improving DNN-based F0 Estimation by Fusion of Summary-Correlograms and Raw Waveform Representations of Speech Signals.

Eray Eren

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Leveraging Multiple Sources in Automatic African American English Dialect Detection for Adults and Children.

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Towards Better Domain Adaptation for Self-Supervised Models: A Case Study of Child ASR.

IEEE J. Sel. Top. Signal Process., 2022

Spoken language interaction with robots: Recommendations for future research.

Comput. Speech Lang., 2022

Unsupervised Instance Discriminative Learning for Depression Detection from Speech Signals.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

A Step Towards Preserving Speakers' Identity While Detecting Depression Via Speaker Disentanglement.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Automatic Dialect Density Estimation for African American English.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

DRAFT: A Novel Framework to Reduce Domain Shifting in Self-supervised Learning and Its Application to Children's ASR.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Learning from human perception to improve automatic speaker verification in style-mismatched conditions.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Attention-based conditioning methods using variable frame rate for style-robust speaker verification.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Towards Better Meta-Initialization with Task Augmentation for Kindergarten-Aged Speech Recognition.

Yunzheng Zhu

Proceedings of the IEEE International Conference on Acoustics, 2022

Fraug: A Frame Rate Based Data Augmentation Method for Depression Detection from Speech Signals.

Proceedings of the IEEE International Conference on Acoustics, 2022

LPC Augment: an LPC-based ASR Data Augmentation Algorithm for Low and Zero-Resource Children's Dialects.

Proceedings of the IEEE International Conference on Acoustics, 2022

Can Social Robots Effectively Elicit Curiosity in STEM Topics from K-1 Students During Oral Assessments?

Proceedings of the IEEE Global Engineering Education Conference, 2022

2021

Fundamental frequency feature warping for frequency normalization and data augmentation in child automatic speech recognition.

Speech Commun., 2021

Low Resource German ASR with Untranscribed Data Spoken by Non-Native Children - INTERSPEECH 2021 Shared Task SPAPL System.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

An Improved Single Step Non-Autoregressive Transformer for Automatic Speech Recognition.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Fundamental Frequency Feature Normalization and Data Augmentation for Child Speech Recognition.

Proceedings of the IEEE International Conference on Acoustics, 2021

Bi-APC: Bidirectional Autoregressive Predictive Coding for Unsupervised Pre-Training and its Application to Children's ASR.

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Analysis of Disfluency in Children's Speech.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Exploring the Use of an Unsupervised Autoregressive Model as a Shared Encoder for Text-Dependent Speaker Verification.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Speaker Discrimination in Humans and Machines: Effects of Speaking Style Variability.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Variable Frame Rate-Based Data Augmentation to Handle Speaking-Style Variability for Automatic Speaker Verification.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019

Introduction to the Issue on Data Science: Machine Learning for Audio Signal Processing.

IEEE J. Sel. Top. Signal Process., 2019

A robotic interface for the administration of language, literacy, and speech pathology assessments for children.

Proceedings of the 8th ISCA International Workshop on Speech and Language Technology in Education, 2019

A Frequency Normalization Technique for Kindergarten Speech Recognition Inspired by the Role of f_o in Vowel Perception.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Voice Quality and Between-Frame Entropy for Sleepiness Estimation.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Target and Non-target Speaker Discrimination by Humans and Machines.

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Deep neural network based i-vector mapping for speaker verification using short utterances.

Speech Commun., 2018

On the Difficulties of Automatic Speech Recognition for Kindergarten-Aged Children.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Using Voice Quality Supervectors for Affect Identification.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Filter Sampling and Combination CNN (FSC-CNN): A Compact CNN Model for Small-footprint ASR Acoustic Modeling Using Raw Waveforms.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Effectiveness of Voice Quality Features in Detecting Depression.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017

Predicting Clinical Evaluations of Children's Speech with Limited Data Using Exemplar Word Template References.

Proceedings of the 7th ISCA International Workshop on Speech and Language Technology in Education, 2017

Using Voice Quality Features to Improve Short-Utterance, Text-Independent Speaker Verification Systems.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Attention Based CLDNNs for Short-Duration Acoustic Scene Classification.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

CNN-Based Joint Mapping of Short and Long Utterance i-Vectors for Speaker Verification Using Short Utterances.

Jinxi Guo

Usha Amrutha Nookala

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Robust Features in Deep-Learning-Based Speech Recognition.

Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning, 2017

2016

Speaker Identity and Voice Quality: Modeling Human Responses and Automatic Speaker Recognition.

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Fusion Strategies for Robust Speech Recognition and Keyword Spotting for Channel- and Noise-Degraded Speech.

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Noise-Robust Hidden Markov Models for Limited Training Data for Within-Species Bird Phrase Classification.

Kantapon Kaewtip

Charles E. Taylor

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Speaker Verification Using Short Utterances with DNN-Based Estimation of Subglottal Acoustic Features.

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015

The relationship between acoustic and perceived intraspeaker variability in voice quality.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Age-dependent height estimation and speaker normalization for children's speech using the first three subglottal resonances.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Bird-phrase segmentation and verification: A noise-robust template-based approach.

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014

Glottal source processing: From analysis to applications.

Comput. Speech Lang., 2014

The glottaltopogram: A method of analyzing high-speed images of the vocal folds.

Gang Chen

Comput. Speech Lang., 2014

The relationship between the second subglottal resonance and vowel class, standing height, trunk length, and F0 variation for Mandarin speakers.

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Speaker recognition via fusion of subglottal features and MFCCs.

Hitesh Anand Gupta

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Investigating the effect of F0 and vocal intensity on harmonic magnitudes: data from high-speed laryngeal videoendoscopy.

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Feature enhancement using sparse reference and estimated soft-mask exemplar-pairs for noisy speech recognition.

Proceedings of the IEEE International Conference on Acoustics, 2014

Non-linear dimension reduction of Gabor features for noise-robust ASR.

Hitesh Anand Gupta

Anirudh Raju

Proceedings of the IEEE International Conference on Acoustics, 2014

Frequency warping using subglottal resonances: Complementarity with VTLN and robustness to additive noise.

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Multi-band summary correlogram-based pitch detection for noisy speech.

Speech Commun., 2013

Automatic estimation of the first three subglottal resonances from adults' speech signals with application to speaker height estimation.

Speech Commun., 2013

A pitch-based spectral enhancement technique for robust speech processing.

Kantapon Kaewtip

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

All for one: feature combination for highly channel-degraded speech activity detection.

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Investigating the relationship between glottal area waveform shape and harmonic magnitudes through computational modeling and laryngeal high-speed videoendoscopy.

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

A perceptually and physiologically motivated voice source model.

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Bird phrase segmentation by entropy-driven change point detection.

Proceedings of the IEEE International Conference on Acoustics, 2013

A sparse representation-based classifier for in-set bird phrase verification and classification with limited training data.

Proceedings of the IEEE International Conference on Acoustics, 2013

A robust automatic bird phrase classifier using dynamic time-warping with prominent region identification.

Proceedings of the IEEE International Conference on Acoustics, 2013

Non-linear frequency warping for VTLN using subglottal resonances and the third formant frequency.

Proceedings of the IEEE International Conference on Acoustics, 2013

Change point detection methodology used for segmenting bird songs.

Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013

2012

SAFE: A Statistical Approach to F0 Estimation Under Clean and Noisy Conditions.

IEEE Trans. Speech Audio Process., 2012

Evaluation of a Sparse Representation-Based Classifier For Bird Phrase Classification Under Limited Data Conditions.

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Estimating the voice source in noise.

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Automatic estimation of the first two subglottal resonances in children's speech with application to speaker normalization in limited-data conditions.

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

A novel approach to soft-mask estimation and Log-Spectral enhancement for robust speech recognition.

Julien van Hout

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

FBEM: A filter bank EM algorithm for the joint optimization of features and acoustic model parameters in bird call classification.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

The glottaltopograph: A method of analyzing high-speed images of the vocal folds.

Gang Chen

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Automatic height estimation using the second subglottal resonance.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011

A Generative Student Model for Scoring Word Reading Skills.

Joseph Tepperman

Sungbok Lee

IEEE Trans. Speech Audio Process., 2011

A Unified Framework for Designing Optimal STSA Estimators Assuming Maximum Likelihood Phase Equivalence of Speech and Noise.

Bengt Jonas Borgstrom

IEEE ACM Trans. Audio Speech Lang. Process., 2011

Perception of place of articulation for plosives and fricatives in noise.

Jintao Jiang

Willa S. Chen

Speech Commun., 2011

Analysis and Automatic Estimation of Children's Subglottal Resonances.

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Joint Robust Voicing Detection and Pitch Estimation Based on Residual Harmonics.

Thomas Drugman

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Acoustic Correlates of Glottal Gaps.

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Noise-robust F0 estimation using SNR-weighted summary correlograms from multi-band comb filters.

Proceedings of the IEEE International Conference on Acoustics, 2011

Log-spectral amplitude estimation with Generalized Gamma distributions for speech enhancement.

Proceedings of the IEEE International Conference on Acoustics, 2011

Automatic estimation of the second subglottal resonance from natural speech.

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

HMM-Based Reconstruction of Unreliable Spectrographic Data for Noise Robust Speech Recognition.

IEEE Trans. Speech Audio Process., 2010

A Statistical Approach to Mel-Domain Mask Estimation for Missing-Feature ASR.

IEEE Signal Process. Lett., 2010

On the acoustic correlates of high and low nuclear pitch accents in American English.

Stefanie Shattuck-Hufnagel

Speech Commun., 2010

Improved Speech Presence Probabilities Using HMM-Based Inference, With Applications to Speech Enhancement and ASR.

IEEE J. Sel. Top. Signal Process., 2010

On the interdependencies between voice quality, glottal gaps, and voice-source related acoustic measures.

Gang Chen

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

SAFE: a statistical algorithm for F0 estimation for both clean and noisy speech.

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

On using voice source measures in automatic gender classification of children's speech.

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Efficient HMM-based estimation of missing features, with applications to packet loss concealment.

Per Henrik Borgström

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Voice activity detection using harmonic frequency components in likelihood ratio test.

Proceedings of the IEEE International Conference on Acoustics, 2010

A new voice source model based on high-speed imaging and its application to voice source estimation.

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Utilizing Compressibility in Reconstructing Spectrographic Data, With Applications to Noise Robust ASR.

IEEE Signal Process. Lett., 2009

Assessment of emerging reading skills in young native speakers and language learners.

Christy Kim Boscardin

Margaret Heritage

P. David Pearson

Speech Commun., 2009

Frequency warping for VTLN and speaker adaptation by linear transformation of standard MFCC.

Sankaran Panchapagesan

Comput. Speech Lang., 2009

Measuring children's phonemic awareness through blending tasks.

Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2009

Temporal modulation processing of speech signals for noise robust ASR.

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Bark-shift based nonlinear speaker normalization using the second subglottal resonance.

Yi-Hui Lee

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

A novel codebook search technique for estimating the open quotient.

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

A noise-type and level-dependent MPO-based speech enhancement architecture with variable frame analysis for noise-robust speech recognition.

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

A correlation-maximization denoising filter used as an enhancement frontend for noise robust bird call classification.

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Reducing F0 Frame Error of F0 tracking algorithms under noisy conditions with an unvoiced/voiced classification frontend.

Proceedings of the IEEE International Conference on Acoustics, 2009

2008

A Low-Complexity Parabolic Lip Contour Model With Speaker Normalization for High-Level Feature Extraction in Noise-Robust Audiovisual Speech Recognition.

IEEE Trans. Syst. Man Cybern. Part A, 2008

A reliable technique for detecting the second subglottal resonance and its use in cross-language speaker adaptation.

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Effects of intonational phrase boundaries on pitch-accented syllables in American English.

Stefanie Shattuck-Hufnagel

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Vocal tract inversion by cepstral analysis-by-synthesis using chain matrices.

Sankaran Panchapagesan

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

HMM-based estimation of unreliable spectral components for noise robust speech recognition.

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Dealing with limited and noisy data in ASR: a hybrid knowledge-based and statistical approach.

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Speaker normalization based on subglottal resonances.

Proceedings of the IEEE International Conference on Acoustics, 2008

An efficient approximation of the forward-backward algorithm to deal with packet loss, with applications to remote speech recognition.

Proceedings of the IEEE International Conference on Acoustics, 2008

2007

Speaker Adaptation With Limited Data Using Regression-Tree-Based Spectral Peak Alignment.

IEEE Trans. Speech Audio Process., 2007

Robust Speaker Adaptation by Weighted Model Averaging Based on the Minimum Description Length Criterion.

IEEE Trans. Speech Audio Process., 2007

Rate Allocation for Noncollaborative Multiuser Speech Communication Systems Based on Bargaining Theory.

Mihaela van der Schaar

IEEE Trans. Speech Audio Process., 2007

Automatic evaluation of children's performance on an English syllable blending task.

Proceedings of the Workshop on Speech and Language Technology in Education, 2007

A System for Technology Based Assessment of Language and Literacy in Young Children: the Role of Multiple Information Sources.

Patti Price

Joseph Tepperman

Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007

A Bayesian network classifier for word-level reading assessment.

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Pitch accent versus lexical stress: quantifying acoustic measures related to the voice source.

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

A packetization and variable bitrate interframe compression scheme for vector quantizer-based distributed speech recognition.

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

A Statistical Acoustic Confusability Metric Between Hidden Markov Models.

Proceedings of the IEEE International Conference on Acoustics, 2007

2006

Competition of plasmid-bearing and plasmid-free organisms in a chemostat: A study of bifurcation phenomena.

Khalid Alhumaizi

Abdelhamid Ajbar

Math. Comput. Model., 2006

Adaptation of children's speech with limited data based on formant-like peak alignment.

Comput. Speech Lang., 2006

Rapid speaker adaptation using regression-tree based spectral peak alignment.

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Pronunciation verification of children's speech for automatic literacy assessment.

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Automatic detection of voice onset time contrasts for use in pronunciation assessment.

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Voice source correlates of prosodic features in American English: a pilot study.

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Acoustically-Driven Talking Face Synthesis using Dynamic Bayesian Networks.

Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Multi-Parameter Frequency Warping for VTLN by Gradient Search.

Sankaran Panchapagesan

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Age- and Gender-Dependent Analysis of Voice Source Characteristics.

Markus Iseli

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

A Database of Vocal Tract Resonance Trajectories for Research in Speech Processing.

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005

Noise robust speech recognition using feature compensation based on polynomial regression of utterance SNR.

IEEE Trans. Speech Audio Process., 2005

Pronunciation variations of Spanish-accented English spoken by young children.

Abe Kazemzadeh

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

TBALL data collection: the making of a young children's speech corpus.

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

MLLR-like speaker adaptation based on linearization of VTLN with MFCC features.

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Consonant confusion structure based on machine classification of visual features in continuous speech.

Proceedings of the Auditory-Visual Speech Processing 2005, 2005

2004

Entropy-based variable frame rate analysis of speech signals and its application to ASR.

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

An improved correction formula for the estimation of harmonic magnitudes and its application to open quotient estimation.

Markus Iseli

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Combining feature compensation and weighted Viterbi decoding for noise robust speech recognition with limited adaptation data.

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

A psychoacoustic-masking model to predict the perception of speech-like stimuli in noise.

Speech Commun., 2003

Band-limited feedback cancellation with a modified filtered-X LMS algorithm for hearing aids.

Speech Commun., 2003

Editorial.

Speech Commun., 2003

Non-linear feature extraction for robust speech recognition in stationary and non-stationary noise.

Comput. Speech Lang., 2003

A noise-robust ASR back-end technique based on weighted viterbi recognition.

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Speech recognition over bluetooth wireless channels.

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002

Speech transmission using rate-compatible trellis codes and embedded source coding.

IEEE Trans. Commun., 2002

Low-bitrate distributed speech recognition for packet-based and wireless communication.

IEEE Trans. Speech Audio Process., 2002

The effect of additive noise on speech amplitude spectra: a quantitative analysis.

IEEE Signal Process. Lett., 2002

On the Relationship between Face Movements, Tongue Movements, and Speech Acoustics.

EURASIP J. Adv. Signal Process., 2002

Evaluation of noise robust features on the Aurora databases.

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Channel noise robustness for low-bitrate remote speech recognition.

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Predicting face movements from speech acoustics using spectral dynamics.

Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

Similarity structure in perceptual and physical measures for visual consonants across talkers.

Proceedings of the IEEE International Conference on Acoustics, 2002

Analysis by synthesis of FM modulation and aspiration noise components in pathological voices.

Brian Gabelman

Proceedings of the IEEE International Conference on Acoustics, 2002

Efficient adaptation text design based on the Kullback-Leibler measure.

Proceedings of the IEEE International Conference on Acoustics, 2002

2001

Guest Editors' Introduction.

Antonio Ortega

J. VLSI Signal Process., 2001

Noise robust feature extraction for ASR using the Aurora 2 database.

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Predicting visual consonant perception from physical measures.

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

On the perception of voicing for plosives in noise.

Marcia Chen

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Joint channel decoding - Viterbi recognition for wireless applications.

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

An efficient and scalable 2D DCT-based feature coding scheme for remote speech recognition.

Proceedings of the IEEE International Conference on Acoustics, 2001

Source and channel coding for remote speech recognition over error-prone channels.

Proceedings of the IEEE International Conference on Acoustics, 2001

Similarity structure in visual phonetic perception and optical phonetics.

Proceedings of the Auditory-Visual Speech Processing, 2001

2000

Steady-state analysis of continuous adaptation in acoustic feedback reduction systems for hearing-aids.

IEEE Trans. Speech Audio Process., 2000

Noise source models for fricative consonants.

IEEE Trans. Speech Audio Process., 2000

AM-demodulation of speech spectra and its application to noise robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

On the correlation between facial movements, tongue movements and speech acoustics.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Inter- and intra-speaker variability of glottal flow derivative using the LF model.

[BibT_eX]

[DOI]

Markus Iseli

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Predicting the perceptual confusion of synthetic plosive consonants in noise.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Place of articulation cues for voiced and voiceless plosives and fricatives in syllable-initial position.

[BibT_eX]

[DOI]

Willa S. Chen

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Towards Efficient and Scalable Speech Compression Schemes for Robust Speech Recognition Applications.

[BibT_eX]

[DOI]

Naveen Srinivasamurthy

Antonio Ortega

Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000

On the use of variable frame rate analysis in speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2000

1999

Modeling speech production and perception mechanisms and their applications to synthesis, recognition, and coding.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Symposium on Signal Processing and its Applications (ISSPA '99), 1999

Modeling the masking of formant transitions in noise.

[BibT_eX]

[DOI]

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Perceptually based and embedded wideband CELP coding of speech.

[BibT_eX]

[DOI]

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Bias analysis in continuous adaptation systems for hearing aids.

[BibT_eX]

[DOI]

Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

Embedded joint source-channel coding of speech using symbol puncturing of trellis codes.

[BibT_eX]

[DOI]

Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998

Robust word recognition using threaded spectral peaks.

[BibT_eX]

[DOI]

Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997

A perceptually based embedded subband speech coder.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 1997

A model of dynamic auditory perception and its application to robust word recognition.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 1997

Analysis by synthesis of pathological voices using the Klatt synthesizer.

[BibT_eX]

[DOI]

Speech Commun., 1997

Towards articulatory speech recognition: learning smooth maps to recover articulator information.

[BibT_eX]

[DOI]

Sam T. Roweis

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

New results in vowel production: MRI, EPG, and acoustic data.

[BibT_eX]

[DOI]

Yong Song

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Acoustic modelling of American English /r/.

[BibT_eX]

[DOI]

Carol Y. Espy-Wilson

Suzanne Boyce

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

1996

Adaptive mobile multimedia networks.

[BibT_eX]

[DOI]

IEEE Wirel. Commun., 1996

Liquids in Tamil.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

A psychoacoustic model for the noise masking of voiceless plosive bursts.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

From MRI and acoustic data to articulatory synthesis: a case study of the lateral approximants in American English.

[BibT_eX]

[DOI]

Philbert Bangayan

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

A model of dynamic auditory perception and its application to robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Parametric hybrid source models for voiced and voiceless fricative consonants.

[BibT_eX]

[DOI]

Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995

Finite Precision Analysis of the Fast QRD-RLS Lattice Algorithm.

[BibT_eX]

Paulo S. R. Diniz

Proceedings of the 1995 IEEE International Symposium on Circuits and Systems, ISCAS 1995, Seattle, Washington, USA, April 30, 1995

Spectral analysis of subband filtered signals.

[BibT_eX]

[DOI]

Proceedings of the 1995 International Conference on Acoustics, 1995

A novel structure to compensate for frequency-dependent loudness recruitment of sensorineural hearing loss.

[BibT_eX]

[DOI]

Proceedings of the 1995 International Conference on Acoustics, 1995

A robust variable-rate speech coder.

[BibT_eX]

[DOI]

Proceedings of the 1995 International Conference on Acoustics, 1995

1994

Infinite Precision Analysis of the Fast QR Decomposition RLS Algorithm.

[BibT_eX]

[DOI]

Paulo S. R. Diniz

Proceedings of the 1994 IEEE International Symposium on Circuits and Systems, ISCAS 1994, London, England, UK, May 30, 1994

High-Performance IIR QMF Banks for Speech Subband Coding.

[BibT_eX]

[DOI]

Zhongnong Jiang

Alan N. Willson Jr.

Proceedings of the 1994 IEEE International Symposium on Circuits and Systems, ISCAS 1994, London, England, UK, May 30, 1994

An MRI study of fricative consonants.

[BibT_eX]

[DOI]

Katherine Haker

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

New adaptive-filtering techniques applied to speech echo cancellation.

[BibT_eX]

[DOI]

Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993

Strange attractors and chaotic dynamics in the production of voiced and voiceless fricatives.

[BibT_eX]

[DOI]

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

A perceptual metric for masking.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 1993

1992

The role of F3 and F4 in identifying place of articulation for stop consonants.

[BibT_eX]

[DOI]