John H. L. Hansen

Orcid: 0000-0003-1382-9929

Affiliations:
  • University of Texas at Dallas


According to our database, John H. L. Hansen authored at least 592 papers between 1987 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Meeting the Challenges of a Growing ICASSP: Highlights from ICASSP 2024 [Conference Highlights].
IEEE Signal Process. Mag., May, 2024

Speech Enhancement for Cochlear Implant Recipients Using Deep Complex Convolution Transformer With Frequency Transformation.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Monaural Speech Dereverberation Using Deformable Convolutional Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Navigating the United States Legislative Landscape on Voice Privacy: Existing Laws, Proposed Bills, Protection for Children, and Synthetic Data for AI.
CoRR, 2024

We Need Variations in Speech Synthesis: Sub-center Modelling for Speaker Embeddings.
CoRR, 2024

Toward Improving Synthetic Audio Spoofing Detection Robustness via Meta-Learning and Disentangled Training With Adversarial Examples.
IEEE Access, 2024

Joint Language and Speaker Classification in Naturalistic Bilingual Adult-Toddler Interactions.
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024

Apollo's Unheard Voices: Graph Attention Networks for Speaker Diarization and Clustering for Fearless Steps Apollo Collection.
Proceedings of the IEEE International Conference on Acoustics, 2024

Efficient Adapter Tuning of Pre-Trained Speech Models for Automatic Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2024

Dual-Path Minimum-Phase and All-Pass Decomposition Network for Single Channel Speech Dereverberation.
Proceedings of the IEEE International Conference on Acoustics, 2024

Situational Signal Processing with Ecological Momentary Assessment: Leveraging Environmental Context for Cochlear Implant Users.
Proceedings of the IEEE International Conference on Acoustics, 2024

Fearless Steps Apollo: Team Communications Based Community Resource Development for Science, Technology, Education, and Historical Preservation.
Proceedings of the IEEE International Conference on Acoustics, 2024

T-EnFP: An Efficient Transformer Encoder-Based System for Driving Behavior Classification.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Single-channel speech separation using soft-minimum permutation invariant training.
Speech Commun., June, 2023

Historical Audio Search and Preservation: Finding Waldo Within the Fearless Steps Apollo 11 Naturalistic Audio Corpus [Applications Corner].
IEEE Signal Process. Mag., May, 2023

Attention and DCT Based Global Context Modeling for Text-Independent Speaker Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Bilateral Cochlear Implant Processing of Coding Strategies With CCi-MOBILE, an Open-Source Research Platform.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Domain Expansion for End-to-End Speech Recognition: Applications for Accent/Dialect Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

DeepComboSAD: Spectro-Temporal Correlation Based Speech Activity Detection for Naturalistic Audio Streams.
IEEE Signal Process. Lett., 2023

Multi-objective Non-intrusive Hearing-aid Speech Assessment Model.
CoRR, 2023

What Can an Accent Identifier Learn? Probing Phonetic and Prosodic Information in a Wav2vec2-based Accent Identification Model.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

MixRep: Hidden Representation Mixup for Low-Resource Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Assessment of Non-Native Speech Intelligibility using Wav2vec2-based Mispronunciation Detection and Multi-level Goodness of Pronunciation Transformer.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Speaker Tracking using Graph Attention Networks with Varying Duration Utterances across Multi-Channel Naturalistic Data: Fearless Steps Apollo-11 Audio Corpus.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

CFTNet: Complex-valued Frequency Transformation Network for Speech Enhancement.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Improving Transformer-Based Networks with Locality for Automatic Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2023

Filterbank Learning for Noise-Robust Small-Footprint Keyword Spotting.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Vision-Cloud Data Fusion for ADAS: A Lane Change Prediction Case Study.
IEEE Trans. Intell. Veh., 2022

CCi-MOBILE: A Portable Real Time Speech Processing Platform for Cochlear Implant and Hearing Research.
IEEE Trans. Biomed. Eng., 2022

Multi-Source Domain Adaptation for Text-Independent Forensic Speaker Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

SkipConvGAN: Monaural Speech Dereverberation Using Generative Adversarial Networks via Complex Time-Frequency Masking.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Assessing child communication engagement and statistical speech patterns for American English via speech recognition in naturalistic active learning spaces.
Speech Commun., 2022

Filterbank Learning for Small-Footprint Keyword Spotting Robust to Noise.
CoRR, 2022

Fearless Steps Challenge Phase-1 Evaluation Plan.
CoRR, 2022

Learning ASR pathways: A sparse multilingual ASR model.
CoRR, 2022

Data-driven Attention and Data-independent DCT based Global Context Modeling for Text-independent Speaker Recognition.
CoRR, 2022

Impact of Naturalistic Field Acoustic Environments on Forensic Text-independent Speaker Verification System.
CoRR, 2022

Deep Spoken Keyword Spotting: An Overview.
IEEE Access, 2022

Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

DEFORMER: Coupling Deformed Localized Patterns with Global Context for Robust End-to-end Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Multi-Frequency Information Enhanced Channel Attention Module for Speaker Representation Learning.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Speech Modification for Intelligibility in Cochlear Implant Listeners: Individual Effects of Vowel- and Consonant-Boosting.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Complex-Valued Time-Frequency Self-Attention for Speech Dereverberation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Audio Anti-spoofing Using Simple Attention Module and Joint Optimization Based on Additive Angular Margin Loss and Meta-learning.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Challenges remain in Building ASR for Spontaneous Preschool Children Speech in Naturalistic Educational Environments.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

FeaRLESS: Feature Refinement Loss for Ensembling Self-Supervised Learning Features in Robust End-to-end Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Speaker Trait Enhancement for Cochlear Implant Users: A Case Study for Speaker Emotion Perception.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Challenges in Metadata Creation for Massive Naturalistic Team-Based Audio Data.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Bimodal Cochlear Implant Processing based on Assisted Hearing algorithms with CCi-MOBILE: an open-source research platform.
Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2022

2021
Block-Based High Performance CNN Architectures for Frame-Level Overlapping Speech Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Analysis and Calibration of Lombard Effect and Whisper for Speaker Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Guided Generative Adversarial Neural Network for Representation Learning and Audio Generation Using Fewer Labelled Audio Data.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Curriculum Learning based approaches for robust end-to-end far-field speech recognition.
Speech Commun., 2021

Nonlinear waveform distortion: Assessment and detection of clipping on speech data and systems.
Speech Commun., 2021

An investigation of domain adaptation in speaker embedding space for speaker recognition.
Speech Commun., 2021

Challenges in real-time-embedded IoT Command Recognition.
Proceedings of the 7th IEEE World Forum on Internet of Things, 2021

Development of CNN-Based Cochlear Implant and Normal Hearing Sound Recognition Models Using Natural and Auralized Environmental Audio.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Real-Time Speaker Counting in a Cocktail Party Scenario Using Attention-Guided Convolutional Neural Network.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Fearless Steps Challenge Phase-3 (FSC P3): Advancing SLT for Unseen Channel and Mission Data Across NASA Apollo Audio.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Measuring Frequency of Child-directed WH-Question Words for Alternate Preschool Locations using Speech Recognition and Location Tracking Technologies.
Proceedings of the ICMI '21 Companion: Companion Publication of the 2021 International Conference on Multimodal Interaction, Montreal, QC, Canada, October 18, 2021

DEAAN: Disentangled Embedding and Adversarial Adaptation Network for Robust Speaker Representation Learning.
Proceedings of the IEEE International Conference on Acoustics, 2021

Speaker Conditioning of Acoustic Models Using Affine Transformation for Multi-Speaker Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Scenario Aware Speech Recognition: Advancements for Apollo Fearless Steps & CHiME-4 Corpora.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Respiratory Distress Detection from Telephone Speech using Acoustic and Prosodic Features.
CoRR, 2020

Assessing Child Communication Engagement via Speech Recognition in Naturalistic Active Learning Spaces.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Sensor Fusion of Camera and Cloud Digital Twin Information for Intelligent Vehicles.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2020

Speaker Representation Learning Using Global Context Guided Channel and Time-Frequency Transformations.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Cross-Domain Adaptation with Discrepancy Minimization for Text-Independent Forensic Speaker Verification.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Open-Set Short Utterance Forensic Speaker Verification Using Teacher-Student Network with Explicit Inductive Bias.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

SkipConvNet: Skip Convolutional Neural Network for Speech Dereverberation Using Optimally Smoothed Spectral Mapping.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

FEARLESS STEPS Challenge (FS-2): Supervised Learning with Massive Naturalistic Apollo Data.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Mobile-Assisted Prosody Training for Limited English Proficiency: Learner Background and Speech Learning Pattern.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Effect of Spectral Complexity Reduction and Number of Instruments on Musical Enjoyment with Cochlear Implants.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Frame-Based Overlapping Speech Detection Using Convolutional Neural Networks.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

A multi-view approach for Mandarin non-native mispronunciation verification.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Portable Smart-Space Research Interface to Predetermine Environment Acoustics for Cochlear implant and Hearing aid users with CCi-MOBILE.
Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2020

2019
Multi-domain adversarial training of neural network acoustic models for distant speech recognition.
Speech Commun., 2019

Speech and language processing for assessing child-adult interaction based on diarization and location.
Int. J. Speech Technol., 2019

A Unified Framework for Speech Separation.
CoRR, 2019

I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences.
CoRR, 2019

Exploring OpenStreetMap Availability for Driving Environment Understanding.
CoRR, 2019

Tagging child-adult interactions in naturalistic, noisy, daylong school environments using i-vector based diarization system.
Proceedings of the 8th ISCA International Workshop on Speech and Language Technology in Education, 2019

Risky Action Recognition in Lane Change Video Clips using Deep Spatiotemporal Networks with Segmentation Mask Transfer.
Proceedings of the 2019 IEEE Intelligent Transportation Systems Conference, 2019

Towards Complexity Level Classification of Driving Scenarios Using Environmental Information.
Proceedings of the 2019 IEEE Intelligent Transportation Systems Conference, 2019

Probabilistic Permutation Invariant Training for Speech Separation.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Adversarial Regularization for End-to-End Robust Speaker Verification.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Convolutional Neural Network-Based Speech Enhancement for Cochlear Implant Recipients.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Quantifying Cochlear Implant Users' Ability for Speaker Identification Using CI Auditory Stimuli.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

The 2019 Inaugural Fearless Steps Challenge: A Giant Leap for Naturalistic Audio.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Toeplitz Inverse Covariance Based Robust Speaker Clustering for Naturalistic Audio Streams.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

A Machine Learning Based Clustering Protocol for Determining Hearing Aid Initial Configurations from Pure-Tone Audiograms.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Semi-supervised Learning with Generative Adversarial Networks for Arabic Dialect Identification.
Proceedings of the IEEE International Conference on Acoustics, 2019

UTD-CRSS Systems for 2018 NIST Speaker Recognition Evaluation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Cross-lingual Text-independent Speaker Verification Using Unsupervised Adversarial Discriminative Domain Adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Transfer Learning Using Raw Waveform Sincnet for Robust Speaker Diarization.
Proceedings of the IEEE International Conference on Acoustics, 2019

CCi-MOBILE: Design and Evaluation of a Cochlear Implant and Hearing Aid Research Platform for Speech Scientists and Engineers.
Proceedings of the 2019 IEEE EMBS International Conference on Biomedical & Health Informatics, 2019

Analyzing Large Receptive Field Convolutional Networks for Distant Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Domain Expansion in DNN-Based Acoustic Models for Robust Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Exploring the Intersection Between Speaker Verification and Emotion Recognition.
Proceedings of the 8th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2019

2018
Text-Independent Speaker Verification Based on Triplet Convolutional Neural Network Embeddings.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Language/Dialect Recognition Based on Unsupervised Deep Learning.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Curriculum Learning Based Approaches for Noise Robust Speaker Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Maximum-Likelihood Linear Transformation for Unsupervised Domain Adaptation in Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Leveraging Frequency-Dependent Kernel and DIP-Based Clustering for Robust Speech Activity Detection in Naturalistic Audio Streams.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Speech Activity Detection in Naturalistic Audio Environments: Fearless Steps Apollo Corpus.
IEEE Signal Process. Lett., 2018

Modelling and compensation for language mismatch in speaker verification.
Speech Commun., 2018

On the issues of intra-speaker variability and realism in speech, speaker, and language recognition tasks.
Speech Commun., 2018

Detection and Calibration of Whisper for Speaker Recognition.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Advancing Multi-Accented LSTM-CTC Speech Recognition Using a Domain Specific Student-Teacher Learning Paradigm.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

An Analysis of Transfer Learning for Domain Mismatched Text-independent Speaker Verification.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Convolutional Neural Network Based Speaker De-Identification.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Assessing Speaker Engagement in 2-Person Debates: Overlap Detection in United States Presidential Debates.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Speaker Recognition with Nonlinear Distortion: Clipping Analysis and Impact.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Testing Paradigms for Assistive Hearing Devices in Diverse Acoustic Environments.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Fusing Text-dependent Word-level i-Vector Models to Screen 'at Risk' Child Speech.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Fearless Steps: Apollo-11 Corpus Advancements for Speech Technologies from Earth to the Moon.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Leveraging Native Language Information for Improved Accented Speech Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Robust Speaker Clustering using Mixtures of von Mises-Fisher Distributions for Naturalistic Audio Streams.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Compensation for Domain Mismatch in Text-independent Speaker Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Robust Feature Clustering for Unsupervised Speech Activity Detection.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Automatic Screening to Detect 'At Risk' Child Speech Samples using a Clinical Group Verification framework.
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018

2017
Lane-Change Detection From Steering Signal Using Spectral Segmentation and Learning-Based Classification.
IEEE Trans. Intell. Veh., 2017

Active Learning Based Constrained Clustering For Speaker Diarization.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Single Sideband Frequency Offset Estimation and Correction for Quality Enhancement and Speaker Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Robust Harmonic Features for Classification-Based Pitch Estimation.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Teager-Kaiser Energy Operators for Overlapped Speech Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Automatic Sentiment Detection in Naturalistic Audio.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Signal Processing for Smart Vehicle Technologies: Part 2 [From the Guest Editors].
IEEE Signal Process. Mag., 2017

Driver Modeling for Detection and Assessment of Driver Distraction: Examples from the UTDrive Test Bed.
IEEE Signal Process. Mag., 2017

An Investigation of Deep-Learning Frameworks for Speaker Verification Antispoofing.
IEEE J. Sel. Top. Signal Process., 2017

Phoneme class based feature adaptation for mismatch acoustic modeling and recognition of distant noisy speech.
Int. J. Speech Technol., 2017

Deep neural network training for whispered speech recognition using small databases and generative model sampling.
Int. J. Speech Technol., 2017

Using speech technology for quantifying behavioral characteristics in peer-led team learning sessions.
Comput. Speech Lang., 2017

Assessment and classification of singing quality based on audio-visual features.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Navigation-orientated natural spoken language understanding for intelligent vehicle dialogue.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2017

Intent detection and semantic parsing for navigation dialogue language processing.
Proceedings of the 20th IEEE International Conference on Intelligent Transportation Systems, 2017

Dialect Recognition Based on Unsupervised Bottleneck Features.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

UTD-CRSS Systems for 2016 NIST Speaker Recognition Evaluation.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Speech Enhancement Based on Harmonic Estimation Combined with MMSE to Improve Speech Intelligibility for Cochlear Implant Recipients.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Curriculum Learning Based Probabilistic Linear Discriminant Analysis for Noise Robust Speaker Recognition.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Improved Gender Independent Speaker Recognition Using Convolutional Neural Network Based Bottleneck Features.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Locally Weighted Linear Discriminant Analysis for Robust Speaker Verification.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

On Multi-Domain Training and Adaptation of End-to-End RNN Acoustic Models for Distant Speech Recognition.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017


Speech Detection and Enhancement Using Single Microphone for Distant Speech Applications in Reverberant Environments.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Multi-Channel Apollo Mission Speech Transcripts Calibration.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Acoustic Scene Classification Using a CNN-SuperVector System Trained with Auditory and Spectrogram Image Features.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

A study of speaker verification performance with expressive speech.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Environment aware speaker diarization for moving targets using parallel DNN-based recognizers.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

i-Vector/PLDA speaker recognition using support vectors with discriminant analysis.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

UTD-CRSS submission for MGB-3 Arabic dialect identification: Front-end and back-end advancements on broadcast speech.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Robust Features in Deep-Learning-Based Speech Recognition.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning, 2017

2016
A Generalized Nonnegative Tensor Factorization Approach for Distant Speech Recognition With Distributed Microphones.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Score-Aging Calibration for Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Generative Modeling of Pseudo-Whisper for Robust Whispered Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Conversational In-Vehicle Dialog Systems: The past, present, and future.
IEEE Signal Process. Mag., 2016

Signal Processing for Smart Vehicle Technologies [From the Guest Editors].
IEEE Signal Process. Mag., 2016

Microphone Array Processing Strategies for Distant-Based Automatic Speech Recognition.
IEEE Signal Process. Lett., 2016

Effective word count estimation for long duration daily naturalistic audio recordings.
Speech Commun., 2016

Unsupervised accent classification for deep data fusion of accent and language information.
Speech Commun., 2016

KU-ISPL Language Recognition System for NIST 2015 i-Vector Machine Learning Challenge.
CoRR, 2016

Automatic measurement and analysis of the child verbal communication using classroom acoustics within a child care center.
Proceedings of the 5th Workshop on Child Computer Interaction, 2016

Employing speech and location information for automatic assessment of child language environments.
Proceedings of the First International Workshop on Sensing, 2016

Unsupervised k-means clustering based out-of-set candidate selection for robust open-set language recognition.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Speaker independent diarization for child language environment analysis using deep neural networks.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Evaluation and calibration of Lombard effects in speaker verification.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

A robust diarization system for measuring dominance in Peer-Led Team Learning groups.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Between-Class Covariance Correction For Linear Discriminant Analysis in Language Recognition.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

A discriminative unsupervised method for speaker recognition using deep learning.
Proceedings of the 26th IEEE International Workshop on Machine Learning for Signal Processing, 2016

Improving speech recognition using limited accent diverse British English training data with deep neural networks.
Proceedings of the 26th IEEE International Workshop on Machine Learning for Signal Processing, 2016

Unsupervised driving performance assessment using free-positioned smartphones in vehicles.
Proceedings of the 19th IEEE International Conference on Intelligent Transportation Systems, 2016

Text-Available Speaker Recognition System for Forensic Applications.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Improving Boundary Estimation in Audiovisual Speech Activity Detection Using Bayesian Information Criterion.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Discussion.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Fusion Strategies for Robust Speech Recognition and Keyword Spotting for Channel- and Noise-Degraded Speech.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Robustness in Speech, Speaker, and Language Recognition: "You've Got to Know Your Limitations".
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

A Speaker Diarization System for Studying Peer-Led Team Learning Groups.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Generalized Discriminant Analysis (GDA) for Improved i-Vector Based Speaker Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Joint information from nonlinear and linear features for spoofing detection: An i-vector/DNN based approach.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

UTD-CRSS system for the NIST 2015 language recognition i-vector machine learning challenge.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

F0 estimation for noisy speech by exploring temporal harmonic structures in local time frequency spectrum segment.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Language recognition using deep neural networks with very limited training data.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Toward Access to Multi-Perspective Archival Spoken Word Content.
Proceedings of the Digital Libraries: Knowledge, Information, and Data in an Open Access Society, 2016

2015
Howling Detection in Hearing Aids Based on Generalized Teager-Kaiser Operator.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Speaker Recognition by Machines and Humans: A tutorial review.
IEEE Signal Process. Mag., 2015

A Hybrid Coherence Model for Noise Reduction in Reverberant Environments.
IEEE Signal Process. Lett., 2015

An advanced entropy-based feature with a frame-level vocal effort likelihood space modeling for distant whisper-island detection.
Speech Commun., 2015

Mean Hilbert envelope coefficients (MHEC) for robust speaker and language identification.
Speech Commun., 2015

Advanced parallel combined Gaussian mixture model based feature compensation integrated with iterative channel estimation.
Speech Commun., 2015

Automatic analysis of dialect/language sets.
Int. J. Speech Technol., 2015

Physical task stress and speaker variability in voice quality.
EURASIP J. Audio Speech Music. Process., 2015

F0 estimation for noisy speech based on exploring local time-frequency segment.
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015

In-vehicle speech recognition and tutorial keywords spotting for novice drivers' performance evaluation.
Proceedings of the 2015 IEEE Intelligent Vehicles Symposium, 2015

I-vector based physical task stress detection with different fusion strategies.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Robust i-vector extraction for neural network adaptation in noisy environment.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Frequency offset correction in single sideband (SSB) speech by deep neural network for speaker verification.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

An unsupervised visual-only voice activity detection approach using temporal orofacial features.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Probabilistic linear discriminant analysis for robust speaker identification in co-channel speech.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

A new front-end for classification of non-speech sounds: a study on human whistle.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Anti-spoofing system: an investigation of measures to detect synthetic and human speech.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

A study on deep neural network acoustic model adaptation for robust far-field speech recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Evaluation and calibration of short-term aging effects in speaker verification.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Automatic audio sentiment extraction using keyword spotting.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Laughter and filler detection in naturalistic audio.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Prof-Life-Log: Analysis and classification of activities in daily audio streams.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Leveraging automatic speech recognition in cochlear implants for improved speech intelligibility under reverberation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Robust overlapped speech detection and its application in word-count estimation for Prof-Life-Log data.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Weighted training for speech under Lombard Effect for speaker recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Robust unsupervised detection of human screams in noisy acoustic environments.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Analysis of speech and language communication for cochlear implant users in noisy Lombard conditions.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Generative modeling of pseudo-target domain adaptation samples for whispered speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Image-guided customization of frequency-place mapping in cochlear implants.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Studying the relationship between physical and language environments of children: Who's speaking to whom and where?
Proceedings of the IEEE Signal Processing and Signal Processing Education Workshop, 2015

Developing an educational electro-mechanical model of the middle ear and impulse noise reduction algorithm for cochlear implant users.
Proceedings of the IEEE Signal Processing and Signal Processing Education Workshop, 2015

An i-Vector PLDA based gender identification approach for severely distorted and multilingual DARPA RATS data.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Blind Spectral Weighting for Robust Speaker Identification under Reverberation Mismatch.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

An investigation into back-end advancements for speaker recognition in multi-session and noisy enrollment scenarios.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Maximum Likelihood Acoustic Factor Analysis Models for Robust Speaker Verification in Noise.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

A coherence-based noise reduction algorithm for binaural hearing aids.
Speech Commun., 2014

Car noise verification and applications.
Int. J. Speech Technol., 2014

Effective background data selection for SVM-based speaker recognition with unseen test environments: more is not always better.
Int. J. Speech Technol., 2014

Environment mismatch compensation using average eigenspace-based methods for robust speech recognition.
Int. J. Speech Technol., 2014

Automatic assessment of language background in toddlers through phonotactic and pitch pattern modeling of short vocalizations.
Proceedings of the 4th Workshop on Child, Computer and Interaction, 2014

Training candidate selection for effective rejection in open-set language identification.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Spoken language mismatch in speaker verification: An investigation with NIST-SRE and CRSS Bi-Ling corpora.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Multichannel feature enhancement in distributed microphone arrays for robust distant speech recognition in smart rooms.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Utilization of unlabeled development data for speaker verification.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Robust Language Recognition Based on Diverse Features.
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014

Investigating State-of-the-Art Speaker Verification in the case of Unlabeled Development Data.
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014

Supra-Segmental Feature Based Speaker Trait Detection.
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014

Threshold based decision-tree for automatic driving maneuver recognition using CAN-Bus signal.
Proceedings of the 17th International IEEE Conference on Intelligent Transportation Systems, 2014

A speech system for estimating daily word counts.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Speech activity detection for NASA Apollo space missions: challenges and solutions.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Acoustic feature transformation using UBM-based LDA for speaker recognition.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

'Houston, we have a solution': a case study of the analysis of astronaut speech during NASA Apollo 11 for long-term speaker modeling.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

F0 estimation in noisy speech based on long-term harmonic feature analysis combined with neural network classification.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Noisy speech enhancement based on long term harmonic model to improve speech intelligibility for hearing impaired listeners.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Investigation of the relative perceptual importance of temporal envelope and temporal fine structure between tonal and non-tonal languages.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Co-channel speech detection via spectral analysis of frequency modulated sub-bands.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Analysis and identification of human scream: implications for speaker recognition.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Multichannel speech dereverberation based on convolutive nonnegative tensor factorization for ASR applications.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Model and feature based compensation for whispered speech recognition.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Uncertainty propagation in front end factor analysis for noise robust speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

Robust and efficient environment detection for adaptive speech enhancement in cochlear implants.
Proceedings of the IEEE International Conference on Acoustics, 2014

Frequency offset correction in single sideband speech for speaker verification.
Proceedings of the IEEE International Conference on Acoustics, 2014

UT-Vocal Effort II: Analysis and constrained-lexicon recognition of whispered speech.
Proceedings of the IEEE International Conference on Acoustics, 2014

Improving channel selection of sound coding algorithms in cochlear implants.
Proceedings of the IEEE International Conference on Acoustics, 2014

Effects of Multitasking on Drivability Through CAN-Bus Analysis.
Proceedings of the Smart Mobile In-Vehicle Systems, Next Generation Advancements, 2014

2013
Automatic Accent Assessment Using Phonetic Mismatch and Human Perception.
IEEE Trans. Speech Audio Process., 2013

Acoustic Factor Analysis for Robust Speaker Verification.
IEEE Trans. Speech Audio Process., 2013

Unsupervised Speech Activity Detection Using Voicing Measures and Perceptual Spectral Flux.
IEEE Signal Process. Lett., 2013

Singing speaker clustering based on subspace learning in the GMM mean supervector space.
Speech Commun., 2013

In-set/out-of-set speaker recognition in sustained acoustic scenarios using sparse data.
Speech Commun., 2013

Acoustic analysis and feature transformation from neutral to whisper for speaker identification within whispered speech audio streams.
Speech Commun., 2013

Environment dependent noise tracking for speech enhancement.
Int. J. Speech Technol., 2013

Multi-modal highlight generation for sports videos using an information-theoretic excitability measure.
EURASIP J. Adv. Signal Process., 2013

Compensation of SNR and noise type mismatch using an environmental sniffing based speech recognition solution.
EURASIP J. Audio Speech Music. Process., 2013

Linking transcribed conversational speech.
Proceedings of the 36th International ACM SIGIR conference on research and development in Information Retrieval, 2013

Belt Up: Investigating the impact of in-vehicular conversation on driving performance.
Proceedings of the 2013 IEEE Intelligent Vehicles Symposium (IV), 2013

'Houston, we have a solution': using NASA Apollo program to advance speech and language processing technology.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Dimensionality analysis of singing speech based on locality preserving projections.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Automatic regularization of cross-entropy cost for speaker recognition fusion.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Acoustic factor analysis based universal background model for robust speaker verification in noise.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

All for one: feature combination for highly channel-degraded speech activity detection.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Impact of noise reduction and spectrum estimation on noise robust speaker identification.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

A preliminary study of child vocalization on a parallel corpus of US and Shanghainese toddlers.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Prof-Life-Log: Personal interaction analysis for naturalistic audio streams.
Proceedings of the IEEE International Conference on Acoustics, 2013

Supervector pre-processing for PRSVM-based Chinese and Arabic dialect identification.
Proceedings of the IEEE International Conference on Acoustics, 2013

A new mask-based objective measure for predicting the intelligibility of binary masked speech.
Proceedings of the IEEE International Conference on Acoustics, 2013

Speaker height estimation combining GMM and linear regression subsystems.
Proceedings of the IEEE International Conference on Acoustics, 2013

Overlapped-speech detection with applications to driver assessment for in-vehicle active safety systems.
Proceedings of the IEEE International Conference on Acoustics, 2013

Robust front-end processing for speaker identification over extremely degraded communication channels.
Proceedings of the IEEE International Conference on Acoustics, 2013

An investigation on back-end for speaker recognition in multi-session enrollment.
Proceedings of the IEEE International Conference on Acoustics, 2013

An advanced feature compensation method employing acoustic model with phonetically constrained structure.
Proceedings of the IEEE International Conference on Acoustics, 2013

Sentiment extraction from natural audio streams.
Proceedings of the IEEE International Conference on Acoustics, 2013

CRSS systems for 2012 NIST Speaker Recognition Evaluation.
Proceedings of the IEEE International Conference on Acoustics, 2013

Duration mismatch compensation for i-vector based speaker recognition systems.
Proceedings of the IEEE International Conference on Acoustics, 2013

Automatic sentiment extraction from YouTube videos.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Phoneme Selective Speech Enhancement Using Parametric Estimators and the Mixture Maximum Model: A Unifying Approach.
IEEE Trans. Speech Audio Process., 2012

Constrained Iterative Speech Enhancement Using Phonetic Classes.
IEEE Trans. Speech Audio Process., 2012

Trends in Speech and Language Processing [In the Spotlight].
IEEE Signal Process. Mag., 2012

Automatic analysis of Mandarin accented English using phonological features.
Speech Commun., 2012

TEO-based speaker stress assessment using hybrid classification and tracking schemes.
Int. J. Speech Technol., 2012

Identifying impact factors of language development in young children's natural home environment.
Proceedings of the Third Workshop on Child, Computer and Interaction, 2012

A linguistic data acquisition front-end for language recognition evaluation.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Factor analysis of acoustic features using a mixture of probabilistic principal component analyzers for robust speaker verification.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Leveraging sensor information from portable devices towards automatic driving maneuver recognition.
Proceedings of the 15th International IEEE Conference on Intelligent Transportation Systems, 2012

Prof-Life-Log: Audio Environment Detection for Naturalistic Audio Streams.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Phoneme Class Based Adaptation for Mismatch Acoustic Modeling of Distant Noisy Speech.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Mean Hilbert Envelope Coefficients (MHEC) for Robust Speaker Recognition.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Speaker Clustering for a Mixture of Singing and Reading.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Gaussian Map based Acoustic Model Adaptation Using Untranscribed Data for Speech Recognition in Severely Adverse Environments.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Integrated Feature Normalization and Enhancement for robust Speaker Recognition using Acoustic Factor Analysis.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Front-end Channel Compensation using Mixture-dependent Feature Transformations for i-Vector Speaker Recognition.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Glottal Waveform Analysis of Physical Task Stress Speech.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Arabic Dialect Identification - 'Is the Secret in the Silence?' and Other Observations.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

ProfLifeLog: Environmental analysis and keyword recognition for naturalistic daily audio streams.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Blind reverberation mitigation for robust speaker identification.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

A comparison of front-end compensation strategies for robust LVCSR under room reverberation and increased vocal effort.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

A fast speaker verification with universal background support data selection.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Robust feature front-end for speaker identification.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Feature compensation employing online GMM adaptation for speech recognition in unknown severely adverse environments.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

A multi-modal highlight extraction scheme for sports videos using an information-theoretic excitability measure.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Improved parcel sorting by combining automatic speech and character recognition.
Proceedings of the 2012 IEEE International Conference on Emerging Signal Processing Applications, 2012

Leveraging speech-active regions towards active safety in vehicles.
Proceedings of the 2012 IEEE International Conference on Emerging Signal Processing Applications, 2012

2011
International Large-Scale Vehicle Corpora for Research on Driver Behavior on the Road.
IEEE Trans. Intell. Transp. Syst., 2011

Whisper-Island Detection Based on Unsupervised Segmentation With Entropy-Based Speech Feature Processing.
IEEE ACM Trans. Audio Speech Lang. Process., 2011

Dialect Classification via Text-Independent Training and Testing for Arabic, Spanish, and Chinese.
IEEE Trans. Speech Audio Process., 2011

A Novel Mask Estimation Method Employing Posterior-Based Representative Mean Estimate for Missing-Feature Speech Recognition.
IEEE Trans. Speech Audio Process., 2011

A Study on Universal Background Model Training in Speaker Verification.
IEEE Trans. Speech Audio Process., 2011

Speaker Identification Within Whispered Speech Audio Streams.
IEEE Trans. Speech Audio Process., 2011

Mismatch modeling and compensation for robust speaker verification.
Speech Commun., 2011

Variational noise model composition through model perturbation for robust speech recognition with time-varying background noise.
Speech Commun., 2011

Information fusion for robust 'context and driver aware' active vehicle safety systems.
Inf. Fusion, 2011

Robust Emotional Stressed Speech Detection Using Weighted Frequency Subbands.
EURASIP J. Adv. Signal Process., 2011

Frame-Level Vocal Effort Likelihood Space Modeling for Improved Whisper-Island Detection.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Using Human Perception for Automatic Accent Assessment.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Phone Impact Based Speech Transmission Technique for Reliable Speech Recognition in Poor Wireless Network Conditions.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Detecting Sleepiness by Fusing Classifiers Trained with Novel Acoustic Features.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Feature Compensation for Speech Recognition in Severely Adverse Environments Due to Background Noise and Channel Distortion.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Robust Speaker Recognition in Non-Stationary Room Environments Based on Empirical Mode Decomposition.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Vowel Context and Speaker Interactions Influencing Glottal Open Quotient and Formant Frequency Shifts in Physical Task Stress.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Speaker Identification for Whispered Speech Using a Training Feature Transformation from Neutral to Whisper.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Acoustic Analysis of Whispered Speech for Phoneme and Speaker Dependency.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Front-End Compensation Methods for LVCSR Under Lombard Effect.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Relative proportionate NLMS: Improving convergence for acoustic channel identification.
Proceedings of the IEEE International Conference on Acoustics, 2011

Effective background data selection in SVM speaker recognition for unseen test environment: More is not always better.
Proceedings of the IEEE International Conference on Acoustics, 2011

Language identification using a combined articulatory prosody framework.
Proceedings of the IEEE International Conference on Acoustics, 2011

Hilbert envelope based features for robust speaker identification under reverberant mismatched conditions.
Proceedings of the IEEE International Conference on Acoustics, 2011

Language identification for singing.
Proceedings of the IEEE International Conference on Acoustics, 2011

Phoneme selective speech enhancement using the generalized parametric spectral subtraction estimator.
Proceedings of the IEEE International Conference on Acoustics, 2011

UT-Scope: Towards LVCSR under Lombard effect induced by varying types and levels of noisy background.
Proceedings of the IEEE International Conference on Acoustics, 2011

Spatial, spectral and temporal adaptation for fast fading MIMO-OFDMA systems.
Workshops Proceedings of the Global Communications Conference, 2011

A systematic strategy for robust automatic dialect identification.
Proceedings of the 19th European Signal Processing Conference, 2011

Audio-visual isolated digit recognition for whispered speech.
Proceedings of the 19th European Signal Processing Conference, 2011

2010
Missing-Feature Reconstruction by Leveraging Temporal Spectral Correlation for Robust Speech Recognition in Background Noise Conditions.
IEEE Trans. Speech Audio Process., 2010

Unsupervised Equalization of Lombard Effect for Speech Recognition in Noisy Adverse Environments.
IEEE Trans. Speech Audio Process., 2010

Spoken Proper Name Retrieval for Limited Resource Languages Using Multilingual Hybrid Representations.
IEEE Trans. Speech Audio Process., 2010

Discriminative Training for Multiple Observation Likelihood Ratio Based Voice Activity Detection.
IEEE Signal Process. Lett., 2010

Phonetic Distance Based Confidence Measure.
IEEE Signal Process. Lett., 2010

The physiological microphone (PMIC): A competitive alternative for speaker assessment in stress detection and speaker verification.
Speech Commun., 2010

Analysis of CFA-BF: Novel combined fixed/adaptive beamforming for robust speech recognition in real car environments.
Speech Commun., 2010

Automatic voice onset time detection for unvoiced stops (/p/, /t/, /k/) with application to accent classification.
Speech Commun., 2010

Automatic Beamforming for Blind Extraction of Speech From Music Environment Using Variance of Spectral Flux-Inspired Criterion.
IEEE J. Sel. Top. Signal Process., 2010

The "UTDrive" in-vehicle voice activity detection system.
Proceedings of the IEEE Intelligent Vehicles Symposium (IV), 2010

Driver adaptive and context aware active safety systems using CAN-bus signals.
Proceedings of the IEEE Intelligent Vehicles Symposium (IV), 2010

A Bayesian approach to voice activity detection using multiple statistical models and discriminative training.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Quality conversion of non-acoustic signals for facilitating human-to-human speech communication under harsh acoustic conditions.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Assessment of single-channel speech enhancement techniques for speaker identification under mismatched conditions.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

A novel feature extraction strategy for multi-stream robust emotion identification.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Speaker recognition using supervised probabilistic principal component analysis.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

An effective feature compensation scheme tightly matched with speech recognizer employing SVM-based GMM generation.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Session variability contrasts in the MARP corpus.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Analysis and detection of cognitive load and frustration in drivers' speech.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Automatic excitement-level detection for sports highlights generation.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Advancements in whisper-island detection using the linear predictive residual.
Proceedings of the IEEE International Conference on Acoustics, 2010

An efficient microphone array based voice activity detector for driver's speech in noise and music rich in-vehicle environments.
Proceedings of the IEEE International Conference on Acoustics, 2010

Automatic language analysis and identification based on speech production knowledge.
Proceedings of the IEEE International Conference on Acoustics, 2010

Towards more intelligible physiological microphone speech: A probabilistic transformation approach.
Proceedings of the IEEE International Conference on Acoustics, 2010

Speech under physical stress: A production-based framework.
Proceedings of the IEEE International Conference on Acoustics, 2010

Dialect distance assessment method based on comparison of pitch pattern statistical models.
Proceedings of the IEEE International Conference on Acoustics, 2010

A kernel mean matching approach for environment mismatch compensation in speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

Angry emotion detection from real-life conversational speech by leveraging content structure.
Proceedings of the IEEE International Conference on Acoustics, 2010

A novel feature sub-sampling method for efficient universal background model training in speaker verification.
Proceedings of the IEEE International Conference on Acoustics, 2010

Acoustic analysis for speaker identification of whispered speech.
Proceedings of the IEEE International Conference on Acoustics, 2010

Broad phoneme class based speech enhancement using mixture maximum model.
Proceedings of the IEEE International Conference on Acoustics, 2010

Limited resource speech recognition for Nigerian English.
Proceedings of the IEEE International Conference on Acoustics, 2010

Test token driven acoustic balancing for sparse enrollment data in cohort GMM speaker recognition.
Proceedings of the 18th European Signal Processing Conference, 2010

A scanning window scheme based on SVM training error rate for unsupervised audio segmentation.
Proceedings of the 18th European Signal Processing Conference, 2010

Dialect identification: Impact of differences between read versus spontaneous speech.
Proceedings of the 18th European Signal Processing Conference, 2010

2009
Speechfind.
Proceedings of the Handbook of Research on Digital Libraries: Design, 2009

Feature Compensation Techniques for ASR on Band-Limited Speech.
IEEE Trans. Speech Audio Process., 2009

Babble Noise: Modeling, Analysis, and Applications.
IEEE Trans. Speech Audio Process., 2009

Time-Frequency Correlation-Based Missing-Feature Reconstruction for Robust Speech Recognition in Band-Restricted Conditions.
IEEE Trans. Speech Audio Process., 2009

Analysis and Compensation of Lombard Speech Across Noise Type and Levels With Application to In-Set/Out-of-Set Speaker Recognition.
IEEE Trans. Speech Audio Process., 2009

Feature compensation in the cepstral domain employing model combination.
Speech Commun., 2009

Preliminary study of stress/neutral detection on recordings of children in the natural home environment.
Proceedings of the Second Workshop on Child, Computer and Interaction, 2009

Automatic childhood autism detection by vocalization decomposition with phone-like units.
Proceedings of the Second Workshop on Child, Computer and Interaction, 2009

Assessing the stress/neutral speech environment in adult/child interactions for applications in child language development.
Proceedings of the Second Workshop on Child, Computer and Interaction, 2009

Advancements in whisper-island detection within normally phonated audio streams.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Robust minimal variance distortionless speech power spectra enhancement using order statistic filter for microphone array.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

On the use of phonological features for automatic accent analysis.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

The role of age in factor analysis for speaker identification.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Robust angry speech detection employing a TEO-based discriminative classifier combination.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Variational model composition for robust speech recognition with time-varying background noise.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Speaker identification for whispered speech using modified temporal patterns and MFCCs.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Speech enhancement minimizing generalized Euclidean distortion using supergaussian priors.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Reduced complexity equalization of lombard effect for speech recognition in noisy adverse environments.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Enhancing in-vehicle safety via contact sensor for stress detection.
Proceedings of the IEEE International Conference on Vehicular Electronics and Safety, 2009

Assessment of speech dialog systems using multi-modal cognitive load analysis and driving performance metrics.
Proceedings of the IEEE International Conference on Vehicular Electronics and Safety, 2009

A speech presence microphone array beamformer using model based speech presence probability estimation.
Proceedings of the IEEE International Conference on Acoustics, 2009

Factor analysis-based information integration for Arabic dialect identification.
Proceedings of the IEEE International Conference on Acoustics, 2009

Speaker identification with whispered speech based on modified LFCC parameters and feature mapping.
Proceedings of the IEEE International Conference on Acoustics, 2009

Unsupervised equalization of Lombard effect for speech recognition in noisy adverse environment.
Proceedings of the IEEE International Conference on Acoustics, 2009

Leveraging speech production knowledge for improved speech recognition.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Mask estimation employing Posterior-based Representative Mean for missing-feature speech recognition with time-varying background noise.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
A new perceptually motivated MVDR-based acoustic front-end (PMVDR) for robust automatic speech recognition.
Speech Commun., 2008

Feature Compensation Employing Multiple Environmental Models for Robust In-Vehicle Speech Recognition.
IEICE Trans. Inf. Syst., 2008

Towards an Intelligent Acoustic Front End for Automatic Speech Recognition: Built-in Speaker Normalization.
EURASIP J. Audio Speech Music. Process., 2008

Intelligent Audio, Speech, and Music Processing Applications.
EURASIP J. Audio Speech Music. Process., 2008

Signal processing for young child speech language development.
Proceedings of the First Workshop on Child, Computer and Interaction, 2008

An entropy based feature for whisper-island detection within audio streams.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Filling acoustic holes through leveraged uncorrelated GMMs for in-set/out-of-set speaker recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Evidence of coarticulation in a phonological feature detection system.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Detection of speech under physical stress: model development, sensor selection, and feature fusion.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Dialect separation assessment using log-likelihood score distributions.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Dialect classification via discriminative training.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Environment mismatch compensation using average eigenspace for speech recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Babble speech: acoustic and perceptual variability.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Missing-feature method for speaker recognition in band-restricted conditions.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Analysis and perception of speech under physical task stress.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Speaker identification for whispered speech based on frequency warping and score competition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Generalized parametric spectral subtraction using weighted Euclidean distortion.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Body sensor networks for driver distraction identification.
Proceedings of the IEEE International Conference on Vehicular Electronics and Safety, 2008

Driver behavior analysis and route recognition by Hidden Markov Models.
Proceedings of the IEEE International Conference on Vehicular Electronics and Safety, 2008

Active accident avoidance case study: Integrating drowsiness monitoring system with lateral control and speed regulation in passenger vehicles.
Proceedings of the IEEE International Conference on Vehicular Electronics and Safety, 2008

In-set/out-of-set speaker recognition: leveraging the speaker and noise balance.
Proceedings of the IEEE International Conference on Acoustics, 2008

Speech babble: Analysis and modeling for speech systems.
Proceedings of the IEEE International Conference on Acoustics, 2008

Dialect Classification for Online Podcasts Fusing Acoustic and Language Based Structural and Semantic Information.
Proceedings of the ACL 2008, 2008

2007
In-Set/Out-of-Set Speaker Recognition Under Sparse Enrollment.
IEEE Trans. Speech Audio Process., 2007

Dialect/Accent Classification Using Unrestricted Audio.
IEEE Trans. Speech Audio Process., 2007

Unsupervised Discriminative Training With Application to Dialect Classification.
IEEE Trans. Speech Audio Process., 2007

Discriminative In-Set/Out-of-Set Speaker Recognition.
IEEE Trans. Speech Audio Process., 2007

Environmental Sniffing: Noise Knowledge Estimation for Robust Speech Systems.
IEEE Trans. Speech Audio Process., 2007

Blind Feature Compensation for Time-Variant Band-Limited Speech Recognition.
IEEE Signal Process. Lett., 2007

The Effect of Listener Accent Background on Accent Perception and Comprehension.
EURASIP J. Audio Speech Music. Process., 2007

Speech Under Stress: Analysis, Modeling and Recognition.
Proceedings of the Speaker Classification I: Fundamentals, Features, and Methods, 2007

Multivariate Cepstral Feature Compensation on Band-limited Data for Robust Speech Recognition.
Proceedings of the 16th Nordic Conference of Computational Linguistics, 2007

Analysis and classification of speech mode: whispered through shouted.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Environmentally aware voice activity detector.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Score distribution scaling for speaker recognition.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Noise tracking for speech systems in adverse environments.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Advances in speechfind: transcript reliability estimation employing confidence measure based on discriminative sub-word model for SDR.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Lombard speech impact on perceptual speaker recognition.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Class constrained ROVER based speech enhancement.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Getting start with UTDrive: driver-behavior modeling and assessment of distraction for in-vehicle speech systems.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Dialect Classification on Printed Text using Perplexity Measure and Conditional Random Fields.
Proceedings of the IEEE International Conference on Acoustics, 2007

Language Normalization for Bilingual Speaker Recognition Systems.
Proceedings of the IEEE International Conference on Acoustics, 2007

Phonological feature based variable frame rate scheme for improved speech recognition.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

Speechfind for CDP: Advances in spoken document retrieval for the U. S. collaborative digitization program.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

Multi-stream dialect classification using SVM-GMM hybrid classifiers.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
Advances in unsupervised audio classification and segmentation for the broadcast news and NGSW corpora.
IEEE Trans. Speech Audio Process., 2006

Speech Enhancement Based on Generalized Minimum Mean Square Error Estimators and Masking Properties of the Auditory System.
IEEE Trans. Speech Audio Process., 2006

Advances in phone-based modeling for automatic accent classification.
IEEE Trans. Speech Audio Process., 2006

Analysis of lombard effect under different types and levels of noise with application to in-set speaker ID systems.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

A cohort - UBM approach to mitigate data sparseness for in-set/out-of-set speaker recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Noise update modeling for speech enhancement: when do we do enough?
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Missing-feature reconstruction for band-limited speech recognition in spoken document retrieval.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

The role of prosody in the perception of US native English accents.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Unsupervised Spanish dialect classification.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Decision directed constrained iterative speech enhancement.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

A robust fusion method for multilingual spoken document retrieval systems employing tiered resources.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Stress Level Classification of Speech Using Euclidean Distance Metrics in a Novel Hybrid Multi-Dimensional Feature Space.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Unsupervised Class-Based Feature Compensation for Time-Variable Bandwidth-Limited Speech.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Perceptual Recognition Cues in Native English Accent Variation: "Listener Accent, Perceived Accent, and Comprehension".
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Spoken Proper Name Retrieval in Audio Streams for Limited-Resource Languages Via Lattice Based Search Using Hybrid Representations.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Rapid discriminative acoustic model based on eigenspace mapping for fast speaker adaptation.
IEEE Trans. Speech Audio Process., 2005

Efficient audio stream segmentation via the combined T² statistic and Bayesian information criterion.
IEEE Trans. Speech Audio Process., 2005

SpeechFind: Advances in Spoken Document Retrieval for a National Gallery of the Spoken Word.
IEEE Trans. Speech Audio Process., 2005

An Auditory-Masking-Threshold-Based Noise Suppression Algorithm GMMSE-AMT[ERB] for Listeners with Sensorineural Hearing Loss.
EURASIP J. Adv. Signal Process., 2005

Speaker verification using Gaussian mixture models within changing real car environments.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

In-set/out-of-set speaker identification based on discriminative speech frame selection.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Improved "TEO" feature-based automatic stress detection using physiological and acoustic speech sensors.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Statistical class-based MFCC enhancement of filtered and band-limited speech for robust ASR.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Advances in word based dialect/accent classification.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Collaborative voice activity detection for hearing aids.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Towards an Intelligent Acoustic Front-End for Automatic Speech Recognition: Built-In Speaker Normalization (BISN).
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Effects of Phoneme Characteristics on TEO Feature-based Automatic Stress Detection in Speech.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

MFCC Compensation for Improved Recognition of Filtered and Band-Limited Speech.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Dialect/Accent Classification via Boosted Word Modeling.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Audio-visual speaker localization for car navigation systems.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

In-vehicle based speech processing for hearing impaired subjects.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

High-level feature weighted GMM network for audio stream classification.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Dialect analysis and modeling for automatic classification.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Cluster-dependent modeling and confidence measure processing for in-set/out-of-set speaker identification.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Speech enhancement based on a combined multi-channel array with constrained iterative and auditory masked processing.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Advances in unsupervised audio segmentation for the broadcast news and NGSW corpora.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Identifying in-set and out-of-set speakers using neighborhood information.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Language modeling structures in audio transcription for retrieval of historical speeches.
Proceedings of the 2004 12th European Signal Processing Conference, 2004

2003
CSA-BF: a constrained switched adaptive beamformer for speech enhancement and recognition in real car environments.
IEEE Trans. Speech Audio Process., 2003

Evaluation of an auditory masked threshold noise suppression algorithm in normal-hearing and hearing-impaired listeners.
Speech Commun., 2003

CFA-BF: a novel combined fixed/adaptive beamforming for robust speech recognition in real car environments.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

A new perspective on feature extraction for robust in-vehicle speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Perceptual MVDR-based cepstral coefficients (PMCCs) for high accuracy speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Frequency distribution based weighted sub-band approach for classification of emotional/stressful content in speech.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Perceptual based speech enhancement for normal-hearing and hearing-impaired individuals.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Use of trajectory models for automatic accent classification.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Environmental sniffing: robust digit recognition for an in-vehicle environment.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Discriminative acoustic model using eigenspace mapping for rapid speaker adaptation.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

CSA-BF: novel constrained switched adaptive beamforming for speech enhancement & recognition in real car environments.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
A comparison of spectral smoothing methods for segment concatenation based speech synthesis.
Speech Commun., 2002

Speechfind: an experimental on-line spoken document retrieval system for historical audio archives.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Improved structural maximum likelihood eigenspace mapping for rapid speaker adaptation.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

High performance digit recognition in real car environments.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Frequency band analysis for stress detection using a teager energy operator based feature.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Speech watermarking through parametric modeling.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Stochastic trajectory model analysis for accent classification.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Rapid speaker adaptation using multi-stream Structural Maximum Likelihood Eigenspace Mapping.
Proceedings of the IEEE International Conference on Acoustics, 2002

Application of automatic speech recognition in call classification.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Nonlinear feature based classification of speech under stress.
IEEE Trans. Speech Audio Process., 2001

Speech enhancement using a constrained iterative sinusoidal model.
IEEE Trans. Speech Audio Process., 2001

Fast likelihood computation techniques in nearest-neighbor based search for continuous speech recognition.
IEEE Signal Process. Lett., 2001

University of Colorado Dialogue Systems for Travel and Navigation.
Proceedings of the First International Conference on Human Language Technology Research, 2001

Transcript-free search of audio archives for the national gallery of the spoken word.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2001

A novel algorithm for rapid speaker adaptation based on structural maximum likelihood eigenspace mapping.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Robust digit recognition in noise: an evaluation using the AURORA corpus.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Analysis of the root-cepstrum for acoustic modeling and fast decoding in speech recognition.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Combined front-end signal processing for in-vehicle speech systems.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Robust speech recognition in noise: an evaluation using the SPINE corpus.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

"CU-move": analysis & corpus development for interactive in-vehicle speech systems.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
A comparative study of traditional and newly proposed features for recognition of speech under stress.
IEEE Trans. Speech Audio Process., 2000

High resolution speech feature parametrization for monophone-based stressed speech recognition.
IEEE Signal Process. Lett., 2000

Unsupervised audio stream segmentation and clustering via the Bayesian information criterion.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Improved Jacobian adaptation for fast acoustic model adaptation in noisy speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Speech enhancement based on a constrained sinusoidal model.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Audio stream phrase recognition for a national gallery of the spoken word: "one small step".
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

"CU-move": robust speech processing for in-vehicle speech systems.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

PCA-PMC: a novel use of a priori knowledge for fast parallel model combination.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
N-channel hidden Markov models for combined stressed speech classification and recognition.
IEEE Trans. Speech Audio Process., 1999

Selective training for hidden Markov models with applications to speech classification.
IEEE Trans. Speech Audio Process., 1999

The DSP Learning environment - modern DSP education: The Story of Three Greek Philosophers.
IEEE Signal Process. Mag., 1999

Auditory masking threshold estimation for broadband noise sources with application to speech enhancement.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Methods for stress classification: nonlinear TEO and linear speech based features.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

Speech under stress conditions: overview of the effect on speech production and on system performance.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

An experimental study of speaker verification sensitivity to computer voice-altered imposters.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998
A nonlinear operator-based speech feature analysis method with application to vocal fold pathology assessment.
IEEE Trans. Biomed. Eng., 1998

An improved (Auto: I, LSP: T) constrained iterative speech enhancement for colored noise environments.
IEEE Trans. Speech Audio Process., 1998

An auditory-based distortion measure with application to concatenative speech synthesis.
IEEE Trans. Speech Audio Process., 1998

HMM-based stressed speech modeling with application to improved synthesis and recognition of isolated speech under stress.
IEEE Trans. Speech Audio Process., 1998

Likelihood decision boundary estimation between HMM pairs in speech recognition.
IEEE Trans. Speech Audio Process., 1998

An efficient scoring algorithm for Gaussian mixture model based speaker identification.
IEEE Signal Process. Lett., 1998

Automatic segmentation of speech recorded in unknown noisy channel characteristics.
Speech Commun., 1998

Linear and nonlinear speech feature analysis for stress classification.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Robust speech activity detection in the presence of noise.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

A duration-based confidence measure for automatic segmentation of noise corrupted speech.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

An effective quality evaluation protocol for speech enhancement algorithms.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Spectral smoothing for concatenative speech synthesis.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Speech feature modeling for robust stressed speech recognition.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Classification of speech under stress based on features derived from the nonlinear Teager energy operator.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Speaker-specific pitch contour modeling and modification.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
Speech enhancement for crosstalk interference.
IEEE Signal Process. Lett., 1997

Text-directed speech enhancement employing phone class parsing and feature map constrained vector quantization.
Speech Commun., 1997

Getting started with SUSAS: a speech under simulated and actual stress database.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

A novel training approach for improving speech recognition under adverse stressful conditions.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Spectral normalization employing hidden Markov modeling of line spectrum pair frequencies.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

An auditory-based measure for improved phone segment concatenation.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Frequency characteristics of foreign accented speech.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996
Direct speech feature estimation using an iterative EM algorithm for vocal fold pathology detection.
IEEE Trans. Biomed. Eng., 1996

A noninvasive technique for detecting hypernasal speech using a nonlinear operator.
IEEE Trans. Biomed. Eng., 1996

Feature analysis and neural network-based classification of speech under stress.
IEEE Trans. Speech Audio Process., 1996

Classification of speech under stress using target driven features.
Speech Commun., 1996

Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition.
Speech Commun., 1996

Generating stressed speech from neutral speech using a modified CELP vocoder.
Speech Commun., 1996

Language accent classification in American English.
Speech Commun., 1996

A screening test for speech pathology assessment using objective quality measures.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Vocal fold pathology assessment using AM autocorrelation analysis of the teager energy operator.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Recent advances in hypernasal speech detection using the nonlinear teager energy operator.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Synthesis of stressed speech from isolated neutral speech using HMM-based models.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Improved speech recognition via speaker stress directed classification.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Text-directed speech enhancement using phoneme classification and feature map constrained vector quantization.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Improved HMM training and scoring strategies with application to accent classification.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995
Dual-channel iterative speech enhancement with constraints on an auditory-based spectrum.
IEEE Trans. Speech Audio Process., 1995

Source generator equalization and enhancement of spectral properties for robust speech recognition in noise and stress.
IEEE Trans. Speech Audio Process., 1995

Robust speech recognition training via duration and spectral-based stress token generation.
IEEE Trans. Speech Audio Process., 1995

Robust feature-estimation and objective quality assessment for noisy speech recognition using the Credit Card corpus.
IEEE Trans. Speech Audio Process., 1995

Markov model-based phoneme class partitioning for improved constrained iterative speech enhancement.
IEEE Trans. Speech Audio Process., 1995

ICARUS: Source generator based real-time recognition of speech in noisy stressful and Lombard effect environments.
Speech Commun., 1995

Stress independent robust HMM speech recognition using neural network stress classification.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Source generator based stressed speech perturbation.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Foreign accent classification using source generator based prosodic features.
Proceedings of the 1995 International Conference on Acoustics, 1995

A source generator based modeling framework for synthesis of speech under stress.
Proceedings of the 1995 International Conference on Acoustics, 1995

1994
Morphological constrained feature enhancement with adaptive cepstral compensation (MCE-ACC) for speech recognition in noise and Lombard effect.
IEEE Trans. Speech Audio Process., 1994

Boundary-Constrained Morphological Skeleton Minimization and Skeleton Reconstruction.
IEEE Trans. Pattern Anal. Mach. Intell., 1994

A source generator based production model for environmental robustness in speech recognition.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Nonlinear speech analysis using the teager energy operator with application to speech classification under stress.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Speech enhancement based on a new set of auditory constrained parameters.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

Duration and spectral based stress token generation for HMM speech recognition under stress.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

Minimum cost based phoneme class detection for improved iterative speech enhancement.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993
Adaptive source generator compensation and enhancement for speech recognition in noisy stressful environments.
Proceedings of the IEEE International Conference on Acoustics, 1993

1992
A new dual-channel speech enhancement technique with application to CELP coding in noise.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

A novel speech recognizer for keyword spotting.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

ICARUS: an mwave-based real-time speech recognition system in noise and lombard effect.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

Dual-channel speech enhancement with auditory spectrum based constraints.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

Morphological skeletonization for medical image compression.
Proceedings of the Fifth Annual IEEE Symposium on Computer-Based Medical Systems (CBMS'92), 1992

1991
Constrained iterative speech enhancement with application to speech recognition.
IEEE Trans. Signal Process., 1991

An improved image coding algorithm using morphological operator theory.
Proceedings of the 1991 International Conference on Acoustics, 1991

Speech enhancement employing adaptive boundary detection and morphological based spectral constraints.
Proceedings of the 1991 International Conference on Acoustics, 1991

1990
Lombard effect compensation for robust automatic speech recognition in noise.
Proceedings of the First International Conference on Spoken Language Processing, 1990

1989
Stress compensation and noise reduction algorithms for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 1989

1988
Constrained iterative speech enhancement with application to automatic speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 1988

1987
Iterative speech enhancement with spectral constraints.
Proceedings of the IEEE International Conference on Acoustics, 1987

