John H. L. Hansen
Orcid: 0000-0003-1382-9929Affiliations:
- University of Texas at Dallas
According to our database1,
John H. L. Hansen
authored at least 592 papers
between 1987 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on viaf.org
-
on orcid.org
-
on id.loc.gov
-
on d-nb.info
-
on utdallas.edu
-
on isni.org
On csauthors.net:
Bibliography
2024
Meeting the Challenges of a Growing ICASSP: Highlights from ICASSP 2024 [Conference Highlights].
IEEE Signal Process. Mag., May, 2024
Speech Enhancement for Cochlear Implant Recipients Using Deep Complex Convolution Transformer With Frequency Transformation.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Navigating the United States Legislative Landscape on Voice Privacy: Existing Laws, Proposed Bills, Protection for Children, and Synthetic Data for AI.
CoRR, 2024
CoRR, 2024
Toward Improving Synthetic Audio Spoofing Detection Robustness via Meta-Learning and Disentangled Training With Adversarial Examples.
IEEE Access, 2024
Joint Language and Speaker Classification in Naturalistic Bilingual Adult-Toddler Interactions.
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024
Apollo's Unheard Voices: Graph Attention Networks for Speaker Diarization and Clustering for Fearless Steps Apollo Collection.
Proceedings of the IEEE International Conference on Acoustics, 2024
Efficient Adapter Tuning of Pre-Trained Speech Models for Automatic Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2024
Dual-Path Minimum-Phase and All-Pass Decomposition Network for Single Channel Speech Dereverberation.
Proceedings of the IEEE International Conference on Acoustics, 2024
Situational Signal Processing with Ecological Momentary Assessment: Leveraging Environmental Context for Cochlear Implant Users.
Proceedings of the IEEE International Conference on Acoustics, 2024
Fearless Steps Apollo: Team Communications Based Community Resource Development for Science, Technology, Education, and Historical Preservation.
Proceedings of the IEEE International Conference on Acoustics, 2024
T-EnFP: An Efficient Transformer Encoder-Based System for Driving Behavior Classification.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Speech Commun., June, 2023
Historical Audio Search and Preservation: Finding Waldo Within the Fearless Steps Apollo 11 Naturalistic Audio Corpus [Applications Corner].
IEEE Signal Process. Mag., May, 2023
Attention and DCT Based Global Context Modeling for Text-Independent Speaker Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Bilateral Cochlear Implant Processing of Coding Strategies With CCi-MOBILE, an Open-Source Research Platform.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Domain Expansion for End-to-End Speech Recognition: Applications for Accent/Dialect Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
DeepComboSAD: Spectro-Temporal Correlation Based Speech Activity Detection for Naturalistic Audio Streams.
IEEE Signal Process. Lett., 2023
What Can an Accent Identifier Learn? Probing Phonetic and Prosodic Information in a Wav2vec2-based Accent Identification Model.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Assessment of Non-Native Speech Intelligibility using Wav2vec2-based Mispronunciation Detection and Multi-level Goodness of Pronunciation Transformer.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Speaker Tracking using Graph Attention Networks with Varying Duration Utterances across Multi-Channel Naturalistic Data: Fearless Steps Apollo-11 Audio Corpus.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Improving Transformer-Based Networks with Locality for Automatic Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
2022
IEEE Trans. Intell. Veh., 2022
CCi-MOBILE: A Portable Real Time Speech Processing Platform for Cochlear Implant and Hearing Research.
IEEE Trans. Biomed. Eng., 2022
IEEE ACM Trans. Audio Speech Lang. Process., 2022
SkipConvGAN: Monaural Speech Dereverberation Using Generative Adversarial Networks via Complex Time-Frequency Masking.
IEEE ACM Trans. Audio Speech Lang. Process., 2022
Assessing child communication engagement and statistical speech patterns for American English via speech recognition in naturalistic active learning spaces.
Speech Commun., 2022
Data-driven Attention and Data-independent DCT based Global Context Modeling for Text-independent Speaker Recognition.
CoRR, 2022
Impact of Naturalistic Field Acoustic Environments on Forensic Text-independent Speaker Verification System.
CoRR, 2022
Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
DEFORMER: Coupling Deformed Localized Patterns with Global Context for Robust End-to-end Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Multi-Frequency Information Enhanced Channel Attention Module for Speaker Representation Learning.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Speech Modification for Intelligibility in Cochlear Implant Listeners: Individual Effects of Vowel- and Consonant-Boosting.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Audio Anti-spoofing Using Simple Attention Module and Joint Optimization Based on Additive Angular Margin Loss and Meta-learning.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Challenges remain in Building ASR for Spontaneous Preschool Children Speech in Naturalistic Educational Environments.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
FeaRLESS: Feature Refinement Loss for Ensembling Self-Supervised Learning Features in Robust End-to-end Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Speaker Trait Enhancement for Cochlear Implant Users: A Case Study for Speaker Emotion Perception.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Bimodal Cochlear Implant Processing based on Assisted Hearing algorithms with CCi-MOBILE: an open-source research platform.
Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2022
2021
Block-Based High Performance CNN Architectures for Frame-Level Overlapping Speech Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Guided Generative Adversarial Neural Network for Representation Learning and Audio Generation Using Fewer Labelled Audio Data.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Curriculum Learning based approaches for robust end-to-end far-field speech recognition.
Speech Commun., 2021
Nonlinear waveform distortion: Assessment and detection of clipping on speech data and systems.
Speech Commun., 2021
An investigation of domain adaptation in speaker embedding space for speaker recognition.
Speech Commun., 2021
Proceedings of the 7th IEEE World Forum on Internet of Things, 2021
Development of CNN-Based Cochlear Implant and Normal Hearing Sound Recognition Models Using Natural and Auralized Environmental Audio.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Real-Time Speaker Counting in a Cocktail Party Scenario Using Attention-Guided Convolutional Neural Network.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Fearless Steps Challenge Phase-3 (FSC P3): Advancing SLT for Unseen Channel and Mission Data Across NASA Apollo Audio.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Measuring Frequency of Child-directed WH-Question Words for Alternate Preschool Locations using Speech Recognition and Location Tracking Technologies.
Proceedings of the ICMI '21 Companion: Companion Publication of the 2021 International Conference on Multimodal Interaction, Montreal, QC, Canada, October 18, 2021
DEAAN: Disentangled Embedding and Adversarial Adaptation Network for Robust Speaker Representation Learning.
Proceedings of the IEEE International Conference on Acoustics, 2021
Speaker Conditioning of Acoustic Models Using Affine Transformation for Multi-Speaker Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
Scenario Aware Speech Recognition: Advancements for Apollo Fearless Steps & CHiME-4 Corpora.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
2020
Respiratory Distress Detection from Telephone Speech using Acoustic and Prosodic Features.
CoRR, 2020
Assessing Child Communication Engagement via Speech Recognition in Naturalistic Active Learning Spaces.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020
Proceedings of the IEEE Intelligent Vehicles Symposium, 2020
Speaker Representation Learning Using Global Context Guided Channel and Time-Frequency Transformations.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Cross-Domain Adaptation with Discrepancy Minimization for Text-Independent Forensic Speaker Verification.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Open-Set Short Utterance Forensic Speaker Verification Using Teacher-Student Network with Explicit Inductive Bias.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
SkipConvNet: Skip Convolutional Neural Network for Speech Dereverberation Using Optimally Smoothed Spectral Mapping.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
FEARLESS STEPS Challenge (FS-2): Supervised Learning with Massive Naturalistic Apollo Data.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Mobile-Assisted Prosody Training for Limited English Proficiency: Learner Background and Speech Learning Pattern.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Effect of Spectral Complexity Reduction and Number of Instruments on Musical Enjoyment with Cochlear Implants.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Portable Smart-Space Research Interface to Predetermine Environment Acoustics for Cochlear implant and Hearing aid users with CCi-MOBILE.
Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2020
2019
Multi-domain adversarial training of neural network acoustic models for distant speech recognition.
Speech Commun., 2019
Speech and language processing for assessing child-adult interaction based on diarization and location.
Int. J. Speech Technol., 2019
CoRR, 2019
CoRR, 2019
Tagging child-adult interactions in naturalistic, noisy, daylong school environments using i-vector based diarization system.
Proceedings of the 8th ISCA International Workshop on Speech and Language Technology in Education, 2019
Risky Action Recognition in Lane Change Video Clips using Deep Spatiotemporal Networks with Segmentation Mask Transfer.
Proceedings of the 2019 IEEE Intelligent Transportation Systems Conference, 2019
Towards Complexity Level Classification of Driving Scenarios Using Environmental Information.
Proceedings of the 2019 IEEE Intelligent Transportation Systems Conference, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Convolutional Neural Network-Based Speech Enhancement for Cochlear Implant Recipients.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Quantifying Cochlear Implant Users' Ability for Speaker Identification Using CI Auditory Stimuli.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Toeplitz Inverse Covariance Based Robust Speaker Clustering for Naturalistic Audio Streams.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
A Machine Learning Based Clustering Protocol for Determining Hearing Aid Initial Configurations from Pure-Tone Audiograms.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Semi-supervised Learning with Generative Adversarial Networks for Arabic Dialect Identification.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Cross-lingual Text-independent Speaker Verification Using Unsupervised Adversarial Discriminative Domain Adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
CCi-MOBILE: Design and Evaluation of a Cochlear Implant and Hearing Aid Research Platform for Speech Scientists and Engineers.
Proceedings of the 2019 IEEE EMBS International Conference on Biomedical & Health Informatics, 2019
Analyzing Large Receptive Field Convolutional Networks for Distant Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Proceedings of the 8th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2019
2018
Text-Independent Speaker Verification Based on Triplet Convolutional Neural Network Embeddings.
IEEE ACM Trans. Audio Speech Lang. Process., 2018
IEEE ACM Trans. Audio Speech Lang. Process., 2018
IEEE ACM Trans. Audio Speech Lang. Process., 2018
Maximum-Likelihood Linear Transformation for Unsupervised Domain Adaptation in Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2018
Leveraging Frequency-Dependent Kernel and DIP-Based Clustering for Robust Speech Activity Detection in Naturalistic Audio Streams.
IEEE ACM Trans. Audio Speech Lang. Process., 2018
Speech Activity Detection in Naturalistic Audio Environments: Fearless Steps Apollo Corpus.
IEEE Signal Process. Lett., 2018
Speech Commun., 2018
On the issues of intra-speaker variability and realism in speech, speaker, and language recognition tasks.
Speech Commun., 2018
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Advancing Multi-Accented Lstm-CTC Speech Recognition Using a Domain Specific Student-Teacher Learning Paradigm.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
An Analysis of Transfer Learning for Domain Mismatched Text-independent Speaker Verification.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018
Assessing Speaker Engagement in 2-Person Debates: Overlap Detection in United States Presidential Debates.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Fearless Steps: Apollo-11 Corpus Advancements for Speech Technologies from Earth to the Moon.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Robust Speaker Clustering using Mixtures of von Mises-Fisher Distributions for Naturalistic Audio Streams.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Automatic Screening to Detect 'At Risk' Child Speech Samples using a Clinical Group Verification framework.
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018
2017
Lane-Change Detection From Steering Signal Using Spectral Segmentation and Learning-Based Classification.
IEEE Trans. Intell. Veh., 2017
IEEE ACM Trans. Audio Speech Lang. Process., 2017
Single Sideband Frequency Offset Estimation and Correction for Quality Enhancement and Speaker Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2017
IEEE ACM Trans. Audio Speech Lang. Process., 2017
IEEE ACM Trans. Audio Speech Lang. Process., 2017
IEEE ACM Trans. Audio Speech Lang. Process., 2017
IEEE Signal Process. Mag., 2017
Driver Modeling for Detection and Assessment of Driver Distraction: Examples from the UTDrive Test Bed.
IEEE Signal Process. Mag., 2017
IEEE J. Sel. Top. Signal Process., 2017
Phoneme class based feature adaptation for mismatch acoustic modeling and recognition of distant noisy speech.
Int. J. Speech Technol., 2017
Deep neural network training for whispered speech recognition using small databases and generative model sampling.
Int. J. Speech Technol., 2017
Using speech technology for quantifying behavioral characteristics in peer-led team learning sessions.
Comput. Speech Lang., 2017
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017
Navigation-orientated natural spoken language understanding for intelligent vehicle dialogue.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2017
Proceedings of the 20th IEEE International Conference on Intelligent Transportation Systems, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Speech Enhancement Based on Harmonic Estimation Combined with MMSE to Improve Speech Intelligibility for Cochlear Implant Recipients.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Curriculum Learning Based Probabilistic Linear Discriminant Analysis for Noise Robust Speaker Recognition.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Improved Gender Independent Speaker Recognition Using Convolutional Neural Network Based Bottleneck Features.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
On Multi-Domain Training and Adaptation of End-to-End RNN Acoustic Models for Distant Speech Recognition.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Speech Detection and Enhancement Using Single Microphone for Distant Speech Applications in Reverberant Environments.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Acoustic Scene Classification Using a CNN-SuperVector System Trained with Auditory and Spectrogram Image Features.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Environment aware speaker diarization for moving targets using parallel DNN-based recognizers.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
UTD-CRSS submission for MGB-3 Arabic dialect identification: Front-end and back-end advancements on broadcast speech.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017
2016
A Generalized Nonnegative Tensor Factorization Approach for Distant Speech Recognition With Distributed Microphones.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
IEEE ACM Trans. Audio Speech Lang. Process., 2016
IEEE ACM Trans. Audio Speech Lang. Process., 2016
IEEE Signal Process. Mag., 2016
IEEE Signal Process. Mag., 2016
Microphone Array Processing Strategies for Distant-Based Automatic Speech Recognition.
IEEE Signal Process. Lett., 2016
Effective word count estimation for long duration daily naturalistic audio recordings.
Speech Commun., 2016
Unsupervised accent classification for deep data fusion of accent and language information.
Speech Commun., 2016
KU-ISPL Language Recognition System for NIST 2015 i-Vector Machine Learning Challenge.
CoRR, 2016
Automatic measurement and analysis of the child verbal communication using classroom acoustics within a child care center.
Proceedings of the 5th Workshop on Child Computer Interaction, 2016
Employing speech and location information for automatic assessment of child language environments.
Proceedings of the First International Workshop on Sensing, 2016
Unsupervised k-means clustering based out-of-set candidate selection for robust open-set language recognition.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016
Speaker independent diarization for child language environment analysis using deep neural networks.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016
A robust diarization system for measuring dominance in Peer-Led Team Learning groups.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016
Between-Class Covariance Correction For Linear Discriminant Analysis in Language Recognition.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016
Proceedings of the 26th IEEE International Workshop on Machine Learning for Signal Processing, 2016
Improving speech recognition using limited accent diverse British English training data with deep neural networks.
Proceedings of the 26th IEEE International Workshop on Machine Learning for Signal Processing, 2016
Unsupervised driving performance assessment using free-positioned smartphones in vehicles.
Proceedings of the 19th IEEE International Conference on Intelligent Transportation Systems, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Improving Boundary Estimation in Audiovisual Speech Activity Detection Using Bayesian Information Criterion.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Fusion Strategies for Robust Speech Recognition and Keyword Spotting for Channel- and Noise-Degraded Speech.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Robustness in Speech, Speaker, and Language Recognition: "You've Got to Know Your Limitations".
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Generalized Discriminant Analysis (GDA) for Improved i-Vector Based Speaker Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Joint information from nonlinear and linear features for spoofing detection: An i-vector/DNN based approach.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
UTD-CRSS system for the NIST 2015 language recognition i-vector machine learning challenge.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
F0 estimation for noisy speech by exploring temporal harmonic structures in local time frequency spectrum segment.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the Digital Libraries: Knowledge, Information, and Data in an Open Access Society, 2016
2015
IEEE ACM Trans. Audio Speech Lang. Process., 2015
IEEE Signal Process. Mag., 2015
IEEE Signal Process. Lett., 2015
An advanced entropy-based feature with a frame-level vocal effort likelihood space modeling for distant whisper-island detection.
Speech Commun., 2015
Mean Hilbert envelope coefficients (MHEC) for robust speaker and language identification.
Speech Commun., 2015
Advanced parallel combined Gaussian mixture model based feature compensation integrated with iterative channel estimation.
Speech Commun., 2015
EURASIP J. Audio Speech Music. Process., 2015
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015
In-vehicle speech recognition and tutorial keywords spotting for novice drivers' performance evaluation.
Proceedings of the 2015 IEEE Intelligent Vehicles Symposium, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Frequency offset correction in single sideband (SSB) speech by deep neural network for speaker verification.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
An unsupervised visual-only voice activity detection approach using temporal orofacial features.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Probabilistic linear discriminant analysis for robust speaker identification in co-channel speech.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Anti-spoofing system: an investigation of measures to detect synthetic and human speech.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
A study on deep neural network acoustic model adaptation for robust far-field speech recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Leveraging automatic speech recognition in cochlear implants for improved speech intelligibility under reverberation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Robust overlapped speech detection and its application in word-count estimation for Prof-Life-Log data.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Analysis of speech and language communication for cochlear implant users in noisy lombard conditions.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Generative modeling of pseudo-target domain adaptation samples for whispered speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Studying the relationship between physical and language environments of children: Who's speaking to whom and where?
Proceedings of the IEEE Signal Processing and Signal Processing Education Workshop, 2015
Developing an educational electro-mechanical model of the middle ear and impulse noise reduction algorithm for cochlear implant users.
Proceedings of the IEEE Signal Processing and Signal Processing Education Workshop, 2015
An i-Vector PLDA based gender identification approach for severely distorted and multilingual DARPA RATS data.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
2014
Blind Spectral Weighting for Robust Speaker Identification under Reverberation Mismatch.
IEEE ACM Trans. Audio Speech Lang. Process., 2014
An investigation into back-end advancements for speaker recognition in multi-session and noisy enrollment scenarios.
IEEE ACM Trans. Audio Speech Lang. Process., 2014
Maximum Likelihood Acoustic Factor Analysis Models for Robust Speaker Verification in Noise.
IEEE ACM Trans. Audio Speech Lang. Process., 2014
Speech Commun., 2014
Effective background data selection for SVM-based speaker recognition with unseen test environments: more is not always better.
Int. J. Speech Technol., 2014
Environment mismatch compensation using average eigenspace-based methods for robust speech recognition.
Int. J. Speech Technol., 2014
Automatic assessment of language background in toddlers through phonotactic and pitch pattern modeling of short vocalizations.
Proceedings of the 4st Workshop on Child, Computer and Interaction, 2014
Training candidate selection for effective rejection in open-set language identification.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Spoken language mismatch in speaker verification: An investigation with NIST-SRE and CRSS Bi-Ling corpora.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Multichannel feature enhancement in distributed microphone arrays for robust distant speech recognition in smart rooms.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014
Investigating State-of-the-Art Speaker Verification in the case of Unlabeled Development Data.
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014
Threshold based decision-tree for automatic driving maneuver recognition using CAN-Bus signal.
Proceedings of the 17th International IEEE Conference on Intelligent Transportation Systems, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
'houston, we have a solution': a case study of the analysis of astronaut speech during NASA apollo 11 for long-term speaker modeling.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
F0 estimation in noisy speech based on long-term harmonic feature analysis combined with neural network classification.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Noisy speech enhancement based on long term harmonic model to improve speech intelligibility for hearing impaired listeners.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Investigation of the relative perceptual importance of temporal envelope and temporal fine structure between tonal and non-tonal languages.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Multichannel speech dereverberation based on convolutive nonnegative tensor factorization for ASR applications.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Uncertainty propagation in front end factor analysis for noise robust speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014
Robust and efficient environment detection for adaptive speech enhancement in cochlear implants.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
UT-Vocal Effort II: Analysis and constrained-lexicon recognition of whispered speech.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the Smart Mobile In-Vehicle Systems, Next Generation Advancements, 2014
2013
IEEE Trans. Speech Audio Process., 2013
IEEE Trans. Speech Audio Process., 2013
Unsupervised Speech Activity Detection Using Voicing Measures and Perceptual Spectral Flux.
IEEE Signal Process. Lett., 2013
Singing speaker clustering based on subspace learning in the GMM mean supervector space.
Speech Commun., 2013
In-set/out-of-set speaker recognition in sustained acoustic scenarios using sparse data.
Speech Commun., 2013
Acoustic analysis and feature transformation from neutral to whisper for speaker identification within whispered speech audio streams.
Speech Commun., 2013
Int. J. Speech Technol., 2013
Multi-modal highlight generation for sports videos using an information-theoretic excitability measure.
EURASIP J. Adv. Signal Process., 2013
Compensation of SNR and noise type mismatch using an environmental sniffing based speech recognition solution.
EURASIP J. Audio Speech Music. Process., 2013
Proceedings of the 36th International ACM SIGIR conference on research and development in Information Retrieval, 2013
Belt Up: Investigating the impact of in-vehicular conversation on driving performance.
Proceedings of the 2013 IEEE Intelligent Vehicles Symposium (IV), 2013
I4u submission to NIST SRE 2012: a large-scale collaborative effort for noise-robust speaker verification.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
'houston, we have a solution': using NASA apollo program to advance speech and language processing technology.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Acoustic factor analysis based universal background model for robust speaker verification in noise.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
All for one: feature combination for highly channel-degraded speech activity detection.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Impact of noise reduction and spectrum estimation on noise robust speaker identification.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
A preliminary study of child vocalization on a parallel corpus of US and shanghainese toddlers.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Supervector pre-processing for PRSVM-based Chinese and Arabic dialect identification.
Proceedings of the IEEE International Conference on Acoustics, 2013
A new mask-based objective measure for predicting the intelligibility of binary masked speech.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Overlapped-speech detection with applications to driver assessment for in-vehicle active safety systems.
Proceedings of the IEEE International Conference on Acoustics, 2013
Robust front-end processing for speaker identification over extremely degraded communication channels.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
An advanced feature compensation method employing acoustic model with phonetically constrained structure.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
2012
Phoneme Selective Speech Enhancement Using Parametric Estimators and the Mixture Maximum Model: A Unifying Approach.
IEEE Trans. Speech Audio Process., 2012
IEEE Trans. Speech Audio Process., 2012
IEEE Signal Process. Mag., 2012
Speech Commun., 2012
TEO-based speaker stress assessment using hybrid classification and tracking schemes.
Int. J. Speech Technol., 2012
Identifying impact factors of language development in young children's natural home environment.
Proceedings of the Third Workshop on Child, Computer and Interaction, 2012
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012
Factor analysis of acoustic features using a mixture of probabilistic principal component analyzers for robust speaker verification.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012
Leveraging sensor information from portable devices towards automatic driving maneuver recognition.
Proceedings of the 15th International IEEE Conference on Intelligent Transportation Systems, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Phoneme Class Based Adaptation for Mismatch Acoustic Modeling of Distant Noisy Speech.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Gaussian Map based Acoustic Model Adaptation Using Untranscribed Data for Speech Recognition in Severely Adverse Environments.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Integrated Feature Normalization and Enhancement for robust Speaker Recognition using Acoustic Factor Analysis.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Front-end Channel Compensation using Mixture-dependent Feature Transformations for i-Vector Speaker Recognition.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Arabic Dialect Identification - 'Is the Secret in the Silence?' and Other Observations.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
ProfLifeLog: Environmental analysis and keyword recognition for naturalistic daily audio streams.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
A comparison of front-end compensation strategies for robust LVCSR under room reverberation and increased vocal effort.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Feature compensation employing online GMM adaptation for speech recognition in unknown severely adverse environments.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
A multi-modal highlight extraction scheme for sports videos using an information-theoretic excitability measure.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Emerging Signal Processing Applications, 2012
Proceedings of the 2012 IEEE International Conference on Emerging Signal Processing Applications, 2012
2011
International Large-Scale Vehicle Corpora for Research on Driver Behavior on the Road.
IEEE Trans. Intell. Transp. Syst., 2011
Whisper-Island Detection Based on Unsupervised Segmentation With Entropy-Based Speech Feature Processing.
IEEE ACM Trans. Audio Speech Lang. Process., 2011
Dialect Classification via Text-Independent Training and Testing for Arabic, Spanish, and Chinese.
IEEE Trans. Speech Audio Process., 2011
A Novel Mask Estimation Method Employing Posterior-Based Representative Mean Estimate for Missing-Feature Speech Recognition.
IEEE Trans. Speech Audio Process., 2011
IEEE Trans. Speech Audio Process., 2011
IEEE Trans. Speech Audio Process., 2011
Speech Commun., 2011
Variational noise model composition through model perturbation for robust speech recognition with time-varying background noise.
Speech Commun., 2011
Information fusion for robust 'context and driver aware' active vehicle safety systems.
Inf. Fusion, 2011
EURASIP J. Adv. Signal Process., 2011
Frame-Level Vocal Effort Likelihood Space Modeling for Improved Whisper-Island Detection.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Phone Impact Based Speech Transmission Technique for Reliable Speech Recognition in Poor Wireless Network Conditions.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Feature Compensation for Speech Recognition in Severely Adverse Environments Due to Background Noise and Channel Distortion.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Robust Speaker Recognition in Non-Stationary Room Environments Based on Empirical Mode Decomposition.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Vowel Context and Speaker Interactions Influencing Glottal Open Quotient and Formant Frequency Shifts in Physical Task Stress.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Speaker Identification for Whispered Speech Using a Training Feature Transformation from Neutral to Whisper.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Relative proportionate NLMS: Improving convergence for acoustic channel identification.
Proceedings of the IEEE International Conference on Acoustics, 2011
Effective background data selection in SVM speaker recognition for unseen test environment: More is not always better.
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Hilbert envelope based features for robust speaker identification under reverberant mismatched conditions.
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Phoneme selective speech enhancement using the generalized parametric spectral subtraction estimator.
Proceedings of the IEEE International Conference on Acoustics, 2011
UT-Scope: Towards LVCSR under Lombard effect induced by varying types and levels of noisy background.
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the Workshops Proceedings of the Global Communications Conference, 2011
Proceedings of the 19th European Signal Processing Conference, 2011
Proceedings of the 19th European Signal Processing Conference, 2011
2010
Missing-Feature Reconstruction by Leveraging Temporal Spectral Correlation for Robust Speech Recognition in Background Noise Conditions.
IEEE Trans. Speech Audio Process., 2010
Unsupervised Equalization of Lombard Effect for Speech Recognition in Noisy Adverse Environments.
IEEE Trans. Speech Audio Process., 2010
Spoken Proper Name Retrieval for Limited Resource Languages Using Multilingual Hybrid Representations.
IEEE Trans. Speech Audio Process., 2010
Discriminative Training for Multiple Observation Likelihood Ratio Based Voice Activity Detection.
IEEE Signal Process. Lett., 2010
The physiological microphone (PMIC): A competitive alternative for speaker assessment in stress detection and speaker verification.
Speech Commun., 2010
Analysis of CFA-BF: Novel combined fixed/adaptive beamforming for robust speech recognition in real car environments.
Speech Commun., 2010
Automatic voice onset time detection for unvoiced stops (/p/, /t/, /k/) with application to accent classification.
Speech Commun., 2010
Automatic Beamforming for Blind Extraction of Speech From Music Environment Using Variance of Spectral Flux-Inspired Criterion.
IEEE J. Sel. Top. Signal Process., 2010
Proceedings of the IEEE Intelligent Vehicles Symposium (IV), 2010
Proceedings of the IEEE Intelligent Vehicles Symposium (IV), 2010
A Bayesian approach to voice activity detection using multiple statistical models and discriminative training.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Quality conversion of non-acoustic signals for facilitating human-to-human speech communication under harsh acoustic conditions.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Assessment of single-channel speech enhancement techniques for speaker identification under mismatched conditions.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
An effective feature compensation scheme tightly matched with speech recognizer employing SVM-based GMM generation.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
An efficient microphone array based voice activity detector for driver's speech in noise and music rich in-vehicle environments.
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Towards more intelligible physiological microphone speech: A probabilistic transformation approach.
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Dialect distance assessment method based on comparison of pitch pattern statistical models.
Proceedings of the IEEE International Conference on Acoustics, 2010
A kernel mean matching approach for environment mismatch compensation in speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010
Angry emotion detection from real-life conversational speech by leveraging content structure.
Proceedings of the IEEE International Conference on Acoustics, 2010
A novel feature sub-sampling method for efficient universal background model training in speaker verification.
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Test token driven acoustic balancing for sparse enrollment data in cohort GMM speaker recognition.
Proceedings of the 18th European Signal Processing Conference, 2010
A scanning window scheme based on SVM training error rate for unsupervised audio segmentation.
Proceedings of the 18th European Signal Processing Conference, 2010
Dialect identification: Impact of differences between read versus spontaneous speech.
Proceedings of the 18th European Signal Processing Conference, 2010
2009
Proceedings of the Handbook of Research on Digital Libraries: Design, 2009
IEEE Trans. Speech Audio Process., 2009
IEEE Trans. Speech Audio Process., 2009
Time-Frequency Correlation-Based Missing-Feature Reconstruction for Robust Speech Recognition in Band-Restricted Conditions.
IEEE Trans. Speech Audio Process., 2009
Analysis and Compensation of Lombard Speech Across Noise Type and Levels With Application to In-Set/Out-of-Set Speaker Recognition.
IEEE Trans. Speech Audio Process., 2009
Speech Commun., 2009
Preliminary study of stress/neutral detection on recordings of children in the natural home environment.
Proceedings of the Second Workshop on Child, Computer and Interaction, 2009
Automatic childhood autism detection by vocalization decomposition with phone-like units.
Proceedings of the Second Workshop on Child, Computer and Interaction, 2009
Assessing the stress/neutral speech environment in adult/child interactions for applications in child language development.
Proceedings of the Second Workshop on Child, Computer and Interaction, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Robust minimal variance distortionless speech power spectra enhancement using order statistic filter for microphone array.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Robust angry speech detection employing a TEO-based discriminative classifier combination.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Variational model composition for robust speech recognition with time-varying background noise.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Speaker identification for whispered speech using modified temporal patterns and MFCCs.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Speech enhancement minimizing generalized euclidean distortion using supergaussian priors.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Reduced complexity equalization of lombard effect for speech recognition in noisy adverse environments.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the IEEE International Conference on Vehicular Electronics and Safety, 2009
Assessment of speech dialog systems using multi-modal cognitive load analysis and driving performance metrics.
Proceedings of the IEEE International Conference on Vehicular Electronics and Safety, 2009
A speech presence microphone array beamformer using model based speech presence probability estimation.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Speaker identification with whispered speech based on modified LFCC parameters and feature mapping.
Proceedings of the IEEE International Conference on Acoustics, 2009
Unsupervised equalization of Lombard effect for speech recognition in noisy adverse environment.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
Mask estimation employing Posterior-based Representative Mean for missing-feature speech recognition with time-varying background noise.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
2008
A new perceptually motivated MVDR-based acoustic front-end (PMVDR) for robust automatic speech recognition.
Speech Commun., 2008
Feature Compensation Employing Multiple Environmental Models for Robust In-Vehicle Speech Recognition.
IEICE Trans. Inf. Syst., 2008
Towards an Intelligent Acoustic Front End for Automatic Speech Recognition: Built-in Speaker Normalization.
EURASIP J. Audio Speech Music. Process., 2008
EURASIP J. Audio Speech Music. Process., 2008
Proceedings of the First Workshop on Child, Computer and Interaction, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Filling acoustic holes through leveraged uncorellated GMMs for in-set/out-of-set speaker recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Detection of speech under physical stress: model development, sensor selection, and feature fusion.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Speaker identification for whispered speech based on frequency warping and score competition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the IEEE International Conference on Vehicular Electronics and Safety, 2008
Proceedings of the IEEE International Conference on Vehicular Electronics and Safety, 2008
Active accident avoidance case study: Integrating drowsiness monitoring system with lateral control and speed regulation in passenger vehicles.
Proceedings of the IEEE International Conference on Vehicular Electronics and Safety, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
Dialect Classification for Online Podcasts Fusing Acoustic and Language Based Structural and Semantic Information.
Proceedings of the ACL 2008, 2008
2007
IEEE Trans. Speech Audio Process., 2007
IEEE Trans. Speech Audio Process., 2007
IEEE Trans. Speech Audio Process., 2007
IEEE Trans. Speech Audio Process., 2007
IEEE Trans. Speech Audio Process., 2007
IEEE Signal Process. Lett., 2007
EURASIP J. Audio Speech Music. Process., 2007
Proceedings of the Speaker Classification I: Fundamentals, Features, and Methods, 2007
Multivariate Cepstral Feature Compensation on Band-limited Data for Robust Speech Recognition.
Proceedings of the 16th Nordic Conference of Computational Linguistics, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Advances in speechfind: transcript reliability estimation employing confidence measure based on discriminative sub-word model for SDR.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Getting start with UTDrive: driver-behavior modeling and assessment of distraction for in-vehicle speech systems.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Dialect Classification on Printed Text using Perplexity Measure and Conditional Random Fields.
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
Phonological feature based variable frame rate scheme for improved speech recognition.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
Speechfind for CDP: Advances in spoken document retrieval for the U. S. collaborative digitization program.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
2006
Advances in unsupervised audio classification and segmentation for the broadcast news and NGSW corpora.
IEEE Trans. Speech Audio Process., 2006
Speech Enhancement Based on Generalized Minimum Mean Square Error Estimators and Masking Properties of the Auditory System.
IEEE Trans. Speech Audio Process., 2006
IEEE Trans. Speech Audio Process., 2006
Analysis of lombard effect under different types and levels of noise with application to in-set speaker ID systems.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
A cohort - UBM approach to mitigate data sparseness for in-set/out-of-set speaker recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Missing-feature reconstruction for band-limited speech recognition in spoken document retrieval.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
A robust fusion method for multilingual spoken document retrieval systems employing tiered resources.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Stress Level Classification of Speech Using Euclidean Distance Metrics in a Novel Hybrid Multi-Dimensional Feature Space.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Unsupervised Class-Based Feature Compensation for Time-Variable Bandwidth-Limited Speech.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Perceptual Recognition Cues in Native English Accent Variation: "Listener Accent, Perceived Accent, and Comprehension".
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Spoken Proper Name Retrieval in Audio Streams for Limited-Resource Languages Via Lattice Based Search Using Hybrid Representations.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
2005
Rapid discriminative acoustic model based on eigenspace mapping for fast speaker adaptation.
IEEE Trans. Speech Audio Process., 2005
Efficient audio stream segmentation via the combined T<sup>2</sup> statistic and Bayesian information criterion.
IEEE Trans. Speech Audio Process., 2005
SpeechFind: Advances in Spoken Document Retrieval for a National Gallery of the Spoken Word.
IEEE Trans. Speech Audio Process., 2005
An Auditory-Masking-Threshold-Based Noise Suppression Algorithm GMMSE-AMT[ERB] for Listeners with Sensorineural Hearing Loss.
EURASIP J. Adv. Signal Process., 2005
Speaker verification using Gaussian mixture models within changing real car environments.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
In-set/out-of-set speaker identification based on discriminative speech frame selection.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Improved "TEO" feature-based automatic stress detection using physiological and acoustic speech sensors.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Statistical class-based MFCC enhancement of filtered and band-limited speech for robust ASR.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Towards an Intelligent Acoustic Front-End for Automatic Speech Recognition: Built-In Speaker Normalization (BISN).
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Effects of Phoneme Characteristics on TEO Feature-based Automatic Stress Detection in Speech.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Cluster-dependent modeling and confidence measure processing for in-set/out-of-set speaker identification.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Speech enhancement based on a combined multi-channel array with constrained iterative and auditory masked processing.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
Language modeling structures in audio transcription for retrieval of historical speeches.
Proceedings of the 2004 12th European Signal Processing Conference, 2004
2003
CSA-BF: a constrained switched adaptive beamformer for speech enhancement and recognition in real car environments.
IEEE Trans. Speech Audio Process., 2003
Evaluation of an auditory masked threshold noise suppression algorithm in normal-hearing and hearing-impaired listeners.
Speech Commun., 2003
CFA-BF: a novel combined fixed/adaptive beamforming for robust speech recognition in real car environments.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Perceptual MVDR-based cepstral coefficients (PMCCs) for high accuracy speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Frequency distribution based weighted sub-band approach for classification of emotional/stressful content in speech.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Perceptual based speech enhancement for normal-hearing and hearing-impaired individuals.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
CSA-BF: novel constrained switched adaptive beamforming for speech enhancement & recognition in real car environments.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
2002
A comparison of spectral smoothing methods for segment concatenation based speech synthesis.
Speech Commun., 2002
Speechfind: an experimental on-line spoken document retrieval system for historical audio archives.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Improved structural maximum likelihood eigenspace mapping for rapid speaker adaptation.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Frequency band analysis for stress detection using a teager energy operator based feature.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Rapid speaker adaptation using multi-stream Structural Maximum Likelihood Eigenspace Mapping.
Proceedings of the IEEE International Conference on Acoustics, 2002
Proceedings of the IEEE International Conference on Acoustics, 2002
2001
IEEE Trans. Speech Audio Process., 2001
IEEE Trans. Speech Audio Process., 2001
Fast likelihood computation techniques in nearest-neighbor based search for continuous speech recognition.
IEEE Signal Process. Lett., 2001
Proceedings of the First International Conference on Human Language Technology Research, 2001
Transcript-free search of audio archives for the national gallery of the spoken word.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2001
A novel algorithm for rapid speaker adaptation based on structural maximum likelihood eigenspace mapping.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Analysis of the root-cepstrum for acoustic modeling and fast decoding in speech recognition.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
2000
A comparative study of traditional and newly proposed features for recognition of speech under stress.
IEEE Trans. Speech Audio Process., 2000
High resolution speech feature parametrization for monophone-based stressed speech recognition.
IEEE Signal Process. Lett., 2000
Unsupervised audio stream segmentation and clustering via the Bayesian information criterion.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Improved Jacobian adaptation for fast acoustic model adaptation in noisy speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Audio stream phrase recognition for a national gallery of the spoken word: "one small step".
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the IEEE International Conference on Acoustics, 2000
1999
N-channel hidden Markov models for combined stressed speech classification and recognition.
IEEE Trans. Speech Audio Process., 1999
Selective training for hidden Markov models with applications to speech classification.
IEEE Trans. Speech Audio Process., 1999
The DSP Learning environment - modern DSP education: The Story of Three Greek Philosophers.
IEEE Signal Process. Mag., 1999
Auditory masking threshold estimation for broadband noise sources with application to speech enhancement.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999
Speech under stress conditions: overview of the effect on speech production and on system performance.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999
An experimental study of speaker verification sensitivity to computer voice-altered imposters.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999
1998
A nonlinear operator-based speech feature analysis method with application to vocal fold pathology assessment.
IEEE Trans. Biomed. Eng., 1998
An improved (Auto: I, LSP: T) constrained iterative speech enhancement for colored noise environments.
IEEE Trans. Speech Audio Process., 1998
An auditory-based distortion measure with application to concatenative speech synthesis.
IEEE Trans. Speech Audio Process., 1998
HMM-based stressed speech modeling with application to improved synthesis and recognition of isolated speech under stress.
IEEE Trans. Speech Audio Process., 1998
IEEE Trans. Speech Audio Process., 1998
An efficient scoring algorithm for Gaussian mixture model based speaker identification.
IEEE Signal Process. Lett., 1998
Speech Commun., 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
A duration-based confidence measure for automatic segmentation of noise corrupted speech.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Classification of speech under stress based on features derived from the nonlinear Teager energy operator.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998
1997
Text-directed speech enhancement employing phone class parsing and feature map constrained vector quantization.
Speech Commun., 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
A novel training approach for improving speech recognition under adverse stressful conditions.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Spectral normalization employing hidden Markov modeling of line spectrum pair frequencies.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997
1996
Direct speech feature estimation using an iterative EM algorithm for vocal fold pathology detection.
IEEE Trans. Biomed. Eng., 1996
IEEE Trans. Biomed. Eng., 1996
IEEE Trans. Speech Audio Process., 1996
Speech Commun., 1996
Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition.
Speech Commun., 1996
Speech Commun., 1996
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Vocal fold pathology assessment using AM autocorrelation analysis of the teager energy operator.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Recent advances in hypernasal speech detection using the nonlinear teager energy operator.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996
Text-directed speech enhancement using phoneme classification and feature map constrained vector quantization.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996
Improved HMM training and scoring strategies with application to accent classification.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996
1995
Dual-channel iterative speech enhancement with constraints on an auditory-based spectrum.
IEEE Trans. Speech Audio Process., 1995
Source generator equalization and enhancement of spectral properties for robust speech recognition in noise and stress.
IEEE Trans. Speech Audio Process., 1995
Robust speech recognition training via duration and spectral-based stress token generation.
IEEE Trans. Speech Audio Process., 1995
Robust feature-estimation and objective quality assessment for noisy speech recognition using the Credit Card corpus.
IEEE Trans. Speech Audio Process., 1995
Markov model-based phoneme class partitioning for improved constrained iterative speech enhancement.
IEEE Trans. Speech Audio Process., 1995
ICARUS: Source generator based real-time recognition of speech in noisy stressful and Lombard effect environments.
Speech Commun., 1995
Stress independent robust HMM speech recognition using neural network stress classification.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995
Proceedings of the 1995 International Conference on Acoustics, 1995
Proceedings of the 1995 International Conference on Acoustics, 1995
1994
Morphological constrained feature enhancement with adaptive cepstral compensation (MCE-ACC) for speech recognition in noise and Lombard effect.
IEEE Trans. Speech Audio Process., 1994
Boundary-Constrained Morphological Skeleton Minimization and Skeleton Reconstruction.
IEEE Trans. Pattern Anal. Mach. Intell., 1994
A source generator based production model for environmental robustness in speech recognition.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Nonlinear speech analysis using the teager energy operator with application to speech classification under stress.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994
Duration and spectral based stress token generation for HMM speech recognition under stress.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994
Minimum cost based phoneme class detection for improved iterative speech enhancement.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994
1993
Adaptive source generator compensation and enhancement for speech recognition in noisy stressful environments.
Proceedings of the IEEE International Conference on Acoustics, 1993
1992
A new dual-channel speech enhancement technique with application to CELP coding in noise.
Proceedings of the Second International Conference on Spoken Language Processing, 1992
Proceedings of the Second International Conference on Spoken Language Processing, 1992
ICARUS: an mwave-based real-time speech recognition system in noise and lombard effect.
Proceedings of the Second International Conference on Spoken Language Processing, 1992
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992
Proceedings of the Fifth Annual IEEE Symposium on Computer-Based Medical Systems (CBMS'92), 1992
1991
IEEE Trans. Signal Process., 1991
Proceedings of the 1991 International Conference on Acoustics, 1991
Speech enhancement employing adaptive boundary detection and morphological based spectral constraints.
Proceedings of the 1991 International Conference on Acoustics, 1991
1990
Proceedings of the First International Conference on Spoken Language Processing, 1990
1989
Proceedings of the IEEE International Conference on Acoustics, 1989
1988
Constrained iterative speech enhancement with application to automatic speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 1988
1987
Proceedings of the IEEE International Conference on Acoustics, 1987