Hugo Van hamme

Orcid: 0000-0003-1331-5186

Affiliations:
  • KU Leuven, Center for Processing Speech and Images


According to our database1, Hugo Van hamme authored at least 250 papers between 1988 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
SparrKULee: A Speech-Evoked Auditory Response Repository from KU Leuven, Containing the EEG of 85 Participants.
Data, August, 2024

Scale-aware dual-branch complex convolutional recurrent network for monaural speech enhancement.
Comput. Speech Lang., 2024

Efficient Extraction of Noise-Robust Discrete Units from Self-Supervised Speech Models.
CoRR, 2024

MSNER: A Multilingual Speech Dataset for Named Entity Recognition.
CoRR, 2024

Detecting Post-Stroke Aphasia Via Brain Responses to Speech in a Deep Learning Framework.
CoRR, 2024

Unsupervised Accent Adaptation Through Masked Language Model Correction of Discrete Self-Supervised Speech Units.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Benefits of pre-trained mono- and cross-lingual speech representations for spoken language understanding of Dutch dysarthric speech.
EURASIP J. Audio Speech Music. Process., December, 2023

Relating EEG to continuous speech using deep neural networks: a review.
CoRR, 2023

Analysis of XLS-R for Speech Quality Assessment.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Parameter-efficient Dysarthric Speech Recognition Using Adapter Fusion and Householder Transformation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Rehearsal-Free Online Continual Learning for Automatic Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Cross-Lingual Transfer Learning for Alzheimer's Detection from Spontaneous Speech.
Proceedings of the IEEE International Conference on Acoustics, 2023

Using Adapters to Overcome Catastrophic Forgetting in End-to-End Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Weight Averaging: A Simple Yet Effective Method to Overcome Catastrophic Forgetting in Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

ICASSP 2023 Auditory EEG Decoding Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023

Whisper-Slu: Extending a Pretrained Speech-to-Text Transformer for Low Resource Spoken Language Understanding.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Multi-encoder attention-based architectures for sound recognition with partial visual assistance.
EURASIP J. Audio Speech Music. Process., 2022

Bidirectional Representations for Low Resource Spoken Language Understanding.
CoRR, 2022

Impact of visual assistance for automated audio captioning.
CoRR, 2022

Multi-Source Transformer Architectures for Audiovisual Scene Classification.
CoRR, 2022

Optimizing Temporal Resolution Of Convolutional Recurrent Neural Networks For Sound Event Detection.
CoRR, 2022

Weak-Supervised Dysarthria-Invariant Features for Spoken Language Understanding Using an Fhvae and Adversarial Training.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Learning to Jointly Transcribe and Subtitle for End-To-End Spontaneous Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

A Multi-Resolution Front-End for End-to-End Speech Anti-Spoofing.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

Bottleneck Low-rank Transformers for Low-resource Spoken Language Understanding.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Pre-trained Speech Representations as Feature Extractors for Speech Quality Assessment in Online Conferencing Applications.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Relating the fundamental frequency of speech with EEG using a dilated convolutional network.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Multitask Learning for Low Resource Spoken Language Understanding.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Learning Subject-Invariant Representations from Speech-Evoked EEG Using Variational Autoencoders.
Proceedings of the IEEE International Conference on Acoustics, 2022

Continual Learning for Monolingual End-to-End Automatic Speech Recognition.
Proceedings of the 30th European Signal Processing Conference, 2022

Impact of Temporal Resolution on Convolutional Recurrent Networks for Audio Tagging and Sound Event Detection.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

Cross-lingual Detection of Dysphonic Speech for Dutch and Hungarian Datasets.
Proceedings of the 15th International Joint Conference on Biomedical Engineering Systems and Technologies, 2022

2021
Show me where the action is!
Multim. Tools Appl., 2021

Low resource end-to-end spoken language understanding with capsule networks.
Comput. Speech Lang., 2021

Predicting speech intelligibility from EEG using a dilated convolutional network.
CoRR, 2021

Pre-training for low resource speech-to-intent applications.
CoRR, 2021

An Equal Data Setting for Attention-Based Encoder-Decoder and HMM/DNN Models: A Case Study in Finnish ASR.
Proceedings of the Speech and Computer - 23rd International Conference, 2021

A Light Transformer For Speech-To-Intent Applications.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

A Study into Pre-Training Strategies for Spoken Language Understanding on Dysarthric Speech.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Speech Disorder Classification Using Extended Factorized Hierarchical Variational Auto-Encoders.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Extracting Different Levels of Speech Information from EEG Using an LSTM-Based Model.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Audiovisual Transfer Learning for Audio Tagging and Sound Event Detection.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Comparison of Self-Supervised Speech Pre-Training Methods on Flemish Dutch.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
State gradients for analyzing memory in LSTM language models.
Comput. Speech Lang., 2020

Analysis of memory in LSTM-RNNs for source separation.
CoRR, 2020

Multitask Learning with Capsule Networks for Speech-to-Intent Applications.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

An LSTM Based Architecture to Relate Speech Stimulus to Eeg.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Modeling the relationship between acoustic stimulus and EEG with a dilated convolutional neural network.
Proceedings of the 28th European Signal Processing Conference, 2020

On the long-term learning ability of LSTM LMs.
Proceedings of the 28th European Symposium on Artificial Neural Networks, 2020

2019
Hyperspectral image classification using Non-negative Tensor Factorization and 3D Convolutional Neural Networks.
Signal Process. Image Commun., 2019

Effective weakly supervised semantic frame induction using expression sharing in hierarchical hidden Markov models.
CoRR, 2019

Robust Hierarchical Learning for Non-Negative Matrix Factorization With Outliers.
IEEE Access, 2019

18μW SoC for near-microphone Keyword Spotting and Speaker Verification.
Proceedings of the 2019 Symposium on VLSI Circuits, Kyoto, Japan, June 9-14, 2019, 2019

Audiovisual Transformer Architectures for Large-Scale Classification and Synchronization of Weakly Labeled Audio Events.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

CNN-LSTM Models for Multi-Speaker Source Separation Using Bayesian Hyper Parameter Optimization.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Practical Applicability of Deep Neural Networks for Overlapping Speaker Separation.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018
Information-Weighted Neural Cache Language Models for ASR.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

The CAMETRON Lecture Recording System: High Quality Video Recording and Editing with Minimal Human Supervision.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

TF-LM: TensorFlow-based Language Modeling Toolkit.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Memory Time Span in LSTMs for Multi-Speaker Source Separation.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

State Gradients for RNN Memory Analysis.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Capsule Networks for Low Resource Spoken Language Understanding.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Multi-Scenario Deep Learning for Multi-Speaker Source Separation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Weakly Supervised Learning of Hidden Markov Models for Spoken Language Acquisition.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Joint Denoising and Dereverberation Using Exemplar-Based Sparse Representations and Decaying Norm Constraint.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Automatic relevance determination for nonnegative dictionary learning in the gamma-Poisson model.
Signal Process., 2017

Language Models of Spoken Dutch.
CoRR, 2017

Automatic Smoker Detection from Telephone Speech Signals.
Proceedings of the Speech and Computer - 19th International Conference, 2017

Improving Source Separation via Multi-Speaker Representations.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Character-Word LSTM Language Models.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

2016
Unseen Noise Estimation Using Separable Deep Auto Encoder for Speech Enhancement.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Noise robust exemplar matching with alpha-beta divergence.
Speech Commun., 2016

Under-determined reverberant audio source separation using Bayesian Non-negative Matrix Factorization.
Speech Commun., 2016

Noise robust footstep location estimation using a wireless acoustic sensor network.
J. Ambient Intell. Smart Environ., 2016

Unsupervised Learning of Continuous Density HMM for Variable-Length Spoken Unit Discovery.
IEICE Trans. Inf. Syst., 2016

Incrementally learn the relevance of words in a dictionary for spoken language acquisition.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

SCALE: A Scalable Language Engineering Toolkit.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Joint Sound Source Separation and Speaker Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Active speaker detection with audio-visual co-training.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

Data selection for noise robust exemplar matching.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Language model adaptation for ASR of spoken translations using phrase-based translation models and named entity models.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Supervised speech dereverberation in noisy environments using exemplar-based sparse representations.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Improving cross-domain n-gram language modelling with skipgrams.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
Coupled Dictionaries for Exemplar-Based Speech Enhancement and Automatic Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Blind audio source counting and separation of anechoic mixtures using the multichannel complex NMF framework.
Signal Process., 2015

A stable approach for model order selection in nonnegative matrix factorization.
Pattern Recognit. Lett., 2015

Two-stage blind audio source counting and separation of stereo instantaneous mixtures using Bayesian tensor factorisation.
IET Signal Process., 2015

Height estimation from speech signals using i-vectors and least-squares support vector regression.
Proceedings of the 38th International Conference on Telecommunications and Signal Processing, 2015

Noise robust exemplar matching for speech enhancement: applications to automatic speech recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Mutually exclusive grounding for weakly supervised non-negative matrix factorisation.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Efficient language model adaptation for automatic speech recognition of spoken translations.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

A multi-channel speech enhancement framework for robust NMF-based speech recognition for speech-impaired users.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Investigating modulation spectrogram features for deep neural network-based automatic speech recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Who's Speaking?: Audio-Supervised Classification of Active Speakers in Video.
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

Improving n-gram probability estimates by compound-head clustering.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Exemplar-based speech enhancement for deep neural network based automatic speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Adaptive noise dictionary design for noise robust exemplar matching of speech.
Proceedings of the 23rd European Signal Processing Conference, 2015

Noise robust exemplar matching with coupled dictionaries for single-channel speech enhancement.
Proceedings of the 23rd European Signal Processing Conference, 2015

Energy efficient monitoring of activities of daily living using wireless acoustic sensor networks in clean and noisy conditions.
Proceedings of the 23rd European Signal Processing Conference, 2015

Hybrid input spaces for exemplar-based noise robust speech recognition using coupled dictionaries.
Proceedings of the 23rd European Signal Processing Conference, 2015

Monitoring activities of daily living using Wireless Acoustic Sensor Networks in clean and noisy conditions.
Proceedings of the 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2015

2014
Noise Robust Exemplar Matching Using Sparse Representations of Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Non-Negative Factor Analysis of Gaussian Mixture Model Weight Adaptation for Language and Dialect Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

The self-taught vocal interface.
EURASIP J. Audio Speech Music. Process., 2014

Speaker age estimation using i-vectors.
Eng. Appl. Artif. Intell., 2014

Fast vocabulary acquisition in an NMF-based self-learning vocal user interface.
Comput. Speech Lang., 2014

Acquisition of ordinal words using weakly supervised NMF.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Dysarthric vocal interfaces with minimal training data.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Exemplar-based noise robust automatic speech recognition using modulation spectrogram features.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

GMM Weights Adaptation Based on Subspace Approaches for Speaker Verification.
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014

Learning Like a Toddler: Watching Television Series to Learn Vocabulary from Images and Audio.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Speech Recognition Web Services for Dutch.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Automatic assessment of children's reading with the FLaVoR decoding using a phone confusion model.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

An evaluation of unsupervised acoustic model training for a dysarthric speech interface.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Blind speech source localization, counting and separation for 2-channel convolutive mixtures in a reverberant environment.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Modelling primitive streaming of simple tone sequences through factorisation of modulation pattern tensors.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Noise-robust speech recognition with exemplar-based sparse representations using Alpha-Beta divergence.
Proceedings of the IEEE International Conference on Acoustics, 2014

Active-set newton algorithm for non-negative sparse coding of audio.
Proceedings of the IEEE International Conference on Acoustics, 2014

Coping with language data sparsity: Semantic head mapping of compound words.
Proceedings of the IEEE International Conference on Acoustics, 2014

Coupled dictionary training for exemplar-based speech enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2014

Blind audio source separation of stereo mixtures using Bayesian Non-negative Matrix Factorization.
Proceedings of the 22nd European Signal Processing Conference, 2014

2013
Missing Data Solutions for Robust Speech Recognition.
Proceedings of the Essential Speech and Language Technology for Dutch, 2013

The JASMIN Speech Corpus: Recordings of Children, Non-natives and Elderly People.
Proceedings of the Essential Speech and Language Technology for Dutch, 2013

Rapid speaker adaptation in latent speaker space with non-negative matrix factorization.
Speech Commun., 2013

Joint training of non-negative Tucker decomposition and discrete density hidden Markov models.
Comput. Speech Lang., 2013

The Diagonalized Newton Algorithm for Nonnegative Matrix Factorization
Proceedings of the 1st International Conference on Learning Representations, 2013

Bayesian non-parametric matrix factorization for discovering words in spoken utterances.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

An exemplar-based NMF approach to audio event detection.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Automatic Monitoring of Activities of Daily Living based on Real-life Acoustic Sensor Data: a~preliminary study.
Proceedings of the Fourth Workshop on Speech and Language Processing for Assistive Technologies, 2013

A Self Learning Vocal Interface for Speech-impaired Users.
Proceedings of the Fourth Workshop on Speech and Language Processing for Assistive Technologies, 2013

Automating speech reception threshold measurements using automatic speech recognition.
Proceedings of the Fourth Workshop on Speech and Language Processing for Assistive Technologies, 2013

Comparing and combining classifiers for self-taught vocal interfaces.
Proceedings of the Fourth Workshop on Speech and Language Processing for Assistive Technologies, 2013

Model order estimation using Bayesian NMF for discovering phone patterns in spoken utterances.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Self-taught assistive vocal interfaces: an overview of the ALADIN project.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

A diagonalized newton algorithm for non-negative sparse coding.
Proceedings of the IEEE International Conference on Acoustics, 2013

Embedding time warping in exemplar-based sparse representations of speech.
Proceedings of the IEEE International Conference on Acoustics, 2013

Accent recognition using i-vector, Gaussian Mean Supervector and Gaussian posterior probability supervector for spontaneous telephone speech.
Proceedings of the IEEE International Conference on Acoustics, 2013

Exemplar selection techniques for sparse representations of speech using multiple dictionaries.
Proceedings of the 21st European Signal Processing Conference, 2013

NMF-based keyword learning from scarce data.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Large Scale Graph Regularized Non-Negative Matrix Factorization With ℓ<sub>1</sub> Normalization Based on Kullback-Leibler Divergence.
IEEE Trans. Signal Process., 2012

Supervised input space scaling for non-negative matrix factorization.
Signal Process., 2012

Human language technology and communicative disabilities: requirements and possibilities for the future.
Lang. Resour. Evaluation, 2012

Multi-candidate missing data imputation for robust speech recognition.
EURASIP J. Audio Speech Music. Process., 2012

Subspace-GMM acoustic models for under-resourced languages: feasibility study.
Proceedings of the Third Workshop on Spoken Language Technologies for Under-resourced Languages, 2012

Noise-robust digit recognition with exemplar-based sparse representations of variable length.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2012

Combining exemplar-based matching and exemplar-based sparse representations of speech.
Proceedings of the 2012 Symposium on Machine Learning in Speech and Language Processing, 2012

Label Noise Robustness and Learning Speed in a Self-Learning Vocal User Interface.
Proceedings of the Natural Interaction with Robots, 2012

Speaker age estimation using Hidden Markov Model weight supervectors.
Proceedings of the 11th International Conference on Information Science, 2012

Speaker adaptation using Maximum Likelihood General Regression.
Proceedings of the 11th International Conference on Information Science, 2012

Robust Tracking for Automatic Reading Tutors.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Advances in noise robust digit recognition using hybrid exemplar-based techniques.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

A Self-Learning Assistive Vocal Interface Based on Vocabulary Learning and Grammar Induction.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Data-driven speech representations for NMF-based word learning.
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012

Age Estimation from Telephone Speech using i-vectors.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Latent variable speaker adaptation of Gaussian mixture weights and means.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Tri-factorization learning of sub-word units with application to vocabulary acquisition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Fast word acquisition in an NMF-based learning framework.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Weakly supervised keyword learning using sparse representations of speech.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

An On-Line NMF Model for Temporal Pattern Learning: Theory with Application to Automatic Speech Recognition.
Proceedings of the Latent Variable Analysis and Signal Separation, 2012

Towards a Self-Learning Assistive Vocal Interface: Vocabulary and Grammar Learning.
Proceedings of the 1st Workshop on Speech and Multimodal Interaction in Assistive Environments, 2012

2011
Advances in Missing Feature Techniques for Robust Large-Vocabulary Continuous Speech Recognition.
IEEE Trans. Speech Audio Process., 2011

Sparse conjugate directions pursuit with application to fixed-size kernel models.
Mach. Learn., 2011

Modelling vocabulary acquisition, adaptation and generalization in infants using adaptive Bayesian PLSA.
Neurocomputing, 2011

Gaussian Selection Using Self-Organizing Map for Automatic Speech Recognition.
Proceedings of the Advances in Self-Organizing Maps - 8th International Workshop, 2011

A two-layer non-negative matrix factorization model for vocabulary discovery.
Proceedings of the 2011 Symposium on Machine Learning in Speech and Language Processing, 2011

Rapid Speaker Adaptation using Maximum Likelihood Neural Regression.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Image pattern discovery by using the spatial closeness of visual code words.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Phonetic analysis of a computational model for vocabulary acquisition from auditory inputs.
Proceedings of the 1st International Conference on Development and Learning and on Epigenetic Robotics, 2011

Rapid speaker adaptation with speaker adaptive training and non-negative matrix factorization.
Proceedings of the IEEE International Conference on Acoustics, 2011

Unsupervised vocabulary discovery using non-negative matrix factorization with graph regularization.
Proceedings of the IEEE International Conference on Acoustics, 2011

Progress in example based automatic speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Speaker age estimation and gender detection based on supervised Non-Negative Matrix Factorization.
Proceedings of the IEEE Workshop on Biometric Measurements and Systems for Security and Medical Applications, 2011

An hierarchical exemplar-based sparse model of speech, with an application to ASR.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

Automatic Speech Recognition Using Missing Data Techniques: Handling of Real-World Data.
Proceedings of the Robust Speech Recognition of Uncertain or Missing Data, 2011

2010
Compressive Sensing for Missing Data Imputation in Noise Robust Speech Recognition.
IEEE J. Sel. Top. Signal Process., 2010

Learning from images and speech with Non-negative Matrix Factorization enhanced by input space scaling.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Feature versus model based noise robustness.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Histogram equalization and noise masking for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Automatic voice onset time estimation from reassignment spectra.
Speech Commun., 2009

Unsupervised learning of time-frequency patches as a noise-robust representation of speech.
Speech Commun., 2009

Developing a reading tutor: Design and evaluation of dedicated speech recognition and synthesis modules.
Speech Commun., 2009

A Computational Model of Language Acquisition: the Emergence of Words.
Fundam. Informaticae, 2009

On a Computational Model for Language Acquisition: Modeling Cross-Speaker Generalisation.
Proceedings of the Text, Speech and Dialogue, 12th International Conference, 2009

Applying non-negative matrix factorization on time-frequency reassignment spectra for missing data mask estimation.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Application of noise robust MDT speech recognition on the SPEECON and speechdat-car databases.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Evaluation of phone lattice based speech decoding.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Adaptive non-negative matrix factorization in a computational model of language acquisition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008
Discovering Phone Patterns in Spoken Utterances by Non-Negative Matrix Factorization.
IEEE Signal Process. Lett., 2008

Recording Speech of Children, Non-Natives and Elderly People for HLT Applications: the JASMIN-CGN Corpus.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Children's Oral Reading Corpus (CHOREC): Description and Assessment of Annotator Agreement.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

HAC-models: a novel approach to continuous speech recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Comparison of variable selection methods and classifiers for native accent identification.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Discriminative model combination and language model selection in a reading tutor for children.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Lip synchronization: from phone lattice to PCA eigen-projections using neural networks.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Improving the multigram algorithm by using lattices as input.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

A computational model of language acquisition: focus on word discovery.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Robust speech recognition using missing data techniques in the prospect domain and fuzzy masks.
Proceedings of the IEEE International Conference on Acoustics, 2008

Estimation of the voicing cut-off frequency contour of natural speech based on harmonic and aperiodic energies.
Proceedings of the IEEE International Conference on Acoustics, 2008

Fast speaker adaptation using non-negative matrix factorization.
Proceedings of the IEEE International Conference on Acoustics, 2008

Unsupervised learning of auditory filter banks using non-negative matrix factorisation.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Estimation of the Voicing Cut-Off Frequency Contour Based on a Cumulative Harmonicity Score.
IEEE Signal Process. Lett., 2007

A Review of Signal Subspace Speech Enhancement and Its Application to Noise Robust Speech Recognition.
EURASIP J. Adv. Signal Process., 2007

Automatically learning the units of speech by non-negative matrix factorisation.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Vector-quantization based mask estimation for missing data automatic speech recognition.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Fixed-size kernel logistic regression for phoneme classification.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Automatic assessment of children's reading level.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

DCT-Based Amplitude and Frequency Modulated Harmonic-Plus-Noise Modelling for Text-to-Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
Model-based feature enhancement with uncertainty decoding for noise robust ASR.
Speech Commun., 2006

JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Single frame selection for phoneme classification.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Handling convolutional noise in missing data automatic speech recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Robust phone lattice decoding.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Developing an automatic assessment tool for children²s oral reading.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Handling Time-Derivative Features in a Missing Data Framework for Robust Automatic Speech Recognition.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Maximum Likelihood Based Temporal Frame Selection.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Application of Minimum Statistics and Minima Controlled Recursive Averaging Methods to Estimate a Cepstral Noise Model for Robust ASR.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Kalman and unscented kalman filter feature enhancement for noise robust ASR.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

PROSPECT features and their application to missing data techniques for vocal tract length normalization.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Statistical language models for large vocabulary spontaneous speech recognition in dutch.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Effect of Phase-Sensitive Environment Model and Higher Order VTS on Noisy Speech Feature Enhancement.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
A Comparison of Two Different Approaches to Morphological Analysis of Dutch.
Proceedings of the 7th Meeting of the ACL Special Interest Group in Computational Phonology: Current Themes in Computational Phonology and Morphology, 2004

Evaluation and Adaptation of the Celex Dutch Morphological Database.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Use and Evaluation of Prosodic Annotations in Dutch.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Accounting for the uncertainty of speech estimates in the context of model-based feature enhancement.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

PROSPECT features and their application to missing data techniques for robust speech recognition.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Joint removal of additive and convolutional noise with model-based feature enhancement.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Robust speech recognition using cepstral domain missing data techniques and noisy masks.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Correction of likelihoods for degrees of freedom in robust speech recognition using missing feature theory.
Proceedings of the Seventh International Symposium on Signal Processing and Its Applications, 2003

Evaluation of model-based feature enhancement on the AURORA-4 task.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Robust speech recognition using model-based feature enhancement.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Robust speech recognition using missing feature theory in the cepstral or LDA domain.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Two correction models for likelihoods in robust speech recognition using missing feature theory.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Assessment of dereverberation algorithms for large vocabulary speech recognition systems.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

FLavor: a flexible architecture for LVCSR.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
Investigation of speech recognition over IP channels.
Proceedings of the IEEE International Conference on Acoustics, 2002

2000
Dialect adaptation for Mandarin Chinese speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Evaluation of various confidence-based strategies for isolated word rejection.
Proceedings of the IEEE International Conference on Acoustics, 2000

Model-based feature enhancement for noisy speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
Accuracy versus complexity in context dependent phone modeling.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998
Adapting Western Language Recognizer for Chinese Recognition.
Proceedings of the 1998 International Symposium on Chinese Spoken Language Processing, 1998

Speaker normalization for automatic speech recognition - An on-line approach.
Proceedings of the 9th European Signal Processing Conference, 1998

1996
General framework for asymptotic properties of generalized weighted nonlinear least-squares estimators with deterministic and stochastic weighting.
IEEE Trans. Autom. Control., 1996

An adaptive-beam pruning technique for continuous speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

1994
Parametric identification of transfer functions in the frequency domain-a survey.
IEEE Trans. Autom. Control., 1994

Identification of linear dynamic systems using piecewise constant excitations: Use, misuse and alternatives.
Autom., 1994

Comparison of acoustic features and robustness tests of a real-time recogniser using a hardware telephone line simulator.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

ARDOSS: autoregressive domain spectral subtraction for robust speech recognition in additive noise.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

1991
A stochastical limit to the resolution of least squares estimation of the frequencies of a double complex sinusoid.
IEEE Trans. Signal Process., 1991

Maximum likelihood estimation of superimposed complex sinusoids in white Gaussian noise by reduced effort coarse search (RECS).
IEEE Trans. Signal Process., 1991

1988
Karhunen-Loeve analysis of dynamic sequences of thermographic images for early breast cancer detection.
Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1988


  Loading...