Reinhold Häb-Umbach

Orcid: 0000-0001-9468-7330

Affiliations:
  • University of Paderborn, Department of Electrical Engineering and Information Technology, Germany


According to our database1, Reinhold Häb-Umbach authored at least 243 papers between 1989 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Diminishing Domain Mismatch for DNN-Based Acoustic Distance Estimation via Stochastic Room Reverberation Models.
Proceedings of the 18th International Workshop on Acoustic Signal Enhancement, 2024

Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization.
Proceedings of the IEEE International Conference on Acoustics, 2024

Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios.
Proceedings of the IEEE International Conference on Acoustics, 2024

Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder.
Proceedings of the 32nd European Signal Processing Conference, 2024

Target-Specific Dataset Pruning for Compression of Audio Tagging Models.
Proceedings of the 32nd European Signal Processing Conference, 2024

2023
Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Mixture Encoder Supporting Continuous Speech Separation for Meeting Recognition.
CoRR, 2023

LibriWASN: A Data Set for Meeting Separation, Diarization, and Recognition with Asynchronous Recording Devices.
CoRR, 2023

MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems.
CoRR, 2023

Post-Processing Independent Evaluation of Sound Event Detection Systems.
CoRR, 2023

Re-examining the quality dimensions of synthetic speech.
Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023

A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Mixture Encoder for Joint Speech Separation and Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems.
Proceedings of the IEEE International Conference on Acoustics, 2023

Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting Diarization.
Proceedings of the IEEE International Conference on Acoustics, 2023

On the Integration of Sampling Rate Synchronization and Acoustic Beamforming.
Proceedings of the 31st European Signal Processing Conference, 2023

Spatial Diarization for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks.
Proceedings of the 57th Asilomar Conference on Signals, Systems, and Computers, ACSSC 2023, Pacific Grove, CA, USA, October 29, 2023

2022
Warping of Radar Data Into Camera Image for Cross-Modal Supervision in Automotive Applications.
IEEE Trans. Veh. Technol., 2022

A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network.
CoRR, 2022

Informed vs. Blind beamforming in AD-HOC Acoustic Sensor Networks for Meeting Transcription.
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022

MMS-MSG: A Multi-Purpose Multi-Speaker Mixture Signal Generator.
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022

Monaural Source Separation: From Anechoic To Reverberant Environments.
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022

Investigation into Target Speaking Rate Adaptation for Voice Conversion.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Utterance-by-utterance overlap-aware neural diarization with Graph-PIT.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

An Initialization Scheme for Meeting Separation with Spatial Mixture Models.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

SA-SDR: A Novel Loss Function for Separation of Meeting Style Data.
Proceedings of the IEEE International Conference on Acoustics, 2022

On Synchronization of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling Rate Offsets and Speaker Changes.
Proceedings of the IEEE International Conference on Acoustics, 2022

Threshold Independent Evaluation of Sound Event Detection Scores.
Proceedings of the IEEE International Conference on Acoustics, 2022

Neural Network Based Carrier Frequency Offset Estimation From Speech Transmitted Over High Frequency Channels.
Proceedings of the 30th European Signal Processing Conference, 2022

2021
Explanation as a Social Practice: Toward a Conceptual Framework for the Social Design of AI Systems.
IEEE Trans. Cogn. Dev. Syst., 2021

Far-Field Automatic Speech Recognition.
Proc. IEEE, 2021

Geometry calibration in wireless acoustic sensor networks utilizing DoA and distance information.
EURASIP J. Audio Speech Music. Process., 2021

A Database for Research on Detection and Enhancement of Speech Transmitted over HF links.
CoRR, 2021

Voice Conversion Based Speaker Normalization for Acoustic Unit Discovery.
CoRR, 2021

Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend.
Proceedings of the IEEE International Conference on Acoustics, 2021

Iterative Geometry Calibration from Distance Estimates for Wireless Acoustic Sensor Networks.
Proceedings of the IEEE International Conference on Acoustics, 2021

Contrastive Predictive Coding Supported Factorized Variational Autoencoder For Unsupervised Learning Of Disentangled Speech Representations.
Proceedings of the IEEE International Conference on Acoustics, 2021

Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2021

Open Range Pitch Tracking for Carrier Frequency Difference Estimation from HF Transmitted Speech.
Proceedings of the 29th European Signal Processing Conference, 2021

Adapting Sound Recognition to A New Environment Via Self-Training.
Proceedings of the 29th European Signal Processing Conference, 2021

Self-Trained Audio Tagging and Sound Event Detection in Domestic Environments.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

Speeding Up Permutation Invariant Training for Source Separation.
Proceedings of the 14th ITG Conference on Speech Communication, online, September 29, 2021

On Source-Microphone Distance Estimation Using Convolutional Recurrent Neural Networks.
Proceedings of the 14th ITG Conference on Speech Communication, online, September 29, 2021

A Comparison and Combination of Unsupervised Blind Source Separation Techniques.
Proceedings of the 14th ITG Conference on Speech Communication, online, September 29, 2021

2020
Jointly Optimal Denoising, Dereverberation, and Source Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Convolutive Transfer Function Invariant SDR training criteria for Multi-Channel Reverberant Speech Separation.
CoRR, 2020

Adversarial Contrastive Predictive Coding for Unsupervised Learning of Disentangled Representations.
CoRR, 2020

Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and its Application to Speaker Stream Separation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Statistical and Neural Network Based Speech Activity Detection in Non-Stationary Acoustic Environments.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

End-to-End Training of Time Domain Audio Separation and Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Demystifying TasNet: A Dissecting Approach.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Jointly Optimal Dereverberation and Beamforming.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Deep Neural Network based Distance Estimation for Geometry Calibration in Acoustic Sensor Networks.
Proceedings of the 28th European Signal Processing Conference, 2020

Forward-Backward Convolutional Recurrent Neural Networks and Tag-Conditioned Convolutional Neural Networks for Weakly Labeled Semi-Supervised Sound Event Detection.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

2019
Speech Processing for Digital Home Assistants: Combining signal processing with deep-learning techniques.
IEEE Signal Process. Mag., 2019

Introduction to the Issue on Far-Field Speech Processing in the Era of Deep Learning: Speech Enhancement, Separation, and Recognition.
IEEE J. Sel. Top. Signal Process., 2019

Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation.
IEEE J. Sel. Top. Signal Process., 2019

SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition.
CoRR, 2019

Unsupervised Learning of a Disentangled Speech Representation for Voice Conversion.
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

A Study on Online Source Extraction in the Presence of Changing Speaker Positions.
Proceedings of the Statistical Language and Speech Processing, 2019

Privacy-Preserving Variational Information Feature Extraction for Domestic Activity Monitoring versus Speaker Identification.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Multi-Channel Block-Online Source Extraction Based on Utterance Adaptation.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Unsupervised Training of Neural Mask-Based Beamforming.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

All-neural Online Source Separation, Counting, and Diarization for Meeting Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2019

Joint Optimization of Neural Network-based WPE Dereverberation and Acoustic Model for Robust Online ASR.
Proceedings of the IEEE International Conference on Acoustics, 2019

Unsupervised Training of a Deep Clustering Model for Multichannel Blind Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Convolutional Recurrent Neural Network and Data Augmentation for Audio Tagging with Noisy Labels and Minimal Supervision.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

Weakly Supervised Sound Activity Detection and Event Classification in Acoustic Sensor Networks.
Proceedings of the 8th IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, 2019

An Investigation into the Effectiveness of Enhancement in ASR Training and Test for Chime-5 Dinner Party Transcription.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Machine learning techniques for semantic analysis of dysarthric speech: An experimental study.
Speech Commun., 2018

Frame-Online DNN-WPE Dereverberation.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Full Bayesian Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Integrating Neural Network Based Beamforming and Weighted Prediction Error Dereverberation.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Deep Attractor Networks for Speaker Re-Identification and Blind Source Separation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Dual Frequency- and Block-Permutation Alignment for Deep Learning Based Block-Online Blind Source Separation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Exploring Practical Aspects of Neural Mask-Based Beamforming for Far-Field Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Efficient Sampling Rate Offset Compensation - an Overlap-Save Based Approach.
Proceedings of the 26th European Signal Processing Conference, 2018

Insights into the Interplay of Sampling Rate Offsets and MVDR Beamforming.
Proceedings of the 13th ITG Symposium on Speech Communication, 2018

Smoothing along Frequency in Online Neural Network Supported Acoustic Beamforming.
Proceedings of the 13th ITG Symposium on Speech Communication, 2018

Benchmarking Neural Network Architectures for Acoustic Sensor Networks.
Proceedings of the 13th ITG Symposium on Speech Communication, 2018

NARA-WPE: A Python package for weighted prediction error dereverberation in Numpy and Tensorflow for online and offline processing.
Proceedings of the 13th ITG Symposium on Speech Communication, 2018

MARVELO - A Framework for Signal Processing in Wireless Acoustic Sensor Networks.
Proceedings of the 13th ITG Symposium on Speech Communication, 2018

2017
A generic neural acoustic beamforming architecture for robust multi-channel speech processing.
Comput. Speech Lang., 2017

Directional Statistics and Filtering Using libDirectional.
CoRR, 2017

On the Computation of Complex-valued Gradients with Application to Statistically Optimum Beamforming.
CoRR, 2017

Multi-stage coherence drift based sampling rate synchronization for acoustic beamforming.
Proceedings of the 19th IEEE International Workshop on Multimedia Signal Processing, 2017

A study on transfer learning for acoustic event detection in a real life scenario.
Proceedings of the 19th IEEE International Workshop on Multimedia Signal Processing, 2017

Leveraging Text Data for Word Segmentation for Underresourced Languages.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Tight Integration of Spatial and Spectral Features for BSS with Deep Clustering Embeddings.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Beamnet: End-to-end training of a beamformer-supported multi-channel ASR system.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

A generalized log-spectral amplitude estimator for single-channel speech enhancement.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Optimizing neural-network supported acoustic beamforming by algorithmic differentiation.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

The REVERB Challenge: A Benchmark Task for Reverberation-Robust ASR Techniques.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016
Acoustic Microphone Geometry Calibration: An overview and experimental evaluation of state-of-the-art algorithms.
IEEE Signal Process. Mag., 2016

A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research.
EURASIP J. Adv. Signal Process., 2016

On the Appropriateness of Complex-Valued Neural Networks for Speech Enhancement.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

A priori SNR Estimation Using a Generalized Decision Directed Approach.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Neural network based spectral mask estimation for acoustic beamforming.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Blind speech separation based on complex spherical k-mode clustering.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Investigations into Bluetooth low energy localization precision limits.
Proceedings of the 24th European Signal Processing Conference, 2016

Robust Online Multi-Channel Speech Recognition.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

On the Bias of Direction of Arrival Estimation Using Linear Microphone Arrays.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

Factor Graph Decoding for Speech Presence Probability Estimation.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

A Priori SNR Estimation Using Weibull Mixture Model.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

Noise-Presence-Probability-Based Noise PSD Estimation by Using DNNs.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

2015
A combined hardware-software approach for acoustic sensor network synchronization.
Signal Process., 2015

Autonomous Learning of Representations.
Künstliche Intell., 2015

Absolute Geometry Calibration of Distributed Microphone Arrays in an Audio-Visual Sensor Network.
CoRR, 2015

Typicality and emotion in the voice of children with autism spectrum condition: evidence across three languages.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Semantic analysis of spoken input using Markov logic networks.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

On optimal smoothing in minimum statistics based noise tracking.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Source counting in speech mixtures by nonparametric Bayesian estimation of an infinite Gaussian mixture model.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Aligning training modelswith smartphone properties in WiFi fingerprinting based indoor localization.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Unsupervised adaptation of a denoising autoencoder by Bayesian Feature Enhancement for reverberant asr under mismatch conditions.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

DOA-estimation based on a complex Watson kernel method.
Proceedings of the 23rd European Signal Processing Conference, 2015

BLSTM supported GEV beamformer front-end for the 3RD CHiME challenge.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
An Overview of Noise-Robust Automatic Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

A New Observation Model in the Logarithmic Mel Power Spectral Domain for the Automatic Recognition of Noisy Reverberant Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Towards online source counting in speech mixtures applying a variational EM for complex Watson mixture models.
Proceedings of the 14th International Workshop on Acoustic Signal Enhancement, 2014

An evaluation of unsupervised acoustic model training for a dysarthric speech interface.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

A gossiping approach to sampling clock synchronization in wireless acoustic sensor networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

Iterative Bayesian word segmentation for unsupervised vocabulary discovery from phoneme lattices.
Proceedings of the IEEE International Conference on Acoustics, 2014

Source counting in speech mixtures using a variational EM approach for complex WATSON mixture models.
Proceedings of the IEEE International Conference on Acoustics, 2014

Online Observation ErrorModel Estimation for Acoustic Sensor Network Synchronization.
Proceedings of the 11th ITG Symposium on Speech Communication, 2014

Coordinate Mapping Between an Acoustic and Visual Sensor Network in the Shape Domain for a Joint Self-Calibrating Speaker Tracking.
Proceedings of the 11th ITG Symposium on Speech Communication, 2014

Spectral Noise Tracking for Improved Nonstationary Noise Robust ASR.
Proceedings of the 11th ITG Symposium on Speech Communication, 2014

2013
Bayesian Feature Enhancement for Reverberation and Noise Robust Speech Recognition.
IEEE Trans. Speech Audio Process., 2013

Smartphone-based sensor fusion for improved vehicular navigation.
Proceedings of the 10th Workshop on Positioning, Navigation and Communication, 2013

Server based indoor navigation using RSSI and inertial sensor information.
Proceedings of the 10th Workshop on Positioning, Navigation and Communication, 2013

Using the turbo principle for exploiting temporal and spectral correlations in speech presence probability estimation.
Proceedings of the IEEE International Conference on Acoustics, 2013

DOA-based microphone array postion self-calibration using circular statistics.
Proceedings of the IEEE International Conference on Acoustics, 2013

Parameter estimation and classification of censored Gaussian data with application to WiFi indoor positioning.
Proceedings of the IEEE International Conference on Acoustics, 2013

Improved single-channel nonstationary noise tracking by an optimized MAP-based postprocessor.
Proceedings of the IEEE International Conference on Acoustics, 2013

Map-based estimation of the parameters of a Gaussian Mixture Model in the presence of noisy observations.
Proceedings of the IEEE International Conference on Acoustics, 2013

GMM-based significance decoding.
Proceedings of the IEEE International Conference on Acoustics, 2013

Blind speech separation exploiting temporal and spectral correlations using 2D-HMMs.
Proceedings of the 21st European Signal Processing Conference, 2013

Sampling rate synchronisation in acoustic sensor networks with a pre-trained clock skew error model.
Proceedings of the 21st European Signal Processing Conference, 2013

A hidden Markov model for indoor user tracking based on WiFi fingerprinting and step detection.
Proceedings of the 21st European Signal Processing Conference, 2013

On acoustic channel identification in multi-microphone systems via adaptive blind signal enhancement techniques.
Proceedings of the 21st European Signal Processing Conference, 2013

A hierarchical system for word discovery exploiting DTW-based initialization.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

Unsupervised word segmentation from noisy input.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Exploiting Temporal Correlations in Joint Multichannel Speech Separation and Noise Suppression Using Hidden Markov Models.
Proceedings of the IWAENC 2012 - International Workshop on Acoustic Signal Enhancement, Proceedings, RWTH Aachen University, Germany, September 4th, 2012

Microphone Array Position Self-Calibration from Reverberant Speech Input.
Proceedings of the IWAENC 2012 - International Workshop on Acoustic Signal Enhancement, Proceedings, RWTH Aachen University, Germany, September 4th, 2012

Bayesian Feature Enhancement for ASR of Noisy Reverberant Real-World Data.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Improved noise power spectral density tracking by a MAP-based postprocessor.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Investigations Into a Statistical Observation Model for Logarithmic Mel Power Spectral Density Features of Noisy Reverberant Speech.
Proceedings of the 10th ITG Conference on Speech Communication, 2012

Quality Analysis and Optimization of the MAP-based Noise Power Spectral Density Tracker.
Proceedings of the 10th ITG Conference on Speech Communication, 2012

Reverberant Speech Recognition.
Proceedings of the Techniques for Noise Robustness in Automatic Speech Recognition, 2012

2011
Speech Enhancement With a GSC-Like Structure Employing Eigenvector-Based Transfer Function Ratios Estimation.
IEEE Trans. Speech Audio Process., 2011

Adaptive systems for unsupervised speaker tracking and speech recognition.
Evol. Syst., 2011

On Initial Seed Selection for Frequency Domain Blind Speech Separation.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Unsupervised Geometry Calibration of Acoustic Sensor Networks Using Source Correspondences.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Unsupervised Learning of Acoustic Events Using Dynamic Time Warping and Hierarchical K-Means++ Clustering.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

A Versatile Gaussian Splitting Approach to Non-Linear State Estimation and its Application to Noise-Robust ASR.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

MAP-based estimation of the parameters of non-stationary Gaussian processes from noisy observations.
Proceedings of the IEEE International Conference on Acoustics, 2011

Conditional Bayesian Estimation Employing a Phase-Sensitive Observation Model for Noise Robust Speech Recognition.
Proceedings of the Robust Speech Recognition of Uncertain or Missing Data, 2011

A Model-Based Approach to Joint Compensation of Noise and Reverberation for Speech Recognition.
Proceedings of the Robust Speech Recognition of Uncertain or Missing Data, 2011

Introduction.
Proceedings of the Robust Speech Recognition of Uncertain or Missing Data, 2011

Uncertainty Decoding and Conditional Bayesian Estimation.
Proceedings of the Robust Speech Recognition of Uncertain or Missing Data, 2011

2010
Model-Based Feature Enhancement for Reverberant Speech Recognition.
IEEE Trans. Speech Audio Process., 2010

Introduction to the Issue on Speech Processing for Natural Interaction With Intelligent Environments.
IEEE J. Sel. Top. Signal Process., 2010

Online Diarization of Streaming Audio-Visual Data for Smart Environments.
IEEE J. Sel. Top. Signal Process., 2010

Barometric height estimation combined with map-matching in a loosely-coupled Kalman-filter.
Proceedings of the 7th Workshop on Positioning Navigation and Communication, 2010

Ungrounded independent non-negative factor analysis.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

On the exploitation of hidden Markov models and linear dynamic models in a hybrid decoder architecture for continuous speech recognition.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Blind speech separation employing directional statistics in an Expectation Maximization framework.
Proceedings of the IEEE International Conference on Acoustics, 2010

On the Initialization of Dynamic Models for Speech Features.
Proceedings of the 9. ITG-Fachtagung Sprachkommunikation 2010, 2010

2009
Parameter Estimation of a State-Space Model of Noise for Robust Speech Recognition.
IEEE Trans. Speech Audio Process., 2009

Approaches to Iterative Speech Feature Enhancement and Recognition.
IEEE Trans. Speech Audio Process., 2009

A GPS positioning approach exploiting GSM velocity estimates.
Proceedings of the 6th Workshop on Positioning, Navigation and Communication, 2009

Robust vehicle localization based on multi-level sensor fusion and online parameter estimation.
Proceedings of the 6th Workshop on Positioning, Navigation and Communication, 2009

Joint Parameter Estimation and Tracking in a Multi-Stage Kalman Filter for Vehicle Positioning.
Proceedings of the 69th IEEE Vehicular Technology Conference, 2009

Fusing audio and video information for online speaker diarization.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

An analytic derivation of a phase-sensitive observation model for noise robust speech recognition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Model based feature enhancement for automatic speech recognition in reverberant environments.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008
A Novel Uncertainty Decoding Rule With Applications to Transmission Error Robust Speech Recognition.
IEEE Trans. Speech Audio Process., 2008

Modeling the dynamics of speech and noise for speech feature enhancement in ASR.
Proceedings of the IEEE International Conference on Acoustics, 2008

Speech enhancement with a new generalized eigenvector blocking matrix for application in a generalized sidelobe canceller.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
A Novel Similarity Measure for Positioning Cellular Phones by a Comparison With a Database of Signal Power Levels.
IEEE Trans. Veh. Technol., 2007

Blind Acoustic Beamforming Based on Generalized Eigenvalue Decomposition.
IEEE Trans. Speech Audio Process., 2007

Velocity Estimation of Mobile Terminals by Exploiting GSM Downlink Signalling.
Proceedings of the 4th Workshop on Positioning, Navigation and Communication, 2007

Combined time and frequency domain OFDM channel estimation.
Proceedings of the Multi-Carrier Spread Spectrum 2007, 2007

Channel estimation by exploiting sublayer information in OFDM systems.
Proceedings of the Multi-Carrier Spread Spectrum 2007, 2007

An approach to iterative speech feature enhancement and recognition.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Blind adaptive principal eigenvector beamforming for acoustical source separation.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Joint speaker segmentation, localization and identification for streaming audio.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Multi-resolution soft features for channel-robust distributed speech recognition.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

OFDM Channel Estimation Based on Combined Estimation in Time and Frequency Domain.
Proceedings of the IEEE International Conference on Acoustics, 2007

Amigo Context Management Service with Applications in Ambient Communication Scenarios.
Proceedings of the Constructing Ambient Intelligence, 2007

2006
Uncertainty decoding for distributed speech recognition over error-prone networks.
Speech Commun., 2006

Online speaker change detection by combining BIC with microphone array beamforming.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Improved source modeling and predictive classification for channel robust speech recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Iterative Speech Enhancement using a Non-Linear Dynamic State Model of Speech and its Parameters.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

An Inexpensive Packet Loss Compensation Scheme for Distributed Speech Recognition Based on Soft-Features.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Unified probabilistic approach to error concealment for distributed speech recognition.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

A comparison of particle filtering variants for speech feature enhancement.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Speech processing in the networked home environment - a view on the amigo project.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Acoustic filter-and-sum beamforming by adaptive principal component analysis.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

A Comparison of Soft-Feature Distributed Speech Recognition with Candidate Codecs for Speech Enabled Mobile Services.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Robust speaker direction estimation with particle filtering.
Proceedings of the IEEE 6th Workshop on Multimedia Signal Processing, 2004

Adaptive beamforming combined with particle filtering for acoustic source localization.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Soft features for improved distributed speech recognition over wireless networks.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

2002
Large vocabulary continuous speech recognition of Broadcast News - The Philips/RWTH approach.
Speech Commun., 2002

Employment of a multipath receiver structure in a combined GALILEO/UMTS receiver.
Proceedings of the 55th IEEE Vehicular Technology Conference, 2002

2001
Automatic generation of phonetic regression class trees for MLLR adaptation.
IEEE Trans. Speech Audio Process., 2001

Multiclass Linear Dimension Reduction by Weighted Pairwise Fisher Criteria.
IEEE Trans. Pattern Anal. Mach. Intell., 2001

2000
Multi-class linear dimension reduction by generalized Fisher criteria.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Data-driven phonetic regression class tree estimation for MLLR adaptation.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Multi-Class Linear Feature Extraction by Nonlinear PCA.
Proceedings of the 15th International Conference on Pattern Recognition, 2000

LDA derived cepstral trajectory filters in adverse environmental conditions.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
A study of broadcast news audio stream segmentation and segment clustering.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

An investigation of cepstral parameterisations for large vocabulary speech recognition.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

The philips/RWTH system for transcription of broadcast news.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Investigations on inter-speaker variability in the feature space.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998
A study on speaker normalization using vocal tract normalization and speaker adaptive training.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
The development of a command-based speech interface for a telephone answering machine.
Speech Commun., 1997

Acoustic front ends for speaker-independent digit recognition in car environments.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Robust speech recognition for wireless networks and mobile telephony.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

European speech databases for telephone applications.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Signal representations for hidden Markov model based online handwriting recognition.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996
FRESCO: the French telephone speech data collection - part of the european Speechdat(m) project.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

A comparative study of linear feature transformation techniques for automatic speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

1995
Continuous speech dictation - From theory to practice.
Speech Commun., 1995

Human factors of a voice-controlled car stereo.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Automatic transcription of unknown words in a speech recognition system.
Proceedings of the 1995 International Conference on Acoustics, 1995

Application of clustering techniques to mixture density modelling for continuous-speech recognition.
Proceedings of the 1995 International Conference on Acoustics, 1995

1994
Improvements in beam search for 10000-word continuous-speech recognition.
IEEE Trans. Speech Audio Process., 1994

An Overview of the Philips Research System for Large Vocabulary Continuous Speech Recognition.
Int. J. Pattern Recognit. Artif. Intell., 1994

1993
Design and use of speech recognition algorithms for a mobile radio telephone.
Speech Commun., 1993

The Philips research system for large-vocabulary continuous-speech recognition.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Improvements in connected digit recognition using linear discriminant analysis and mixture densities.
Proceedings of the IEEE International Conference on Acoustics, 1993

Continuous mixture densities and linear discriminant analysis for improved context-dependent acoustic models.
Proceedings of the IEEE International Conference on Acoustics, 1993

1992
A modified trellis coding technique for partial response channels.
IEEE Trans. Commun., 1992

Trellis Codes for Partial-Response Magnetooptical Direct Overwrite Recording.
IEEE J. Sel. Areas Commun., 1992

Improvements in beam search for 10000-word continuous speech recognition.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

Linear discriminant analysis for improved large vocabulary continuous speech recognition.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1991
A look-ahead search technique for large vocabulary continuous speech recognition.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

1989
A systematic approach to carrier recovery and detection of digitally phase modulated signals of fading channels.
IEEE Trans. Commun., 1989


  Loading...