Timo Gerkmann
ORCID: 0000-0002-8678-4699
Affiliations:
- University of Hamburg, Germany
According to our database, Timo Gerkmann authored at least 162 papers between 2006 and 2024.
Bibliography
2024
IEEE Trans. Image Process., 2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
End-to-End Label Uncertainty Modeling in Speech Emotion Recognition Using Bayesian Neural Networks and Label Distribution Learning.
IEEE Trans. Affect. Comput., 2024
Mask-Weighted Spatial Likelihood Coding for Speaker-Independent Joint Localization and Mask Estimation.
CoRR, 2024
Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech.
CoRR, 2024
Dynamics of Collective Group Affect: Group-level Annotations and the Multimodal Modeling of Convergence and Divergence.
CoRR, 2024
Unsupervised Blind Joint Dereverberation and Room Acoustics Estimation with Diffusion Models.
CoRR, 2024
EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation.
CoRR, 2024
CoRR, 2024
Proceedings of the 18th International Workshop on Acoustic Signal Enhancement, 2024
Proceedings of the 18th International Workshop on Acoustic Signal Enhancement, 2024
Meta-Learning For Variable Array Configurations in End-to-End Few-Shot Multichannel Speech Enhancement.
Proceedings of the 18th International Workshop on Acoustic Signal Enhancement, 2024
Proceedings of the 18th International Workshop on Acoustic Signal Enhancement, 2024
Uncertainty-Based Remixing for Unsupervised Domain Adaptation in Deep Speech Enhancement.
Proceedings of the 18th International Workshop on Acoustic Signal Enhancement, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
EMOCONV-Diff: Diffusion-Based Speech Emotion Conversion for Non-Parallel and in-the-Wild Data.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
A neural network-supported two-stage algorithm for lightweight dereverberation on hearing devices.
EURASIP J. Audio Speech Music. Process., December, 2023
IEEE ACM Trans. Audio Speech Lang. Process., 2023
IEEE ACM Trans. Audio Speech Lang. Process., 2023
StoRM: A Diffusion-Based Stochastic Regeneration Model for Speech Enhancement and Dereverberation.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
IEEE ACM Trans. Audio Speech Lang. Process., 2023
CoRR, 2023
On the Behavior of Intrusive and Non-intrusive Speech Enhancement Metrics in Predictive and Generative Settings.
CoRR, 2023
In-the-wild Speech Emotion Conversion Using Disentangled Self-Supervised Representations and Neural Vocoder-based Resynthesis.
CoRR, 2023
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023
Leveraging Semantic Information for Efficient Self-Supervised Emotion Recognition with Audio-Textual Distilled Models.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Extending DNN-based Multiplicative Masking to Deep Subband Filtering for Improved Dereverberation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Reducing the Prior Mismatch of Stochastic Differential Equations for Diffusion-based Speech Enhancement.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Acoustic and Visual Knowledge Distillation for Contrastive Audio-Visual Localization.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Analysing Diffusion-based Generative Approaches Versus Discriminative Approaches for Speech Restoration.
Proceedings of the IEEE International Conference on Acoustics, 2023
Partially Adaptive Multichannel Joint Reduction of Ego-Noise and Environmental Noise.
Proceedings of the IEEE International Conference on Acoustics, 2023
Uncertainty Estimation in Deep Speech Enhancement Using Complex Gaussian Mixture Models.
Proceedings of the IEEE International Conference on Acoustics, 2023
2022
End-To-End Optimization of Online Neural Network-supported Two-Stage Dereverberation for Hearing Devices.
CoRR, 2022
Proceedings of the 24th IEEE International Workshop on Multimedia Signal Processing, 2022
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
On the Role of Spatial, Spectral, and Temporal Processing for DNN-based Non-linear Multi-channel Speech Enhancement.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
End-To-End Label Uncertainty Modeling for Speech-based Arousal Recognition Using Bayesian Neural Networks.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Efficient Transformer-based Speech Enhancement Using Long Frames and STFT Magnitudes.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Neural Network-augmented Kalman Filtering for Robust Online Speech Dereverberation in Noisy Reverberant Environments.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the International Joint Conference on Neural Networks, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Customizable End-To-End Optimization Of Online Neural Network-Supported Dereverberation For Hearing Devices.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Label Uncertainty Modeling and Prediction for Speech Emotion Recognition using t-Distributions.
Proceedings of the 10th International Conference on Affective Computing and Intelligent Interaction, 2022
2021
Efficient Joint Estimation of Tracer Distribution and Background Signals in Magnetic Particle Imaging Using a Dictionary Approach.
IEEE Trans. Medical Imaging, 2021
IEEE ACM Trans. Audio Speech Lang. Process., 2021
SNR-Based Features and Diverse Training Data for Robust DNN-Based Speech Enhancement.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
End-to-end label uncertainty modeling for speech emotion recognition using Bayesian neural networks.
CoRR, 2021
Disentanglement Learning for Variational Autoencoders Applied to Audio-Visual Speech Enhancement.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021
Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the 18th IEEE International Symposium on Biomedical Imaging, 2021
See the Silence: Improving Visual-Only Voice Activity Detection by Optical Flow and RGB Fusion.
Proceedings of the Computer Vision Systems - 13th International Conference, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the 14th ITG Conference on Speech Communication, online, September 29, 2021
Proceedings of the 14th ITG Conference on Speech Communication, online, September 29, 2021
An Integrated Deep Clustering-Based System for Speaker Count Agnostic Speech Separation.
Proceedings of the 14th ITG Conference on Speech Communication, online, September 29, 2021
Joint Reduction of Ego-noise and Environmental Noise with a Partially-adaptive Dictionary.
Proceedings of the 14th ITG Conference on Speech Communication, online, September 29, 2021
2020
Frontiers Robotics AI, 2020
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Improving mix-and-separate training in audio-visual sound source separation with an object prior.
Proceedings of the 25th International Conference on Pattern Recognition, 2020
Nonlinear Spatial Filtering for Multichannel Speech Enhancement in Inhomogeneous Noise Fields.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
CoRR, 2019
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
An Analysis of Noise-aware Features in Combination with the Size and Diversity of Training Data for DNN-based Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2019
2018
On the Importance of Super-Gaussian Speech Priors for Machine-Learning Based Speech Enhancement.
IEEE ACM Trans. Audio Speech Lang. Process., 2018
IEEE ACM Trans. Audio Speech Lang. Process., 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
A Study on the Benefits of Phase-Aware Speech Enhancement in Challenging Noise Scenarios.
Proceedings of the Latent Variable Analysis and Signal Separation, 2018
Proceedings of the 13th ITG Symposium on Speech Communication, 2018
2017
An Analysis of Adaptive Recursive Smoothing with Applications to Noise PSD Estimation.
IEEE ACM Trans. Audio Speech Lang. Process., 2017
CoRR, 2017
CoRR, 2017
On the Importance of Super-Gaussian Speech Priors for Pre-Trained Speech Enhancement.
CoRR, 2017
MixMax Approximation as a Super-Gaussian Log-Spectral Amplitude Estimator for Speech Enhancement.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
2016
On MMSE-Based Estimation of Amplitude and Complex Speech Spectral Coefficients Under Phase-Uncertainty.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Fundamental Frequency Informed Speech Enhancement in a Flexible Statistical Framework.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016
Bias correction methods for adaptive recursive smoothing with applications in noise PSD estimation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
A Combination of Pre-Trained Approaches and Generic Methods for an Improved Speech Enhancement.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016
Combined Single-Microphone Wiener and MVDR Filtering based on Speech Interframe Correlations and Speech Presence Probability.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016
2015
IEEE ACM Trans. Audio Speech Lang. Process., 2015
Two-Stage Filter-Bank System for Improved Single-Channel Noise Reduction in Hearing Aids.
IEEE ACM Trans. Audio Speech Lang. Process., 2015
IEEE ACM Trans. Audio Speech Lang. Process., 2015
IEEE Signal Process. Mag., 2015
Front-end technologies for robust ASR in reverberant environments - spectral enhancement-based dereverberation and auditory modulation filterbank features.
EURASIP J. Adv. Signal Process., 2015
Combination of MVDR beamforming and single-channel spectral processing for enhancing noisy and reverberant speech.
EURASIP J. Adv. Signal Process., 2015
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015
MMSE-optimal combination of Wiener filtering and harmonic model based speech enhancement in a general framework.
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Multi-channel PSD estimators for speech dereverberation - A theoretical and experimental comparison.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Utilizing spectro-temporal correlations for an improved speech presence probability based noise power estimation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Multi-channel linear prediction-based speech dereverberation with low-rank power spectrogram approximation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 23rd European Signal Processing Conference, 2015
2014
Bayesian Estimation of Clean Speech Spectral Coefficients Given a Priori Knowledge of the Phase.
IEEE Trans. Signal Process., 2014
STFT phase reconstruction in voiced speech for an improved single-channel speech enhancement.
IEEE ACM Trans. Audio Speech Lang. Process., 2014
Subjective speech quality and speech intelligibility evaluation of single-channel dereverberation algorithms.
Proceedings of the 14th International Workshop on Acoustic Signal Enhancement, 2014
Proceedings of the 14th International Workshop on Acoustic Signal Enhancement, 2014
Proceedings of the 14th International Workshop on Acoustic Signal Enhancement, 2014
Speech dereverberation with convolutive transfer function approximation using MAP and variational deconvolution approaches.
Proceedings of the 14th International Workshop on Acoustic Signal Enhancement, 2014
A study on speech quality and speech intelligibility measures for quality assessment of single-channel dereverberation algorithms.
Proceedings of the 14th International Workshop on Acoustic Signal Enhancement, 2014
A posteriori speech presence probability estimation based on averaged observations and a super-Gaussian speech model.
Proceedings of the 14th International Workshop on Acoustic Signal Enhancement, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Frequency-domain single-channel inverse filtering for speech dereverberation: Theory and practice.
Proceedings of the IEEE International Conference on Acoustics, 2014
MMSE-optimal enhancement of complex speech coefficients with uncertain prior knowledge of the clean speech phase.
Proceedings of the IEEE International Conference on Acoustics, 2014
Speech dereverberation with multi-channel linear prediction and sparse priors for the desired signal.
Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014
A speech presence probability estimator based on fixed priors and a heavy-tailed speech model.
Proceedings of the 22nd European Signal Processing Conference, 2014
Efficient Multi-Channel Acoustic Echo Cancellation Using Constrained Sparse Filter Updates in the Subband Domain.
Proceedings of the 11th ITG Symposium on Speech Communication, 2014
2013
Synthesis Lectures on Speech and Audio Processing, Morgan & Claypool Publishers, ISBN: 978-3-031-02564-8, 2013
IEEE Signal Process. Lett., 2013
Privacy-preserving distributed speech enhancement for wireless sensor networks by processing in the encrypted domain.
Proceedings of the IEEE International Conference on Acoustics, 2013
On the relation between speech corruption models in the spectral and the cepstral domain.
Proceedings of the IEEE International Conference on Acoustics, 2013
Phase-sensitive real-time capable speech enhancement under voiced-unvoiced uncertainty.
Proceedings of the 21st European Signal Processing Conference, 2013
Proceedings of the 21st European Signal Processing Conference, 2013
2012
IEEE Trans. Speech Audio Process., 2012
Unbiased MMSE-Based Noise Power Estimation With Low Complexity and Low Tracking Delay.
IEEE Trans. Speech Audio Process., 2012
Proceedings of the IWAENC 2012 - International Workshop on Acoustic Signal Enhancement, Proceedings, RWTH Aachen University, Germany, September 4th, 2012
Proceedings of the IWAENC 2012 - International Workshop on Acoustic Signal Enhancement, Proceedings, RWTH Aachen University, Germany, September 4th, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
2011
A new linear MMSE filter for single channel speech enhancement based on Nonnegative Matrix Factorization.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011
Blind source separation of nondisjoint sources in the time-frequency domain with model-based determination of source contribution.
Proceedings of the 2011 IEEE International Symposium on Signal Processing and Information Technology, 2011
A new approach for speech enhancement based on a constrained Nonnegative Matrix Factorization.
Proceedings of the International Symposium on Intelligent Signal Processing and Communications Systems, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the 19th European Signal Processing Conference, 2011
2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Musical genre classification based on a highly-resolved cepstral modulation spectrum.
Proceedings of the 18th European Signal Processing Conference, 2010
Proceedings of the 9. ITG-Fachtagung Sprachkommunikation 2010, 2010
2009
On the statistics of spectral amplitudes after variance reduction by temporal cepstrum smoothing and cepstral nulling.
IEEE Trans. Signal Process., 2009
Multi-microphone maximum a posteriori fundamental frequency estimation in the cepstral domain.
Proceedings of the IEEE International Conference on Acoustics, 2009
2008
Improved A Posteriori Speech Presence Probability Estimation Based on a Likelihood Ratio With Fixed Priors.
IEEE Trans. Speech Audio Process., 2008
A novel a priori SNR estimation approach based on selective cepstro-temporal smoothing.
Proceedings of the IEEE International Conference on Acoustics, 2008
2007
Cepstral Smoothing of Spectral Filter Gains for Speech Enhancement Without Musical Noise.
IEEE Signal Process. Lett., 2007
2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006