Lukas Drude

Leif Rädel

Volker Leutnant

CoRR, 2024

Promptformer: Prompted Conformer Transducer for ASR.

Alejandro Gomez-Alanis

Leif Rädel

Volker Leutnant

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Multi-View Frequency-Attention Alternative to CNN Frontends for Automatic Speech Recognition.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022

Contextual-Utterance Training for Automatic Speech Recognition.

Alejandro Gómez Alanís

Rupak Vignesh Swaminathan

Simon Wiesler

CoRR, 2022

Contextual-Utterance Training for Automatic Speech Recognition.

Alejandro Gomez-Alanis

Rupak Vignesh Swaminathan

Simon Wiesler

Proceedings of the 6th International Conference, 2022

2021

Far-Field Automatic Speech Recognition.

Proc. IEEE, 2021

Multi-Channel Opus Compression for Far-Field Automatic Speech Recognition with a Fixed Bitrate Budget.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020

Integration of neural networks and probabilistic spatial models for acoustic blind source separation.

PhD thesis, 2020

Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

End-to-End Training of Time Domain Audio Separation and Recognition.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Demystifying TasNet: A Dissecting Approach.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation.

IEEE J. Sel. Top. Signal Process., 2019

SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition.

CoRR, 2019

Unsupervised Training of Neural Mask-Based Beamforming.

Jahn Heymann

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Joint Optimization of Neural Network-based WPE Dereverberation and Acoustic Model for Robust Online ASR.

Proceedings of the IEEE International Conference on Acoustics, 2019

Unsupervised Training of a Deep Clustering Model for Multichannel Blind Source Separation.

Daniel Hasenklever

Proceedings of the IEEE International Conference on Acoustics, 2019

Weakly Supervised Sound Activity Detection and Event Classification in Acoustic Sensor Networks.

Proceedings of the 8th IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, 2019

2018

Frame-Online DNN-WPE Dereverberation.

Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Integrating Neural Network Based Beamforming and Weighted Prediction Error Dereverberation.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Listening to Each Speaker One by One with Recurrent Selective Hearing Networks.

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Deep Attractor Networks for Speaker Re-Identification and Blind Source Separation.

Thilo von Neumann

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Dual Frequency- and Block-Permutation Alignment for Deep Learning Based Block-Online Blind Source Separation.

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

NARA-WPE: A Python package for weighted prediction error dereverberation in Numpy and Tensorflow for online and offline processing.

Proceedings of the 13th ITG Symposium on Speech Communication, 2018

2017

A generic neural acoustic beamforming architecture for robust multi-channel speech processing.

Jahn Heymann

Comput. Speech Lang., 2017

Directional Statistics and Filtering Using libDirectional.

CoRR, 2017

The Incredible Shrinking Neural Network: New Perspectives on Learning Representations Through The Lens of Pruning.

CoRR, 2017

On the Computation of Complex-valued Gradients with Application to Statistically Optimum Beamforming.

CoRR, 2017

Multi-stage coherence drift based sampling rate synchronization for acoustic beamforming.

Joerg Schmalenstroeer

Proceedings of the 19th IEEE International Workshop on Multimedia Signal Processing, 2017

Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Tight Integration of Spatial and Spectral Features for BSS with Deep Clustering Embeddings.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Beamnet: End-to-end training of a beamformer-supported multi-channel ASR system.

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Optimizing neural-network supported acoustic beamforming by algorithmic differentiation.

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016

On the Appropriateness of Complex-Valued Neural Networks for Speech Enhancement.

Bhiksha Raj

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Neural network based spectral mask estimation for acoustic beamforming.

Jahn Heymann

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Blind speech separation based on complex spherical k-mode clustering.

Christoph Böddeker

Mohammad Mahdi Momenzadeh

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Factor Graph Decoding for Speech Presence Probability Estimation.

Thomas Glarner

Proceedings of the 12th ITG Symposium on Speech Communication, 2016

Noise-Presence-Probability-Based Noise PSD Estimation by Using DNNs.

Proceedings of the 12th ITG Symposium on Speech Communication, 2016

2015

Source counting in speech mixtures by nonparametric Bayesian estimation of an infinite Gaussian mixture model.

Oliver Walter

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

DOA-estimation based on a complex Watson kernel method.

Florian Jacob