Lukas Drude

Orcid: 0000-0003-3683-5432

According to our database1, Lukas Drude authored at least 42 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Promptformer: Prompted Conformer Transducer for ASR.
CoRR, 2024

Promptformer: Prompted Conformer Transducer for ASR.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Multi-View Frequency-Attention Alternative to CNN Frontends for Automatic Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022
Contextual-Utterance Training for Automatic Speech Recognition.
CoRR, 2022

Contextual-Utterance Training for Automatic Speech Recognition.
Proceedings of the 6th International Conference, 2022

2021
Far-Field Automatic Speech Recognition.
Proc. IEEE, 2021

Multi-Channel Opus Compression for Far-Field Automatic Speech Recognition with a Fixed Bitrate Budget.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020
Integration of neural networks and probabilistic spatial models for acoustic blind source separation.
PhD thesis, 2020

Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

End-to-End Training of Time Domain Audio Separation and Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Demystifying TasNet: A Dissecting Approach.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation.
IEEE J. Sel. Top. Signal Process., 2019

SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition.
CoRR, 2019

Unsupervised Training of Neural Mask-Based Beamforming.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Joint Optimization of Neural Network-based WPE Dereverberation and Acoustic Model for Robust Online ASR.
Proceedings of the IEEE International Conference on Acoustics, 2019

Unsupervised Training of a Deep Clustering Model for Multichannel Blind Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Weakly Supervised Sound Activity Detection and Event Classification in Acoustic Sensor Networks.
Proceedings of the 8th IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, 2019

2018
Frame-Online DNN-WPE Dereverberation.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Integrating Neural Network Based Beamforming and Weighted Prediction Error Dereverberation.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Listening to Each Speaker One by One with Recurrent Selective Hearing Networks.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Deep Attractor Networks for Speaker Re-Identification and Blind Source Separation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Dual Frequency- and Block-Permutation Alignment for Deep Learning Based Block-Online Blind Source Separation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

NARA-WPE: A Python package for weighted prediction error dereverberation in Numpy and Tensorflow for online and offline processing.
Proceedings of the 13th ITG Symposium on Speech Communication, 2018

2017
A generic neural acoustic beamforming architecture for robust multi-channel speech processing.
Comput. Speech Lang., 2017

Directional Statistics and Filtering Using libDirectional.
CoRR, 2017

The Incredible Shrinking Neural Network: New Perspectives on Learning Representations Through The Lens of Pruning.
CoRR, 2017

On the Computation of Complex-valued Gradients with Application to Statistically Optimum Beamforming.
CoRR, 2017

Multi-stage coherence drift based sampling rate synchronization for acoustic beamforming.
Proceedings of the 19th IEEE International Workshop on Multimedia Signal Processing, 2017

Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Tight Integration of Spatial and Spectral Features for BSS with Deep Clustering Embeddings.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Beamnet: End-to-end training of a beamformer-supported multi-channel ASR system.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Optimizing neural-network supported acoustic beamforming by algorithmic differentiation.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
On the Appropriateness of Complex-Valued Neural Networks for Speech Enhancement.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Neural network based spectral mask estimation for acoustic beamforming.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Blind speech separation based on complex spherical k-mode clustering.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Factor Graph Decoding for Speech Presence Probability Estimation.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

Noise-Presence-Probability-Based Noise PSD Estimation by Using DNNs.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

2015
Source counting in speech mixtures by nonparametric Bayesian estimation of an infinite Gaussian mixture model.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

DOA-estimation based on a complex Watson kernel method.
Proceedings of the 23rd European Signal Processing Conference, 2015

BLSTM supported GEV beamformer front-end for the 3RD CHiME challenge.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Towards online source counting in speech mixtures applying a variational EM for complex Watson mixture models.
Proceedings of the 14th International Workshop on Acoustic Signal Enhancement, 2014

Source counting in speech mixtures using a variational EM approach for complex WATSON mixture models.
Proceedings of the IEEE International Conference on Acoustics, 2014


  Loading...