Niko Moritz

According to our database, Niko Moritz authored at least 43 papers between 2011 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses.
CoRR, 2024

AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

Effective Internal Language Model Training and Fusion for Factorized Transducer Model.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Directional Source Separation for Robust Speech Recognition on Smart Glasses.
CoRR, 2023

SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision.
CoRR, 2023

Directional Speech Recognition for Speaker Disambiguation and Cross-talk Suppression.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Streaming Audio-Visual Speech Recognition with Alignment Regularization.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Anchored Speech Recognition with Neural Transducers.
Proceedings of the IEEE International Conference on Acoustics, 2023

SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Momentum Pseudo-Labeling: Semi-Supervised ASR With Continuously Improving Pseudo-Labels.
IEEE J. Sel. Top. Signal Process., 2022

An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Sequence Transduction with Graph-Based Supervision.
Proceedings of the IEEE International Conference on Acoustics, 2022

Advancing Momentum Pseudo-Labeling with Conformer and Initialization Strategy.
Proceedings of the IEEE International Conference on Acoustics, 2022

Extended Graph Temporal Classification for Multi-Speaker End-to-End ASR.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Dual Causal/Non-Causal Self-Attention for Streaming End-to-End Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Advanced Long-Context End-to-End Speech Recognition Using Context-Expanded Transformers.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Semi-Supervised Speech Recognition Via Graph-Based Temporal Classification.
Proceedings of the IEEE International Conference on Acoustics, 2021

Capturing Multi-Resolution Context by Dilated Self-Attention.
Proceedings of the IEEE International Conference on Acoustics, 2021

Unsupervised Domain Adaptation for Speech Recognition via Uncertainty Driven Self-Training.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
All-in-One Transformer: Unifying Speech Recognition, Audio Tagging, and Event Detection.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Transformer-Based Long-Context End-to-End Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Unsupervised Speaker Adaptation Using Attention-Based Speaker Memory for End-to-End ASR.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Streaming Automatic Speech Recognition with the Transformer Model.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Vectorized Beam Search for CTC-Attention-Based Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Unidirectional Neural Network Architectures for End-to-End Automatic Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Triggered Attention for End-to-end Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Streaming End-to-End Speech Recognition with Joint CTC-Attention Based Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Objective Assessment of a Speech Enhancement Scheme with an Automatic Speech Recognition-Based System.
Proceedings of the 13th ITG Symposium on Speech Communication, 2018

2017
Classifier Architectures for Acoustic Scenes and Events: Implications for DNNs, TDNNs, and Perceptual Features from DCASE 2016.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Multi-Channel Speech Enhancement and Amplitude Modulation Analysis for Noise Robust Automatic Speech Recognition.
Comput. Speech Lang., 2017

2016
Integration of Optimized Modulation Filter Sets Into Deep Neural Networks for Automatic Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Probabilistic Spatial Filter Estimation for Signal Enhancement in Multi-Channel Automatic Speech Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Acoustic Scene Classification using Time-Delay Neural Networks and Amplitude Modulation Filter Bank Features.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016

2015
An Auditory Inspired Amplitude Modulation Filter Bank for Robust Feature Extraction in Automatic Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Front-end technologies for robust ASR in reverberant environments - spectral enhancement-based dereverberation and auditory modulation filterbank features.
EURASIP J. Adv. Signal Process., 2015

A CHiME-3 challenge system: Long-term acoustic features for noise robust automatic speech recognition.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Should deep neural nets have ears? The role of auditory features in deep learning approaches.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

2013
On the use of spectro-temporal features for the IEEE AASP challenge 'detection and classification of acoustic scenes and events'.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Analysis of Trabecular Bone Microstructure Using Contour Tree Connectivity.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention - MICCAI 2013, 2013

2012
Amplitude Modulation Filters as Feature Sets for Robust ASR: Constant Absolute or Relative Bandwidth?
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Multimodal Human-Machine Interaction for Service Robots in Home-Care Environments.
Proceedings of the 1st Workshop on Speech and Multimodal Interaction in Assistive Environments, 2012

2011
Amplitude modulation spectrogram based features for robust speech recognition in noisy and reverberant environments.
Proceedings of the IEEE International Conference on Acoustics, 2011

