Zoltán Tüske

According to our database1, Zoltán Tüske authored at least 65 papers between 2005 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Alternating Weak Triphone/BPE Alignment Supervision from Hybrid Model Improves End-to-End ASR.
CoRR, 2024


2023
Weak Alignment Supervision from Hybrid Model Improves End-to-end ASR.
CoRR, 2023

Speech Translation with Style: AppTek's Submissions to the IWSLT Subtitling and Formality Tracks in 2023.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

2022
Improving End-to-end Models for Set Prediction in Spoken Language Understanding.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
On the Limit of English Conversational Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Improving Customization of Neural Transducers by Mitigating Acoustic Mismatch of Synthesized Audio.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Integrating Dialog History into End-to-End Spoken Language Understanding Systems.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

4-Bit Quantization of LSTM-Based Speech Recognition Models.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Reducing Exposure Bias in Training Recurrent Neural Network Transducers.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Advancing RNN Transducer Technology for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

End-to-End Spoken Language Understanding Using Transformer Networks and Self-Supervised Pre-Trained Features.
Proceedings of the IEEE International Conference on Acoustics, 2021

RNN Transducer Models for Spoken Language Understanding.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Discriminative feature modeling for statistical speech recognition.
PhD thesis, 2020

Single headed attention based sequence-to-sequence model for state-of-the-art results on Switchboard-300.
CoRR, 2020

Single Headed Attention Based Sequence-to-Sequence Model for State-of-the-Art Results on Switchboard.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

End-to-End Spoken Language Understanding Without Full Transcripts.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Alignment-Length Synchronous Decoding for RNN Transducer.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Advancing Sequence-to-Sequence Based Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Detection and Recovery of OOVs for Improved English Broadcast News Captioning.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Challenging the Boundaries of Speech Recognition: The MALACH Corpus.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Forget a Bit to Learn Better: Soft Forgetting for CTC-Based Automatic Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

English Broadcast News Speech Recognition by Humans and Machines.
Proceedings of the IEEE International Conference on Acoustics, 2019

Sequence Noise Injected Training for End-to-end Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Simplified LSTMS for Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Semi-Supervised Training and Data Augmentation for Adaptation of Automatic Broadcast News Captioning Systems.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Evolutionary Stochastic Gradient Descent for Optimization of Deep Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Investigation on LSTM Recurrent N-gram Language Models for Speech Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Acoustic Modeling of Speech Waveform Based on Multi-Resolution, Neural Network Signal Processing.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
The 2016 RWTH Keyword Search System for Low-Resource Languages.
Proceedings of the Speech and Computer - 19th International Conference, 2017

Parallel Neural Network Features for Improved Tandem Acoustic Modeling.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016
Automatic Speech Recognition Based on Neural Networks.
Proceedings of the Speech and Computer - 18th International Conference, 2016

The RWTH Aachen LVCSR system for IWSLT-2016 German Skype conversation recognition task.
Proceedings of the 13th International Conference on Spoken Language Translation, 2016

LSTM, GRU, Highway and a Bit of Attention: An Empirical Overview for Language Modeling in Speech Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Investigation on log-linear interpolation of multi-domain neural network language model.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Improvements in RWTH LVCSR evaluation systems for Polish, Portuguese, English, urdu, and Arabic.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Multilingual features based keyword search for very low-resource languages.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Convolutional neural networks for acoustic modeling of raw time signal in LVCSR.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Integrating Gaussian mixtures into deep neural networks: Softmax layer with hidden variables.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Speaker adaptive joint training of Gaussian mixture models and bottleneck features.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Multilingual representations for low resource speech recognition and keyword search.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Acoustic modeling with deep neural networks using raw time signal for LVCSR.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Data augmentation, feature combination, and multilingual neural networks to improve ASR and KWS performance for low-resource languages.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Lattice decoding and rescoring with long-Span neural network language models.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

RWTH LVCSR systems for quaero and EU-bridge: German, Polish, Spanish and Portuguese.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

The RWTH English lecture recognition system.
Proceedings of the IEEE International Conference on Acoustics, 2014

Multilingual MRASTA features for low-resource keyword search and speech recognition systems.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
The RWTH Aachen German and English LVCSR systems for IWSLT-2013.
Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2013, 2013

Multilingual hierarchical MRASTA features for ASR.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Development of the RWTH transcription system for slovenian.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Deep hierarchical bottleneck MRASTA features for LVCSR.
Proceedings of the IEEE International Conference on Acoustics, 2013

Investigation on cross- and multilingual MLP features under matched and mismatched acoustical conditions.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Phase difference of filter-stable part-tones as acoustic feature.
Proceedings of the IEEE Statistical Signal Processing Workshop, 2012

Context-Dependent MLPs for LVCSR: TANDEM, Hybrid or Both?
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Non-stationary signal processing and its application in speech recognition.
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012

Posterior-Scaled MPE: Novel Discriminative Training Criteria.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Comparison and combination of different CRBE based MLP features for LVCSR.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
A Study on Speaker Normalized MLP Features in LVCSR.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Non-stationary feature extraction for automatic speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Improved Recognition of Spontaneous Hungarian Speech - Morphological and Acoustic Modeling Techniques for a Less Resourced Task.
IEEE Trans. Speech Audio Process., 2010

2009
Investigation of morph-based speech recognition improvements across speech genres.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2007
Towards Automatic Transcription of Large Spoken Archives in Agglutinating Languages - Hungarian ASR for the MALACH Project.
Proceedings of the Text, Speech and Dialogue, 10th International Conference, 2007

A morpho-graphemic approach for the recognition of spontaneous speech in agglutinative languages - like Hungarian.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

2005
Robust voice activity detection based on the entropy of noise-suppressed spectrum.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Evaluation and optimization of noise robust front-end technologies for the automatic recognition of Hungarian telephone speech.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005


  Loading...