Dmitriy Serdyuk

According to our database1, Dmitriy Serdyuk authored at least 25 papers between 2015 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
USM RNN-T model weights binarization.
CoRR, 2024

Conformer is All You Need for Visual Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Audio-visual fine-tuning of audio-only ASR models.
CoRR, 2023

Conformers are All You Need for Visual Speech Recogntion.
CoRR, 2023

2022
On Robustness to Missing Video for Audiovisual Speech Recognition.
Trans. Mach. Learn. Res., 2022

Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition.
CoRR, 2022

Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Muti-Person Video.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
Audio-Visual Speech Recognition is Worth 32×32×8 Voxels.
CoRR, 2021

Accounting for Variance in Machine Learning Benchmarks.
CoRR, 2021

Audio-Visual Speech Recognition is Worth $32\times 32\times 8$ Voxels.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2018
Fortified Networks: Improving the Robustness of Deep Networks by Modeling the Manifold of Hidden Representations.
CoRR, 2018

Twin Regularization for Online Speech Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

MaD TwinNet: Masker-Denoiser Architecture with Twin Networks for Monaural Sound Source Separation.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Deep Complex Networks.
Proceedings of the 6th International Conference on Learning Representations, 2018

Twin Networks: Matching the Future for Sequence Generation.
Proceedings of the 6th International Conference on Learning Representations, 2018

Towards End-to-end Spoken Language Understanding.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Unsupervised adversarial domain adaptation for acoustic scene classification.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

2017
Twin Networks: Using the Future as a Regularizer.
CoRR, 2017

Deep Complex Networks.
CoRR, 2017

2016
Invariant Representations for Noisy Speech Recognition.
CoRR, 2016

Theano: A Python framework for fast computation of mathematical expressions.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, 2016

End-to-end attention-based large vocabulary speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Blocks and Fuel: Frameworks for deep learning.
CoRR, 2015

Task Loss Estimation for Sequence Prediction.
CoRR, 2015

Attention-Based Models for Speech Recognition.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015


  Loading...