Dmitriy Serdyuk

According to our database, Dmitriy Serdyuk authored at least 25 papers between 2015 and 2024.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2024

USM RNN-T model weights binarization.

,

Dmitriy Serdyuk

,

Chengjian Zheng

CoRR, 2024

Conformer is All You Need for Visual Speech Recognition.

,

,

Dmitriy Serdyuk

,

,

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Audio-visual fine-tuning of audio-only ASR models.

,

Dmitriy Serdyuk

,

Ankit Parag Shah

,

,

CoRR, 2023

Conformers are All You Need for Visual Speech Recognition.

,

,

Dmitriy Serdyuk

,

Ankit Parag Shah

,

CoRR, 2023

2022

On Robustness to Missing Video for Audiovisual Speech Recognition.

,

,

,

Dmitriy Serdyuk

,

Trans. Mach. Learn. Res., 2022

Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition.

Dmitriy Serdyuk

,

,

CoRR, 2022

Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Multi-Person Video.

Dmitriy Serdyuk

,

,

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021

Audio-Visual Speech Recognition is Worth 32×32×8 Voxels.

Dmitriy Serdyuk

,

,

CoRR, 2021

Accounting for Variance in Machine Learning Benchmarks.

Xavier Bouthillier

,

Pierre Delaunay

,

,

,

Brennan Nichyporuk

,

,

,

,

,

,

Samira Ebrahimi Kahou

,

Vincent Michalski

,

Dmitriy Serdyuk

,

,

,

Gaël Varoquaux

,

CoRR, 2021

Audio-Visual Speech Recognition is Worth 32×32×8 Voxels.

Dmitriy Serdyuk

,

,

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2018

Fortified Networks: Improving the Robustness of Deep Networks by Modeling the Manifold of Hidden Representations.

,

,

,

Dmitriy Serdyuk

,

Sandeep Subramanian

,

Ioannis Mitliagkas

,

CoRR, 2018

Twin Regularization for Online Speech Recognition.

Mirco Ravanelli

,

Dmitriy Serdyuk

,

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

MaD TwinNet: Masker-Denoiser Architecture with Twin Networks for Monaural Sound Source Separation.

Konstantinos Drossos

,

Stylianos Ioannis Mimilakis

,

Dmitriy Serdyuk

,

Gerald Schuller

,

Tuomas Virtanen

,

Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Deep Complex Networks.

Chiheb Trabelsi

,

,

,

Dmitriy Serdyuk

,

Sandeep Subramanian

,

João Felipe Santos

,

,

Negar Rostamzadeh

,

,

Christopher J. Pal

Proceedings of the 6th International Conference on Learning Representations, 2018

Twin Networks: Matching the Future for Sequence Generation.

Dmitriy Serdyuk

,

Nan Rosemary Ke

,

Alessandro Sordoni

,

,

,

Proceedings of the 6th International Conference on Learning Representations, 2018

Towards End-to-end Spoken Language Understanding.

Dmitriy Serdyuk

,

,

Christian Fuegen

,

,

,

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Unsupervised adversarial domain adaptation for acoustic scene classification.

,

Konstantinos Drossos

,

,

Dmitriy Serdyuk

,

Tuomas Virtanen

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

2017

Twin Networks: Using the Future as a Regularizer.

Dmitriy Serdyuk

,

Nan Rosemary Ke

,

Alessandro Sordoni

,

,

CoRR, 2017

Deep Complex Networks.

Chiheb Trabelsi

,

,

Dmitriy Serdyuk

,

Sandeep Subramanian

,

João Felipe Santos

,

,

Negar Rostamzadeh

,

,

Christopher J. Pal

CoRR, 2017

2016

Invariant Representations for Noisy Speech Recognition.

Dmitriy Serdyuk

,

Kartik Audhkhasi

,

Philemon Brakel

,

Bhuvana Ramabhadran

,

,

CoRR, 2016

Theano: A Python framework for fast computation of mathematical expressions.

,

Guillaume Alain

,

Amjad Almahairi

,

Christof Angermüller

,

Dzmitry Bahdanau

,

,

Frédéric Bastien

,

,

Anatoly Belikov

,

Alexander Belopolsky

,

,

Arnaud Bergeron

,

,

Valentin Bisson

,

Josh Bleecher Snyder

,

Nicolas Bouchard

,

Nicolas Boulanger-Lewandowski

,

Xavier Bouthillier

,

Alexandre de Brébisson

,

Olivier Breuleux

,

Pierre Luc Carrier

,

,

,

Paul F. Christiano

,

,

Marc-Alexandre Côté

,

,

Aaron C. Courville

,

Yann N. Dauphin

,

Olivier Delalleau

,

,

Guillaume Desjardins

,

Sander Dieleman

,

,

Melanie Ducoffe

,

Vincent Dumoulin

,

Samira Ebrahimi Kahou

,

,

,

,

Mathieu Germain

,

,

Ian J. Goodfellow

,

,

Çaglar Gülçehre

,

,

Iban Harlouchet

,

Jean-Philippe Heng

,

,

,

,

Sébastien Jean

,

,

Mikhail Korobov

,

,

,

,

,

,

,

Simon Lefrançois

,

,

Nicholas Léonard

,

,

Jesse A. Livezey

,

,

,

,

Pierre-Antoine Manzagol

,

Olivier Mastropietro

,

Robert McGibbon

,

Roland Memisevic

,

Bart van Merriënboer

,

Vincent Michalski

,

,

Alberto Orlandi

,

Christopher Joseph Pal

,

,

Mohammad Pezeshki

,

,

,

Matthew Rocklin

,

,

,

,

,

François Savard

,

,

,

Gabriel Schwartz

,

Iulian Vlad Serban

,

Dmitriy Serdyuk

,

Samira Shabanian

,

,

Sigurd Spieckermann

,

S. Ramana Subramanyam

,

Jakub Sygnowski

,

Jérémie Tanguay

,

Gijs van Tulder

,

Joseph P. Turian

,

Sebastian Urban

,

,

Francesco Visin

,

,

David Warde-Farley

,

,

Matthew Willson

,

,

,

,

,

CoRR, 2016

End-to-end attention-based large vocabulary speech recognition.

Dzmitry Bahdanau

,

,

Dmitriy Serdyuk

,

Philemon Brakel

,

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Blocks and Fuel: Frameworks for deep learning.

Bart van Merriënboer

,

Dzmitry Bahdanau

,

Vincent Dumoulin

,

Dmitriy Serdyuk

,

David Warde-Farley

,

,

CoRR, 2015

Task Loss Estimation for Sequence Prediction.

Dzmitry Bahdanau

,

Dmitriy Serdyuk

,

Philemon Brakel

,

Nan Rosemary Ke

,

,

Aaron C. Courville

,

CoRR, 2015

Attention-Based Models for Speech Recognition.

,

Dzmitry Bahdanau

,

Dmitriy Serdyuk

,

,

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015
