We stand with Ukraine

We stand with Ukraine

Yatharth Saraf

According to our database¹, Yatharth Saraf authored at least 28 papers between 2005 and 2022.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2022

Pushing the performances of ASR models on English and Spanish accents.

[BibT_eX]

[DOI]

,

Morgane Rivière

,

,

,

CoRR, 2022

Improving Data Driven Inverse Text Normalization using Data Augmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2022

Scaling ASR Improves Zero and Few Shot Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Christian Fuegen

,

,

,

Abdelrahman Mohamed

Proceedings of the Interspeech 2022, 2022

XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale.

[BibT_eX]

[DOI]

,

,

,

Kushal Lakhotia

,

,

,

,

Patrick von Platen

,

,

,

,

,

Proceedings of the Interspeech 2022, 2022

Improved Language Identification Through Cross-Lingual Self-Supervised Learning.

[BibT_eX]

[DOI]

,

Diptanu Gon Choudhury

,

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2022

Conformer-Based Self-Supervised Learning For Non-Speech Audio Tasks.

[BibT_eX]

[DOI]

Sangeeta Srivastava

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2022

Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions.

[BibT_eX]

[DOI]

,

Michael Picheny

,

,

,

,

,

,

Andres Alvarado

,

,

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, 2021

Improved Language Identification Through Cross-Lingual Self-Supervised Learning.

[BibT_eX]

[DOI]

,

Diptanu Gon Choudhury

,

,

,

,

,

,

CoRR, 2021

Benchmarking LF-MMI, CTC And RNN-T Criteria For Streaming ASR.

[BibT_eX]

[DOI]

,

,

,

,

,

Pradyot Prakash

,

,

,

,

,

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Dual Application of Speech Enhancement for Automatic Speech Recognition.

[BibT_eX]

[DOI]

Ashutosh Pandey

,

,

,

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Improving RNN Transducer Based ASR with Auxiliary Tasks.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Christian Fuegen

,

,

,

Michael L. Seltzer

Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

A Multi-View Approach to Audio-Visual Speaker Verification.

[BibT_eX]

[DOI]

,

,

,

Lorenzo Torresani

,

,

Proceedings of the IEEE International Conference on Acoustics, 2021

On Lattice-Free Boosted MMI Training of HMM and CTC-Based Full-Context ASR Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Kaizen: Continuously Improving Teacher Using Exponential Moving Average for Semi-Supervised Speech Recognition.

[BibT_eX]

[DOI]

,

Tatiana Likhomanenko

,

,

,

Ronan Collobert

,

,

,

Abdelrahman Mohamed

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020

Contextual RNN-T For Open Domain ASR.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2020

Fast, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2020

Multilingual Graphemic Hybrid ASR with Massive Data Augmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

Faster, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Interspeech 2020, 2020

Large Scale Weakly and Semi-Supervised Learning for Low-Resource Video ASR.

[BibT_eX]

[DOI]

,

,

,

,

Ross B. Girshick

,

Vitaliy Liptchinsky

,

Christian Fuegen

,

,

,

Abdelrahman Mohamed

Proceedings of the Interspeech 2020, 2020

Contextualizing ASR Lattice Rescoring with Hybrid Pointer Network Language Model.

[BibT_eX]

[DOI]

,

,

,

Gabriel Synnaeve

,

,

Proceedings of the Interspeech 2020, 2020

Contextual RNN-T for Open Domain ASR.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Interspeech 2020, 2020

Training ASR Models By Generation of Contextual Information.

[BibT_eX]

[DOI]

,

,

,

,

,

Ross B. Girshick

,

,

,

,

,

Abdelrahman Mohamed

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Relevance Ranking for Real-Time Tweet Search.

[BibT_eX]

[DOI]

,

,

,

Juan Caicedo Carvajal

,

,

Bhargav Mangipudi

,

,

Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

2019

Multilingual ASR with Massive Data Augmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2019

2008

Computing the curve-skeletons of images.

[BibT_eX]

[DOI]

,

Raman Balasubramanian

,

Krishnan Swaminathan

Int. J. Comput. Math., 2008

2005

A classical approach for thinning of binary images using divergence of the potential field.

[BibT_eX]

[DOI]

,

Raman Balasubramanian

,

Krishnan Swaminathan

Int. J. Comput. Math., 2005

Loading...