Yatharth Saraf

According to our database1, Yatharth Saraf authored at least 28 papers between 2005 and 2022.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2022
Pushing the performances of ASR models on English and Spanish accents.
CoRR, 2022

Improving Data Driven Inverse Text Normalization using Data Augmentation.
CoRR, 2022

Scaling ASR Improves Zero and Few Shot Learning.
Proceedings of the Interspeech 2022, 2022

XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale.
Proceedings of the Interspeech 2022, 2022

Improved Language Identification Through Cross-Lingual Self-Supervised Learning.
Proceedings of the IEEE International Conference on Acoustics, 2022

Conformer-Based Self-Supervised Learning For Non-Speech Audio Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2022

Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models.
CoRR, 2021

Improved Language Identification Through Cross-Lingual Self-Supervised Learning.
CoRR, 2021

Benchmarking LF-MMI, CTC And RNN-T Criteria For Streaming ASR.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Dual Application of Speech Enhancement for Automatic Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Improving RNN Transducer Based ASR with Auxiliary Tasks.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

A Multi-View Approach to Audio-Visual Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2021

On Lattice-Free Boosted MMI Training of HMM and CTC-Based Full-Context ASR Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Kaizen: Continuously Improving Teacher Using Exponential Moving Average for Semi-Supervised Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Contextual RNN-T For Open Domain ASR.
CoRR, 2020

Fast, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces.
CoRR, 2020

Multilingual Graphemic Hybrid ASR with Massive Data Augmentation.
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

Faster, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces.
Proceedings of the Interspeech 2020, 2020

Large Scale Weakly and Semi-Supervised Learning for Low-Resource Video ASR.
Proceedings of the Interspeech 2020, 2020

Contextualizing ASR Lattice Rescoring with Hybrid Pointer Network Language Model.
Proceedings of the Interspeech 2020, 2020

Contextual RNN-T for Open Domain ASR.
Proceedings of the Interspeech 2020, 2020

Training ASR Models By Generation of Contextual Information.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Relevance Ranking for Real-Time Tweet Search.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

2019
Multilingual ASR with Massive Data Augmentation.
CoRR, 2019

2008
Computing the curve-skeletons of images.
Int. J. Comput. Math., 2008

2005
A classical approach for thinning of binary images using divergence of the potential field.
Int. J. Comput. Math., 2005


  Loading...