Kunal Dhawan

Orcid: 0000-0002-5276-2475

According to our database¹, Kunal Dhawan authored at least 21 papers between 2018 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2024

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks.

[BibT_eX]

[DOI]

Fabian Ritter Gutierrez

CoRR, 2024

VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-Tuning.

[BibT_eX]

[DOI]

CoRR, 2024

META-CAT: Speaker-Informed Speech Embeddings via Meta Information Concatenation for Multi-talker ASR.

[BibT_eX]

[DOI]

CoRR, 2024

Sortformer: Seamless Integration of Speaker Diarization and ASR by Bridging Timestamps and Tokens.

[BibT_eX]

[DOI]

CoRR, 2024

NEST: Self-supervised Fast Conformer as All-purpose Seasoning to Speech Processing Tasks.

[BibT_eX]

[DOI]

CoRR, 2024

Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations.

[BibT_eX]

[DOI]

CoRR, 2024

Less is More: Accurate Speech Recognition & Translation without Web-Scale Data.

[BibT_eX]

[DOI]

CoRR, 2024

Large Language Model Based Generative Error Correction: A Challenge and Baselines For Speech Recognition, Speaker Tagging, and Emotion Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Resource-Efficient Adaptation of Speech Foundation Models for Multi-Speaker ASR.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Discrete Audio Representation as an Alternative to Mel-Spectrograms for Speaker and Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search Approach.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System.

[BibT_eX]

[DOI]

CoRR, 2023

Property-Aware Multi-Speaker Data Simulation: A Probabilistic Modelling Technique for Synthetic Data Generation.

[BibT_eX]

[DOI]

CoRR, 2023

Towards training Bilingual and Code-Switched Speech Recognition models from Monolingual data sources.

[BibT_eX]

[DOI]

Kunal Dhawan

Dima Rekesh

Boris Ginsburg

CoRR, 2023

2021

Phonetic Word Embeddings.

[BibT_eX]

[DOI]

Rahul Sharma

Kunal Dhawan

Balakrishna Pailla

CoRR, 2021

2020

Novel textual features for language modeling of intra-sentential code-switching data.

[BibT_eX]

[DOI]

Sreeram Ganji

Kunal Dhawan

Rohit Sinha

Comput. Speech Lang., 2020

Joint Language Identification of Code-Switching Speech using Attention-based E2E Network.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Signal Processing and Communications, 2020

Investigating Target Set Reduction for End-to-End Speech Recognition of Hindi-English Code-Switching Data.

[BibT_eX]

[DOI]

Proceedings of the 2020 National Conference on Communications, 2020

2019

IITG-HingCoS corpus: A Hinglish code-switching database for automatic speech recognition.

[BibT_eX]

[DOI]

Ganji Sreeram

Kunal Dhawan

Rohit Sinha

Speech Commun., 2019

Towards Adapting NMF Dictionaries Using Total Variability Modeling for Noise-Robust Acoustic Features.

[BibT_eX]

[DOI]

Kunal Dhawan

Colin Vaz

Ruchir Travadi

Shrikanth S. Narayanan

CoRR, 2019

2018

Hindi-English Code-Switching Speech Corpus.

[BibT_eX]

[DOI]

Ganji Sreeram

Kunal Dhawan

Rohit Sinha

CoRR, 2018

Kunal Dhawan

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...