Sunit Sivasankaran

According to our database, Sunit Sivasankaran authored at least 22 papers between 2013 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Target word activity detector: An approach to obtain ASR word boundaries without lexicon.
CoRR, 2024

WavLLM: Towards Robust and Adaptive Speech Large Language Model.
CoRR, 2024

NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription.
CoRR, 2024

WavLLM: Towards Robust and Adaptive Speech Large Language Model.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning.
CoRR, 2023

Simulating Realistic Speech Overlaps Improves Multi-Talker ASR.
Proceedings of the IEEE International Conference on Acoustics, 2023

Speech Separation with Large-Scale Self-Supervised Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023

2021
Explaining Deep Learning Models for Speech Enhancement.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020
Localization guided speech separation. (Séparation de la parole guidée par la localisation).
PhD thesis, 2020

Asteroid: The PyTorch-Based Audio Source Separation Toolkit for Researchers.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

SLOGD: Speaker Location Guided Deflation Approach to Speech Separation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Analyzing the impact of speaker localization errors on speech separation for automatic speech recognition.
Proceedings of the 28th European Signal Processing Conference, 2020

2019
VoiceHome-2, an extended corpus for multichannel speech processing in real homes.
Speech Commun., 2019

The Speed Submission to DIHARD II: Contributions & Lessons Learned.
CoRR, 2019

2018
Keyword Based Speaker Localization: Localizing a Target Speaker in a Multi-speaker Environment.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Phone Merging For Code-Switched Speech Recognition.
Proceedings of the Third Workshop on Computational Approaches to Linguistic Code-Switching@ACL 2018, 2018

2017
A combined evaluation of established and new approaches for speech recognition in varied reverberation conditions.
Comput. Speech Lang., 2017

Discriminative importance weighting of augmented training data for acoustic model training.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

An extended experimental investigation of DNN uncertainty propagation for noise robust ASR.
Proceedings of the Hands-free Speech Communications and Microphone Arrays, 2017

2016
A French Corpus for Distant-Microphone Speech Processing in Real Homes.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Robust ASR using neural network based speech enhancement and feature simulation.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2013
Statistics based features for unvoiced sound classification.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013

