Kshitiz Kumar

According to our database1, Kshitiz Kumar authored at least 35 papers between 2007 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
3D YOLO-SM: End-to-End Approach for Real-time Traffic Light Detection and Recognition in Complex Scenarios.
Proceedings of the 100th IEEE Vehicular Technology Conference, 2024

2023
Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss.
CoRR, 2023

2022
Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, and Pretraining: an Ablation Study.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Sequence-Level Confidence Classifier for ASR Utterance Accuracy and Application to Acoustic Models.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Multi-Dialect Speech Recognition in English Using Attention on Ensemble of Experts.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Fast and Slow Acoustic Model.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Bandpass Noise Generation and Augmentation for Unified ASR.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

1-D Row-Convolution LSTM: Fast Streaming ASR at Accuracy Parity with LC-BLSTM.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Transfer Learning Approaches for Streaming End-to-End Speech Recognition System.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019
Static and Dynamic State Predictions for Acoustic Model Combination.
Proceedings of the IEEE International Conference on Acoustics, 2019

Word Characters and Phone Pronunciation Embedding for ASR Confidence Classifier.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Speaker Adaptation for End-to-End CTC Models.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

A Comparative Study of Spatial Speech Separation Techniques to Improve Speech Recognition.
Proceedings of the Advances in Neural Networks - ISNN 2018, 2018

2017
Extended low-rank plus diagonal adaptation for deep and recurrent neural networks.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Challenges in and Solutions to Deep Learning Network Acoustic Modeling in Speech Recognition Products at Microsoft.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016
Investigations on speaker adaptation of LSTM RNN models for speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Non-negative intermediate-layer DNN adaptation for a 10-KB speaker adaptation profile.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Intermediate-layer DNN adaptation for offline and session-based iterative speaker adaptation.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Delta-melspectra features for noise robustness to DNN-based ASR systems.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Confidence-features and confidence-scores for ASR applications in arbitration and DNN speaker adaptation.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014
Normalization of ASR confidence classifier scores via confidence mapping.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

2013
Predicting speech recognition confidence using deep learning with word identity and score features.
Proceedings of the IEEE International Conference on Acoustics, 2013

2011
A Spectro-Temporal Framework for Compensation of Reverberation for Speech Recognition.
PhD thesis, 2011

Gammatone sub-band magnitude-domain dereverberation for ASR.
Proceedings of the IEEE International Conference on Acoustics, 2011

An iterative least-squares technique for dereverberation.
Proceedings of the IEEE International Conference on Acoustics, 2011

Delta-spectral cepstral coefficients for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Binaural sound source separation motivated by auditory processing.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Maximum-likelihood-based cepstral inverse filtering for blind speech dereverberation.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Robust audio-visual speech synchrony detection by generalized bimodal linear prediction.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Signal separation for robust speech recognition based on phase difference information obtained in the frequency domain.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Audio-visual speech synchronization detection using a bimodal linear prediction model.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2009

Robust speech recognition using a Small Power Boosting algorithm.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
Environment-invariant compensation for reverberation using linear post-filtering for minimum distortion.
Proceedings of the IEEE International Conference on Acoustics, 2008

Noise robust speaker identification using Bhattacharyya distance in adapted Gaussian models space.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

2007
Profile View Lip Reading.
Proceedings of the IEEE International Conference on Acoustics, 2007


  Loading...