Pranay Dighe

According to our database1, Pranay Dighe authored at least 28 papers between 2012 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Apple Intelligence Foundation Language Models.
CoRR, 2024

Modality Drop-Out for Multimodal Device Directed Speech Detection Using Verbal and Non-Verbal Features.
Proceedings of the IEEE International Conference on Acoustics, 2024

Leveraging Large Language Models for Exploiting ASR Uncertainty.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Modality Dropout for Multimodal Device Directed Speech Detection using Verbal and Non-Verbal Features.
CoRR, 2023

Less Is More: A Unified Architecture for Device-Directed Speech Detection with Multiple Invocation Types.
Proceedings of the IEEE International Conference on Acoustics, 2023

Audio-to-Intent Using Acoustic-Textual Subword Representations from End-to-End ASR.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Streaming on-Device Detection of Device Directed Speech from Voice and Touch-Based Invocation.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Knowledge Transfer for Efficient on-Device False Trigger Mitigation.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
On quantifying the quality of acoustic models in hybrid DNN-HMM ASR.
Speech Commun., 2020

Complementary Language Model and Parallel Bi-LRNN for False Trigger Mitigation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Lattice-Based Improvements for Voice Triggering Using Graph Neural Networks.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Low-rank and sparse subspace modeling of speech for DNN based acoustic modeling.
Speech Commun., 2019

Analyzing Uncertainties in Speech Recognition Using Dropout.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Far-Field ASR Using Low-Rank and Sparse Soft Targets from Parallel Data.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

2017
Exploiting Eigenposteriors for Semi-Supervised Training of DNN Acoustic Models with Sequence Discrimination.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Low-rank and sparse soft targets to learn better DNN acoustic models.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Sparse modeling of neural network posterior probabilities for exemplar-based speech recognition.
Speech Commun., 2016

Low-Rank Representation of Nearest Neighbor Posterior Probabilities to Enhance DNN Based Acoustic Modeling.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Exploiting low-dimensional structures to enhance DNN based acoustic modeling in speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Sparse modeling of posterior exemplars for keyword detection.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014
Modeling Overlapping Speech using Vector Taylor Series.
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014

Detecting and labeling speakers on overlapping speech using vector taylor series.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

2013
Swara Histogram Based Structural Analysis And Identification Of Indian Classical Ragas.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

Scale independent raga identification using chromagram patterns and swara based features.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

2012
Language identification using spectro-temporal patch features.
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012

Audio event detection from acoustic unit occurrence patterns.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012


  Loading...