Aparna Khare

Orcid: 0000-0001-7151-3055

According to our database1, Aparna Khare authored at least 16 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition.
CoRR, 2024

Turn-Taking and Backchannel Prediction with Acoustic and Large Language Model Fusion.
Proceedings of the IEEE International Conference on Acoustics, 2024

Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Cross-Utterance ASR Rescoring with Graph-Based Label Propagation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Two-Pass Endpoint Detection for Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Guided Contrastive Self-Supervised Pre-Training for Automatic Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

ASR-Aware End-to-End Neural Diarization.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Self-Supervised Learning with Cross-Modal Transformers for Emotion Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Audiovisual Highlight Detection in Videos.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Multiresolution and Multimodal Speech Recognition with Transformers.
CoRR, 2020

Multi-channel Acoustic Modeling using Mixed Bitrate OPUS Compression.
CoRR, 2020

Multi-Modal Embeddings Using Multi-Task Learning for Emotion Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Fully Learnable Front-End for Multi-Channel Acoustic Modeling Using Semi-Supervised Learning.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Multimodal and Multiresolution Speech Recognition with Transformers.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2016
Multi-Task Learning and Weighted Cross-Entropy for DNN-Based Keyword Spotting.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2012
Cisco's speaker segmentation and recognition system.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012


  Loading...