Aparna Khare

Shalini Ghosh

CoRR, 2024

Converging Vulnerability Insights: Unifying Vulnerability Intelligence For Enhanced Application Security With Collaboration.

Proceedings of the ITU Kaleidoscope 2024: Innovation and Digital Transformation for a Sustainable World, 2024

Turn-Taking and Backchannel Prediction with Acoustic and Large Language Model Fusion.

Proceedings of the IEEE International Conference on Acoustics, 2024

Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition.

Shalini Ghosh

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023

Cross-Utterance ASR Rescoring with Graph-Based Label Propagation.

Proceedings of the IEEE International Conference on Acoustics, 2023

Two-Pass Endpoint Detection for Speech Recognition.

Roland Maas

Ariya Rastrow

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022

Guided Contrastive Self-Supervised Pre-Training for Automatic Speech Recognition.

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

ASR-Aware End-to-End Neural Diarization.

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Self-Supervised Learning with Cross-Modal Transformers for Emotion Recognition.

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Audiovisual Highlight Detection in Videos.

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Multiresolution and Multimodal Speech Recognition with Transformers.

Georgios Paraskevopoulos

CoRR, 2020

Multi-channel Acoustic Modeling using Mixed Bitrate OPUS Compression.

Minhua Wu

CoRR, 2020

Multi-Modal Embeddings Using Multi-Task Learning for Emotion Recognition.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Fully Learnable Front-End for Multi-Channel Acoustic Modeling Using Semi-Supervised Learning.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Multimodal and Multiresolution Speech Recognition with Transformers.

Georgios Paraskevopoulos