Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

AugSumm: Towards Generalizable Speech Summarization Using Synthetic Labels from Large Language Models.

[BibT_eX]

[DOI]

Jee-Weon Jung

Proceedings of the IEEE International Conference on Acoustics, 2024

Dynamic-Superb: Towards a Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark For Speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

On the Evaluation of Speech Foundation Models for Spoken Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Speech vs. Transcript: Does It Matter for Human Annotators in Speech Summarization?

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

LoFT: Local Proxy Fine-tuning For Improving Transferability Of Adversarial Attacks Against Large Language Model.

[BibT_eX]

[DOI]

CoRR, 2023

UniverSLU: Universal Spoken Language Understanding for Diverse Classification and Sequence Generation Tasks with a Single Network.

[BibT_eX]

[DOI]

CoRR, 2023

Evaluating Speech Synthesis by Training Recognizers on Synthetic Speech.

[BibT_eX]

[DOI]

CoRR, 2023

Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech.

[BibT_eX]

[DOI]

CoRR, 2023

BASS: Block-wise Adaptation for Speech Summarization.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Speech Summarization of Long Spoken Document: Improving Memory Efficiency of Speech/Text Encoders.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Espnet-Summ: Introducing a Novel Large Dataset, Toolkit, and a Cross-Corpora Evaluation of Speech Summarization Systems.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Reproducing Whisper-Style Training Using An Open-Source Toolkit And Publicly Available Data.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

XNOR-FORMER: Learning Accurate Approximations in Long Speech Transformers.

[BibT_eX]

[DOI]

Roshan Sharma

Bhiksha Raj

CoRR, 2022

Unifying the Discrete and Continuous Emotion labels for Speech Emotion Recognition.

[BibT_eX]

[DOI]

CoRR, 2022

Self-supervision and Learnable STRFs for Age, Emotion, and Country Prediction.

[BibT_eX]

[DOI]

CoRR, 2022

Cross-utterance context for multimodal video transcription.

[BibT_eX]

[DOI]

Roshan Sharma

Bhiksha Raj

Proceedings of the 56th Asilomar Conference on Signals, Systems, and Computers, ACSSC 2022, Pacific Grove, CA, USA, October 31, 2022

2020

A Summary of the First Workshop on Language Technology for Language Documentation and Revitalization.

[BibT_eX]

[DOI]

Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

Roshan S. Sharma

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...