Shruti Palaskar

Orcid: 0000-0001-8637-1897

According to our database1, Shruti Palaskar authored at least 21 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Multimodal Large Language Models with Fusion Low Rank Adaptation for Device Directed Speech Detection.
CoRR, 2024

2022
End-to-End Speech Summarization Using Restricted Self-Attention.
Proceedings of the IEEE International Conference on Acoustics, 2022

On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
Speech Summarization using Restricted Self-Attention.
CoRR, 2021

Multimodal Speech Summarization Through Semantic Concept Learning.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

How2Sign: A Large-Scale Multimodal Dataset for Continuous American Sign Language.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Speech Technology for Unwritten Languages.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Grounded Sequence to Sequence Transduction.
IEEE J. Sel. Top. Signal Process., 2020

Transfer learning for multimodal dialog.
Comput. Speech Lang., 2020

How2Sign: A Large-scale Multimodal Dataset for Continuous American Sign Language.
CoRR, 2020

ASR Error Correction and Domain Adaptation Using Machine Translation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Learned in Speech Recognition: Contextual Acoustic Word Embeddings.
Proceedings of the IEEE International Conference on Acoustics, 2019

Learning from Multiview Correlations in Open-domain Videos.
Proceedings of the IEEE International Conference on Acoustics, 2019

Multimodal Grounding for Sequence-to-sequence Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Multimodal Abstractive Summarization for How2 Videos.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
How2: A Large-scale Dataset for Multimodal Language Understanding.
CoRR, 2018


Acoustic-to-Word Recognition with Sequence-to-Sequence Models.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Linguistic Unit Discovery from Multi-Modal Inputs in Unwritten Languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

End-to-end Multimodal Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Combining LSTM and Latent Topic Modeling for Mortality Prediction.
CoRR, 2017


  Loading...