Simran Khanuja

According to our database1, Simran Khanuja authored at least 19 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
An image speaks a thousand words, but can everyone listen? On translating images for cultural relevance.
CoRR, 2024

What Is Missing in Multilingual Visual Reasoning and How to Fix It.
CoRR, 2024

DeMuX: Data-efficient Multilingual Learning.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

An image speaks a thousand words, but can everyone listen? On image transcreation for cultural relevance.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
GlobalBench: A Benchmark for Global Progress in Natural Language Processing.
CoRR, 2023

GlobalBench: A Benchmark for Global Progress in Natural Language Processing.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Evaluating the Diversity, Equity, and Inclusion of NLP Technology: A Case Study for Indian Languages.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Multi-lingual and Multi-cultural Figurative Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Evaluating Inclusivity, Equity, and Accessibility of NLP Technology: A Case Study for Indian Languages.
CoRR, 2022

mSLAM: Massively multilingual joint pre-training for speech and text.
CoRR, 2022

FLEURS: FEW-Shot Learning Evaluation of Universal Representations of Speech.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

XTREME-S: Evaluating Cross-lingual Speech Representations.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
MergeDistill: Merging Pre-trained Language Models using Distillation.
CoRR, 2021

MuRIL: Multilingual Representations for Indian Languages.
CoRR, 2021

MergeDistill: Merging Language Models using Pre-trained Distillation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Cross-lingual and Multilingual Spoken Term Detection for Low-Resource Indian Languages.
CoRR, 2020

GLUECoS: An Evaluation Benchmark for Code-Switched NLP.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

A New Dataset for Natural Language Inference from Code-mixed Conversations.
Proceedings of the The 4th Workshop on Computational Approaches to Code Switching, 2020

2019
Unsung Challenges of Building and Deploying Language Technologies for Low Resource Language Communities.
CoRR, 2019


  Loading...