Vishrav Chaudhary

According to our database1, Vishrav Chaudhary authored at least 68 papers between 2019 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
POROver: Improving Safety and Reducing Overrefusal in Large Language Models with Overgeneration and Preference Optimization.
CoRR, 2024

Scaling Laws for Multilingual Language Models.
CoRR, 2024

Scaling Optimal LR Across Token Horizon.
CoRR, 2024

GRIN: GRadient-INformed MoE.
CoRR, 2024

Efficient LLM Training and Serving with Heterogeneous Context Sharding among Attention Heads.
CoRR, 2024

The Hitchhiker's Guide to Human Alignment with *PO.
CoRR, 2024

sPhinX: Sample Efficient Multilingual Instruction Fine-Tuning Through N-shot Guided Prompting.
CoRR, 2024

Beyond Metrics: Evaluating LLMs' Effectiveness in Culturally Nuanced, Low-Resource Real-World Scenarios.
CoRR, 2024

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone.
CoRR, 2024

ODIN: A Single Model for 2D and 3D Perception.
CoRR, 2024

ODIN: A Single Model for 2D and 3D Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

A Glitch in the Matrix? Locating and Detecting Language Model Grounding with Fakepedia.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Holistic Evaluation of Language Models.
Trans. Mach. Learn. Res., 2023

Implicit Chain of Thought Reasoning via Knowledge Distillation.
CoRR, 2023

DUBLIN - Document Understanding By Language-Image Network.
CoRR, 2023

Understanding the Effectiveness of Very Large Language Models on Dialog Evaluation.
CoRR, 2023

Language Is Not All You Need: Aligning Perception with Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Magneto: A Foundation Transformer.
Proceedings of the International Conference on Machine Learning, 2023

DUBLIN: Visual Document Understanding By Language-Image Network.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023, 2023

Performance and Risk Trade-offs for Multi-word Text Prediction at Scale.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Language Model Decoding as Likelihood-Utility Alignment.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Everything you need to know about Multilingual LLMs: Towards fair, performant and reliable models for languages of the world.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 2023

A Length-Extrapolatable Transformer.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Beyond English-Centric Bitexts for Better Multilingual Language Representation Learning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
The Flores-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation.
Trans. Assoc. Comput. Linguistics, 2022

AmericasNLI: Machine translation and natural language inference systems for Indigenous languages of the Americas.
Frontiers Artif. Intell., 2022

TorchScale: Transformers at Scale.
CoRR, 2022

Language Model Decoding as Likelihood-Utility Alignment.
CoRR, 2022

Foundation Transformers.
CoRR, 2022

MLQE-PE: A Multilingual Quality Estimation and Post-Editing Dataset.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Data Selection Curriculum for Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Few-shot Learning with Multilingual Generative Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

How Robust is Neural Machine Translation to Language Imbalance in Multilingual Tokenizer Training?
Proceedings of the 15th biennial conference of the Association for Machine Translation in the Americas (Volume 1: Research Track), 2022

Alternative Input Signals Ease Transfer in Multilingual Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

OCR Improves Machine Translation for Low-Resource Languages.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

AmericasNLI: Evaluating Zero-shot Natural Language Understanding of Pretrained Multilingual Models in Truly Low-resource Languages.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Beyond English-Centric Multilingual Machine Translation.
J. Mach. Learn. Res., 2021

Few-shot Learning with Multilingual Language Models.
CoRR, 2021

LAWDR: Language-Agnostic Weighted Document Representations from Pre-trained Models.
CoRR, 2021

AmericasNLI: Evaluating Zero-shot Natural Language Understanding of Pretrained Multilingual Models in Truly Low-resource Languages.
CoRR, 2021

Findings of the WMT 2021 Shared Task on Large-Scale Multilingual Machine Translation.
Proceedings of the Sixth Conference on Machine Translation, 2021

Findings of the WMT 2021 Shared Task on Quality Estimation.
Proceedings of the Sixth Conference on Machine Translation, 2021


Self-training Improves Pre-training for Natural Language Understanding.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Classification-based Quality Estimation: Small and Efficient Models for Real-world Applications.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Quality Estimation without Human-labeled Data.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

WikiMatrix: Mining 135M Parallel Sentences in 1620 Language Pairs from Wikipedia.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Multilingual Translation from Denoising Pre-Training.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Adapting High-resource NMT Models to Translate Low-resource Related Languages without Parallel Data.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Unsupervised Quality Estimation for Neural Machine Translation.
Trans. Assoc. Comput. Linguistics, 2020

Beyond English-Centric Multilingual Machine Translation.
CoRR, 2020

MLQE-PE: A Multilingual Quality Estimation and Post-Editing Dataset.
CoRR, 2020

Multilingual Translation with Extensible Multilingual Pretraining and Finetuning.
CoRR, 2020

Findings of the WMT 2020 Shared Task on Machine Translation Robustness.
Proceedings of the Fifth Conference on Machine Translation, 2020

Findings of the WMT 2020 Shared Task on Quality Estimation.
Proceedings of the Fifth Conference on Machine Translation, 2020

Findings of the WMT 2020 Shared Task on Parallel Corpus Filtering and Alignment.
Proceedings of the Fifth Conference on Machine Translation, 2020

BERGAMOT-LATTE Submissions for the WMT20 Quality Estimation Shared Task.
Proceedings of the Fifth Conference on Machine Translation, 2020

CCNet: Extracting High Quality Monolingual Datasets from Web Crawl Data.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

An Exploratory Study on Multilingual Quality Estimation.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

CCAligned: A Massive Collection of Cross-Lingual Web-Document Pairs.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

A Survey of Qualitative Error Analysis for Neural Machine Translation Systems.
Proceedings of the 14th Conference of the Association for Machine Translation in the Americas, 2020

Unsupervised Cross-lingual Representation Learning at Scale.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
A Massive Collection of Cross-Lingual Web-Document Pairs.
CoRR, 2019

Two New Evaluation Datasets for Low-Resource Machine Translation: Nepali-English and Sinhala-English.
CoRR, 2019

Findings of the WMT 2019 Shared Task on Parallel Corpus Filtering for Low-Resource Conditions.
Proceedings of the Fourth Conference on Machine Translation, 2019

Low-Resource Corpus Filtering Using Multilingual Sentence Embeddings.
Proceedings of the Fourth Conference on Machine Translation, 2019

The FLORES Evaluation Datasets for Low-Resource Machine Translation: Nepali-English and Sinhala-English.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Facebook AI's WAT19 Myanmar-English Translation Task Submission.
Proceedings of the 6th Workshop on Asian Translation, 2019


  Loading...