Leshem Choshen

According to our database, Leshem Choshen authored at least 81 papers between 2018 and 2024.

Bibliography

2024
Sloth: scaling laws for LLM skills to predict multi-benchmark performance across families.
CoRR, 2024

Findings of the Second BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora.
CoRR, 2024

Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation.
CoRR, 2024

ZipNN: Lossless Compression for AI Models.
CoRR, 2024

Model merging with SVD to tie the Knots.
CoRR, 2024

A Hitchhiker's Guide to Scaling Law Estimation.
CoRR, 2024

LiveXiv - A Multi-Modal Live Benchmark Based on Arxiv Papers Content.
CoRR, 2024

Unforgettable Generalization in Language Models.
CoRR, 2024

The Future of Open Human Feedback.
CoRR, 2024

Can You Trust Your Metric? Automatic Concatenation-Based Tests for Metric Validity.
CoRR, 2024

Beneath the Surface of Consistency: Exploring Cross-lingual Knowledge Representation Sharing in LLMs.
CoRR, 2024

The ShareLM Collection and Plugin: Contributing Human-Model Chats for the Benefit of the Community.
CoRR, 2024

A Survey on Model MoErging: Recycling and Routing Among Specialized Experts for Collaborative Learning.
CoRR, 2024

Data Contamination Report from the 2024 CONDA Shared Task.
CoRR, 2024

Benchmark Agreement Testing Done Right: A Guide for LLM Benchmark Evaluation.
CoRR, 2024

Learning from Naturally Occurring Feedback.
CoRR, 2024

Compress then Serve: Serving Thousands of LoRA Adapters with Little Overhead.
CoRR, 2024

Efficient multi-prompt evaluation of LLMs.
CoRR, 2024

Elements of World Knowledge (EWOK): A cognition-inspired framework for evaluating basic world knowledge in language models.
CoRR, 2024

Holmes: Benchmark the Linguistic Competence of Language Models.
CoRR, 2024

Lossless and Near-Lossless Compression for Foundation Models.
CoRR, 2024

[Call for Papers] The 2nd BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus.
CoRR, 2024

Genie: Achieving Human Parity in Content-Grounded Datasets Generation.
CoRR, 2024

Efficient Benchmarking (of Language Models).
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Unitxt: Flexible, Shareable and Reusable Data Preparation and Evaluation for Generative AI.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: System Demonstrations, 2024

Asymmetry in Low-Rank Adapters of Foundation Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

tinyBenchmarks: evaluating LLMs with fewer examples.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Achieving Human Parity in Content-Grounded Datasets Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Fuse to Forget: Bias Reduction and Selective Memorization through Model Fusion.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

NumeroLogic: Number Encoding for Enhanced LLMs' Numerical Reasoning.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Jump to Conclusions: Short-Cutting Transformers with Linear Transformations.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024

Label-Efficient Model Selection for Text Generation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Deductive Closure Training of Language Models for Coherence, Accuracy, and Updatability.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
ComPEFT: Compression for Communicating Parameter Efficient Updates via Sparsification and Quantization.
CoRR, 2023

Resolving Interference When Merging Models.
CoRR, 2023

Call for Papers - The BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus.
CoRR, 2023

TIES-Merging: Resolving Interference When Merging Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Knowledge is a Region in Weight Space for Fine-tuned Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Human Learning by Model Feedback: The Dynamics of Iterative Prompting with Midjourney.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Where to start? Analyzing the potential value of intermediate models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

MuLER: Detailed and Scalable Reference-based Evaluation.
Proceedings of the 27th Conference on Computational Natural Language Learning, 2023

DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

ColD Fusion: Collaborative Descent for Distributed Multitask Finetuning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
ColD Fusion: Collaborative Descent for Distributed Multitask Finetuning.
CoRR, 2022

Label Sleuth: From Unlabeled Text to a Classifier in a Few Hours.
CoRR, 2022

Some Grammatical Errors are Frequent, Others are Important.
CoRR, 2022

Fusing finetuned models for better pretraining.
CoRR, 2022

Semantics-aware Attention Improves Neural Machine Translation.
Proceedings of the 11th Joint Conference on Lexical and Computational Semantics, 2022

GrASP: A Library for Extracting and Exploring Human-Interpretable Textual Patterns.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Label Sleuth: From Unlabeled Text to a Classifier in a Few Hours.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

PreQuEL: Quality Estimation of Machine Translation Outputs in Advance.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

On Neurons Invariant to Sentence Structural Changes in Neural Machine Translation.
Proceedings of the 26th Conference on Computational Natural Language Learning, 2022

Enhancing the Transformer Decoder with Transition-based Syntax.
Proceedings of the 26th Conference on Computational Natural Language Learning, 2022

Reinforcement Learning with Large Action Spaces for Neural Machine Translation.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Cluster & Tune: Boost Cold Start Performance in Text Classification.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

The Grammar-Learning Trajectories of Neural Language Models.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
An autonomous debating system.
Nature, 2021

ComSum: Commit Messages Summarization and Meaning Preservation.
CoRR, 2021

Part of Speech and Universal Dependency effects on English Arabic Machine Translation.
CoRR, 2021

Q²: Evaluating Factual Consistency in Knowledge-Grounded Dialogues via Question Generation and Question Answering.
CoRR, 2021

SERRANT: a syntactic classifier for English Grammatical Error Types.
CoRR, 2021

Transition based Graph Decoder for Neural Machine Translation.
CoRR, 2021

Mediators in Determining what Processing BERT Performs First.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Q²: Evaluating Factual Consistency in Knowledge-Grounded Dialogues via Question Generation and Question Answering.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
Let's Agree to Agree: Neural Networks Share Classification Order on Real Datasets.
Proceedings of the 37th International Conference on Machine Learning, 2020

On the Weaknesses of Reinforcement Learning for Neural Machine Translation.
Proceedings of the 8th International Conference on Learning Representations, 2020

Unsupervised Expressive Rules Provide Explainability and Assist Human Experts Grasping New Domains.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Active Learning for BERT: An Empirical Study.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Classifying Syntactic Errors in Learner Language.
Proceedings of the 24th Conference on Computational Natural Language Learning, 2020

Corpus Wide Argument Mining - A Working Solution.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
SemEval-2019 Task 1: Cross-lingual Semantic Parsing with UCCA.
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

Automatically Extracting Challenge Sets for Non-Local Phenomena in Neural Machine Translation.
Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019

Learning to combine Grammatical Error Corrections.
Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications, 2019

Are You Convinced? Choosing the More Convincing Evidence with a Siamese Network.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

The Language of Legal and Illegal Activity on the Darknet.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Inherent Biases in Reference-based Evaluation for Grammatical Error Correction and Text Simplification.
CoRR, 2018

Reference-less Measure of Faithfulness for Grammatical Error Correction.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

DORA The Explorer: Directed Outreaching Reinforcement Action-Selection.
Proceedings of the 6th International Conference on Learning Representations, 2018

Will it Blend? Blending Weak and Strong Labeled Data in a Neural Network for Argumentation Mining.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Automatic Metric Validation for Grammatical Error Correction.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Inherent Biases in Reference-based Evaluation for Grammatical Error Correction.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

