Leandro von Werra

According to our database, Leandro von Werra authored at least 20 papers between 2017 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
SmolLM2: When Smol Goes Big - Data-Centric Training of a Small Language Model.
CoRR, February, 2025

Towards Best Practices for Open Datasets for LLM Training.
CoRR, January, 2025

2024
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions.
CoRR, 2024

Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models.
CoRR, 2024

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

SelfCodeAlign: Self-Alignment for Code Generation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

OctoPack: Instruction Tuning Code Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
StarCoder: may the source be with you!
Trans. Mach. Learn. Res., 2023

The Stack: 3 TB of permissively licensed source code.
Trans. Mach. Learn. Res., 2023

The BigCode Project Governance Card.
CoRR, 2023

Zephyr: Direct Distillation of LM Alignment.
CoRR, 2023

SantaCoder: don't reach for the stars!
CoRR, 2023

2022
Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements.
CoRR, 2022


Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2019
Radiometric Characterization of a Water-Based Conical Blackbody Calibration Target for Millimeter-Wave Remote Sensing.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2019

Unsupervised Anomaly Detection for Seasonal Time Series.
Proceedings of the 6th Swiss Conference on Data Science, 2019

Generative Adversarial Networks in Precision Oncology.
Proceedings of the 2019 ACM SIGIR International Conference on Theory of Information Retrieval, 2019

2017
ETH Zurich at TREC Precision Medicine 2017.
Proceedings of The Twenty-Sixth Text REtrieval Conference, 2017

