Thomas Wolf

Affiliations:
  • Hugging Face, Brooklyn, NY, USA


According to our database1, Thomas Wolf authored at least 31 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale.
CoRR, 2024

GAIA: a benchmark for General AI Assistants.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Federated benchmarking of medical artificial intelligence with MedPerf.
Nat. Mac. Intell., July, 2023

StarCoder: may the source be with you!
Trans. Mach. Learn. Res., 2023

The Stack: 3 TB of permissively licensed source code.
Trans. Mach. Learn. Res., 2023

GAIA: a benchmark for General AI Assistants.
CoRR, 2023

Zephyr: Direct Distillation of LM Alignment.
CoRR, 2023

Scaling Data-Constrained Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023


2022
BigScience: A Case Study in the Social Construction of a Multilingual Large Language Model.
CoRR, 2022

Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements.
CoRR, 2022


2021
Multitask Prompted Training Enables Zero-Shot Task Generalization.
CoRR, 2021

VIMPAC: Video Pre-Training via Masked Token Prediction and Contrastive Learning.
CoRR, 2021

Distributed Deep Learning In Open Collaborations.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Training Transformers Together.
Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track, 2021

Learning from others' mistakes: Avoiding dataset biases without modeling them.
Proceedings of the 9th International Conference on Learning Representations, 2021


2020
TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation.
CoRR, 2020

Movement Pruning: Adaptive Sparsity by Fine-Tuning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Transformers: State-of-the-Art Natural Language Processing.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2020

Overview of the SustaiNLP 2020 Shared Task.
Proceedings of SustaiNLP: Workshop on Simple and Efficient Natural Language Processing, 2020

The Amazing World of Neural Language Generation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Tutorial Abstracts, 2020

2019
HuggingFace's Transformers: State-of-the-art Natural Language Processing.
CoRR, 2019

DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter.
CoRR, 2019

TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents.
CoRR, 2019

Transfer Learning in Natural Language Processing.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Large-Scale Transfer Learning for Natural Language Generation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

A Hierarchical Multi-Task Approach for Learning Embeddings from Semantic Tasks.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Meta-Learning a Dynamical Language Model.
Proceedings of the 6th International Conference on Learning Representations, 2018

Continuous Learning in a Hierarchical Multiscale Neural Network.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018


  Loading...