Kushal Tirumala

According to our database, Kushal Tirumala authored at least 13 papers between 2020 and 2024.

Bibliography

2024
Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model.
CoRR, 2024

Brevity is the soul of wit: Pruning long files for code generation.
CoRR, 2024

An Introduction to Vision-Language Modeling.
CoRR, 2024

Text Quality-Based Pruning for Efficient Training of Language Models.
CoRR, 2024

The Unreasonable Ineffectiveness of the Deeper Layers.
CoRR, 2024

Effective pruning of web-scale datasets based on complexity of concept clusters.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Decoding Data Quality via Synthetic Corruptions: Embedding-guided Pruning of Code Data.
CoRR, 2023

SemDeDup: Data-efficient learning at web-scale through semantic deduplication.
CoRR, 2023

D4: Improving LLM Pretraining via Document De-Duplication and Diversification.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
Memorization Without Overfitting: Analyzing the Training Dynamics of Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Investigating Generalization by Controlling Normalized Margin.
Proceedings of the International Conference on Machine Learning, 2022

Dynatask: A Framework for Creating Dynamic AI Benchmark Tasks.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022

2020
Ensemble Machine Learning Methods for Modeling COVID19 Deaths.
CoRR, 2020

