Dan Alistarh
Orcid: 0000-0003-3650-940XAffiliations:
- IST Austria, Klosterneuburg, Austria
- MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, USA (former)
According to our database1,
Dan Alistarh
authored at least 212 papers
between 2008 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
Distributed Computing Column 87 <i>Distributed and Algorithmic Thinking across Domains</i>.
SIGACT News, June, 2024
Trans. Mach. Learn. Res., 2024
CoRR, 2024
CoRR, 2024
The Iterative Optimal Brain Surgeon: Faster Sparse Recovery by Leveraging Second-Order Information.
CoRR, 2024
CoRR, 2024
Panza: A Personalized Text Writing Assistant via Data Playback and Local Fine-Tuning.
CoRR, 2024
MicroAdam: Accurate Adaptive Optimization with Low Space Overhead and Provable Convergence.
CoRR, 2024
CoRR, 2024
Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment.
CoRR, 2024
Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization.
CoRR, 2024
Proceedings of the 43rd ACM Symposium on Principles of Distributed Computing, 2024
L-GreCo: Layerwise-adaptive Gradient Compression For Efficient Data-parallel Deep Learning.
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the 44th IEEE International Conference on Distributed Computing Systems, 2024
Mathador-LM: A Dynamic Benchmark for Mathematical Reasoning on Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024
2023
SIGACT News, December, 2023
Distributed Computing Column 87 Recent Advances in Multi-Pass Graph Streaming Lower Bounds.
SIGACT News, September, 2023
Distributed Comput., September, 2023
Proc. ACM Program. Lang., 2023
How to Prune Your Language Model: Recovering Accuracy on the "Sparsity May Cry" Benchmark.
CoRR, 2023
CoRR, 2023
QIGen: Generating Efficient Kernels for Quantized Inference on Large Language Models.
CoRR, 2023
CoRR, 2023
CoRR, 2023
Proceedings of the 35th ACM Symposium on Parallelism in Algorithms and Architectures, 2023
Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
SparseProp: Efficient Sparse Backpropagation for Faster Training of Neural Networks at the Edge.
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the Computer Aided Verification - 35th International Conference, 2023
2022
SIGACT News, 2022
Distributed Computing Column 85 Elastic Consistency: A Consistency Criterion for Distributed Optimization.
SIGACT News, 2022
L-GreCo: An Efficient and General Framework for Layerwise-Adaptive Gradient Compression.
CoRR, 2022
CoRR, 2022
Hybrid Decentralized Optimization: First- and Zeroth-Order Optimizers Can Be Jointly Leveraged For Faster Convergence.
CoRR, 2022
CoRR, 2022
CoRR, 2022
Proceedings of the PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2, 2022
Proceedings of the PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2, 2022
Proceedings of the PODC '22: ACM Symposium on Principles of Distributed Computing, Salerno, Italy, July 25, 2022
Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Middleware '22: 23rd International Middleware Conference, Quebec, QC, Canada, November 7, 2022
Proceedings of the International Conference on Machine Learning, 2022
The Optimal BERT Surgeon: Scalable and Accurate Second-Order Pruning for Large Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
2021
Breaking (Global) Barriers in Parallel Stochastic Optimization With Wait-Avoiding Group Averaging.
IEEE Trans. Parallel Distributed Syst., 2021
Distributed Computing Column 84: Perspectives on the Paper "CCS Expressions, Finite State Processes, and Three Problems of Equivalence".
SIGACT News, 2021
Distributed Computing Column 83 Five Ways Not To Fool Yourself: Designing Experiments for Understanding Performance.
SIGACT News, 2021
Distributed Computing Column 82 <i>Distributed Computability</i>: <i>A Few Results Masters Students Should Know</i>.
SIGACT News, 2021
Distributed Computing Column 81: Byzantine Agreement with Less Communication: Recent Advances.
SIGACT News, 2021
NUQSGD: Provably Communication-efficient Data-parallel SGD via Nonuniform Quantization.
J. Mach. Learn. Res., 2021
Sparsity in Deep Learning: Pruning and growth for efficient inference and training in neural networks.
J. Mach. Learn. Res., 2021
CoRR, 2021
Efficient Matrix-Free Approximations of Second-Order Information, with Applications to Pruning and Optimization.
CoRR, 2021
Proceedings of the 35th International Symposium on Distributed Computing, 2021
Proceedings of the 35th International Symposium on Distributed Computing, 2021
Proceedings of the SPAA '21: 33rd ACM Symposium on Parallelism in Algorithms and Architectures, 2021
Proceedings of the Structural Information and Communication Complexity, 2021
Proceedings of the PODC '21: ACM Symposium on Principles of Distributed Computing, 2021
Proceedings of the 25th International Conference on Principles of Distributed Systems, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Elastic Consistency: A Practical Consistency Model for Distributed Stochastic Gradient Descent.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
Asynchronous Optimization Methods for Efficient Training of Deep Neural Networks with Guarantees.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
Compressive Sensing Using Iterative Hard Thresholding With Low Precision Data Representation: Theory and Applications.
IEEE Trans. Signal Process., 2020
SIGACT News, 2020
Distributed Computing Column 78: 60 Years of Mastering Concurrent Computing through Sequential Thinking.
SIGACT News, 2020
Breaking (Global) Barriers in Parallel Stochastic Optimization with Wait-Avoiding Group Averaging.
CoRR, 2020
Elastic Consistency: A General Consistency Model for Distributed Stochastic Gradient Descent.
CoRR, 2020
Proceedings of the SPAA '20: 32nd ACM Symposium on Parallelism in Algorithms and Architectures, 2020
Taming unbalanced training workloads in deep learning with partial collective operations.
Proceedings of the PPoPP '20: 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2020
Proceedings of the PPoPP '20: 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2020
Proceedings of the PPoPP '20: 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2020
Proceedings of the PODC '20: ACM Symposium on Principles of Distributed Computing, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Inducing and Exploiting Activation Sparsity for Fast Inference on Deep Neural Networks.
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
2019
CoRR, 2019
Proceedings of the 31st ACM on Symposium on Parallelism in Algorithms and Architectures, 2019
Proceedings of the International Conference for High Performance Computing, 2019
Proceedings of the 24th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2019
Proceedings of the 23rd International Conference on Principles of Distributed Systems, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the Euro-Par 2019: Parallel Processing, 2019
2018
ACM Trans. Parallel Comput., 2018
Compressive Sensing with Low Precision Data Representation: Radio Astronomy and Beyond.
CoRR, 2018
DataBright: Towards a Global Exchange for Decentralized Data Ownership and Trusted Computation.
CoRR, 2018
Proceedings of the 30th on Symposium on Parallelism in Algorithms and Architectures, 2018
Proceedings of the 30th on Symposium on Parallelism in Algorithms and Architectures, 2018
Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete Algorithms, 2018
Proceedings of the 2018 IEEE International Workshop on Signal Processing Systems, 2018
Proceedings of the 2018 ACM Symposium on Principles of Distributed Computing, 2018
Proceedings of the 2018 ACM Symposium on Principles of Distributed Computing, 2018
Proceedings of the 2018 ACM Symposium on Principles of Distributed Computing, 2018
Proceedings of the 2018 ACM Symposium on Principles of Distributed Computing, 2018
Proceedings of the 2018 ACM Symposium on Principles of Distributed Computing, 2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Proceedings of the 6th International Conference on Learning Representations, 2018
Synchronous Multi-GPU Training for Deep Learning with Low-Precision Communications: An Empirical Study.
Proceedings of the 21st International Conference on Extending Database Technology, 2018
Proceedings of the 57th IEEE Conference on Decision and Control, 2018
2017
ACM Trans. Parallel Comput., 2017
Proceedings of the Twenty-Eighth Annual ACM-SIAM Symposium on Discrete Algorithms, 2017
Proceedings of the ACM Symposium on Principles of Distributed Computing, 2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
ZipML: Training Linear Models with End-to-End Low Precision, and a Little Bit of Deep Learning.
Proceedings of the 34th International Conference on Machine Learning, 2017
Proceedings of the 25th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2017
Proceedings of the Twelfth European Conference on Computer Systems, 2017
Proceedings of the DNA Computing and Molecular Programming - 23rd International Conference, 2017
Proceedings of the 13th International Conference on emerging Networking EXperiments and Technologies, 2017
2016
CoRR, 2016
CoRR, 2016
2015
Polylogarithmic-Time Leader Election in Population Protocols Using Polylogarithmic States.
CoRR, 2015
Proceedings of the 2015 ACM Conference on Special Interest Group on Data Communication, 2015
Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015
Proceedings of the 2015 ACM Symposium on Principles of Distributed Computing, 2015
Proceedings of the 2015 ACM Symposium on Principles of Distributed Computing, 2015
Proceedings of the 2015 ACM Symposium on Principles of Distributed Computing, 2015
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015
Proceedings of the Automata, Languages, and Programming - 42nd International Colloquium, 2015
2014
Proceedings of the Twenty-Fifth Annual ACM-SIAM Symposium on Discrete Algorithms, 2014
Proceedings of the ACM Symposium on Principles of Distributed Computing, 2014
Proceedings of the ACM Symposium on Principles of Distributed Computing, 2014
Proceedings of the IEEE 34th International Conference on Distributed Computing Systems, 2014
Proceedings of the Ninth Eurosys Conference 2014, 2014
Distributed Algorithms.
Proceedings of the Computing Handbook, 2014
2013
Proceedings of the ACM Symposium on Principles of Distributed Computing, 2013
2012
PhD thesis, 2012
Algorithmica, 2012
Proceedings of the 24th ACM Symposium on Parallelism in Algorithms and Architectures, 2012
Proceedings of the Structural Information and Communication Complexity, 2012
Proceedings of the 53rd Annual IEEE Symposium on Foundations of Computer Science, 2012
2011
Proceedings of the Distributed Computing - 25th International Symposium, 2011
Proceedings of the 30th Annual ACM Symposium on Principles of Distributed Computing, 2011
Proceedings of the IEEE 52nd Annual Symposium on Foundations of Computer Science, 2011
2010
Proceedings of the Distributed Computing, 24th International Symposium, 2010
Proceedings of the Distributed Computing, 24th International Symposium, 2010
Proceedings of the SPAA 2010: Proceedings of the 22nd Annual ACM Symposium on Parallelism in Algorithms and Architectures, 2010
Proceedings of the Automata, Languages and Programming, 37th International Colloquium, 2010
2008
Proceedings of the Distributed Computing, 22nd International Symposium, 2008