Vijay Anand Korthikanti

According to our database, Vijay Anand Korthikanti authored at least 17 papers between 2008 and 2024.

Bibliography

2024
Upcycling Large Language Models into Mixture of Experts.
CoRR, 2024

An Empirical Study of Mamba-based Language Models.
CoRR, 2024

2023
Reducing Activation Recomputation in Large Transformer Models.
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning.
Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022
Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model.
CoRR, 2022

2021
Efficient Large-Scale Language Model Training on GPU Clusters.
CoRR, 2021

Efficient large-scale language model training on GPU clusters using Megatron-LM.
Proceedings of the International Conference for High Performance Computing, 2021

2011
Towards energy-performance trade-off analysis of parallel applications.
PhD thesis, 2011

Energy-performance trade-off analysis of parallel algorithms for shared memory architectures.
Sustain. Comput. Informatics Syst., 2011

Model Checking MDPs with a Unique Compact Invariant Set of Distributions.
Proceedings of the Eighth International Conference on Quantitative Evaluation of Systems, 2011

Synthesizing geometry constructions.
Proceedings of the 32nd ACM SIGPLAN Conference on Programming Language Design and Implementation, 2011

On the Energy Complexity of Parallel Algorithms.
Proceedings of the International Conference on Parallel Processing, 2011

2010
Towards optimizing energy costs of algorithms for shared memory architectures.
Proceedings of the 22nd Annual ACM Symposium on Parallelism in Algorithms and Architectures (SPAA 2010), 2010

Reasoning about MDPs as Transformers of Probability Distributions.
Proceedings of the QEST 2010, 2010

Avoiding energy wastage in parallel applications.
Proceedings of the International Green Computing Conference 2010, 2010

2009
Analysis of Parallel Algorithms for Energy Conservation in Scalable Multicore Architectures.
Proceedings of the ICPP 2009, 2009

2008
Fair K Mutual Exclusion Algorithm for Peer to Peer Systems.
Proceedings of the 28th IEEE International Conference on Distributed Computing Systems (ICDCS 2008), 2008

