Mohammad Shoeybi
According to our database, Mohammad Shoeybi authored at least 54 papers between 2006 and 2024.
Bibliography
2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
CoRR, 2023
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023
Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023
Adding Instructions during Pretraining: Effective way of Controlling Toxicity in Language Models.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023
2022
Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model.
CoRR, 2022
Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Prompt Compression and Contrastive Conditioning for Controllability and Toxicity Reduction in Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
2021
CoRR, 2021
Proceedings of the International Conference for High Performance Computing, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
2020
CoRR, 2020
MEGATRON-CNTRL: Controllable Story Generation with External Knowledge Using Large-Scale Language Models.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
2019
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism.
CoRR, 2019
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
2017
CoRR, 2017
Proceedings of the 34th International Conference on Machine Learning, 2017
2010
An adaptive implicit-explicit scheme for the DNS and LES of compressible flows on unstructured grids.
J. Comput. Phys., 2010
2008
J. Comput. Phys., 2008
2006
Multiscale Model. Simul., 2006