Sheng Zha
According to our database, Sheng Zha authored at least 28 papers between 2018 and 2024.
Timeline
[Chart: number of publications per year, 2018 to 2024, broken down by publication type (book, in proceedings, article, PhD thesis, dataset, other).]
Bibliography
2024
Revisiting SMoE Language Models by Evaluating Inefficiencies with Task Specific Expert Pruning.
CoRR, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Fine-tuning Language Models for Joint Rewriting and Completion of Code with Potential Bugs.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
On the accuracy and efficiency of group-wise clipping in differentially private optimization.
CoRR, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the International Conference on Machine Learning, 2023
Efficient Long-Range Transformers: You Need to Attend More, but Not Necessarily at Every Layer.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Better Context Makes Better Code Language Models: A Case Study on Function Call Argument Completion.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
Parameter and Data Efficient Continual Pre-training for Robustness to Dialectal Variance in Arabic.
CoRR, 2022
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
2021
Distiller: A Systematic Study of Model Distillation Methods in Natural Language Processing.
Proceedings of the Second Workshop on Simple and Efficient Natural Language Processing, 2021
2020
GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing.
J. Mach. Learn. Res., 2020
2019
GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing.
CoRR, 2019
Dynamic Mini-batch SGD for Elastic Distributed Training: Learning in the Limbo of Resources.
CoRR, 2019
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Proceedings of the 2nd Workshop on Deep Learning Approaches for Low-Resource NLP, 2019
2018
Proceedings of the Computer Vision - ECCV 2018, 2018