Aohan Zeng

ORCID: 0000-0002-8766-0153

According to our database, Aohan Zeng authored at least 23 papers between 2021 and 2024.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2024

Does RLHF Scale? Exploring the Impacts From Data, Model, and Method.

CoRR, 2024

GLM-4-Voice: Towards Intelligent and Human-Like End-to-End Spoken Chatbot.

CoRR, 2024

Scaling Speech-Text Pre-training with Synthetic Interleaved Data.

CoRR, 2024

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents.

CoRR, 2024

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools.

CoRR, 2024

ChatGLM-RLHF: Practices of Aligning Large Language Models with Human Feedback.

CoRR, 2024

APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding.

CoRR, 2024

xTrimoPGLM: Unified 100B-Scale Pre-trained Transformer for Deciphering the Language of Protein.

CoRR, 2024

Understanding Emergent Abilities of Language Models from the Loss Perspective.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

AgentBench: Evaluating LLMs as Agents.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

AgentTuning: Enabling Generalized Agent Abilities for LLMs.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Revisiting Parallel Context Windows: A Frustratingly Simple Alternative and Chain-of-Thought Deterioration.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

CritiqueLLM: Scaling LLM-as-Critic for Effective and Explainable Evaluation of Large Language Model Generation.

CoRR, 2023

CogDL: A Comprehensive Library for Graph Deep Learning.

Proceedings of the ACM Web Conference 2023, 2023

WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences.

Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

GLM-130B: An Open Bilingual Pre-trained Model.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

GLM-130B: An Open Bilingual Pre-trained Model.

CoRR, 2022

BaGuaLu: targeting brain scale pretrained models with over 37 million cores.

Proceedings of the PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2, 2022

2021

FastMoE: A Fast Mixture-of-Expert Training System.

CoRR, 2021

CogDL: An Extensive Toolkit for Deep Learning on Graphs.

CoRR, 2021
