Aohan Zeng

Orcid: 0000-0002-8766-0153

According to our database1, Aohan Zeng authored at least 23 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Does RLHF Scale? Exploring the Impacts From Data, Model, and Method.
CoRR, 2024

GLM-4-Voice: Towards Intelligent and Human-Like End-to-End Spoken Chatbot.
CoRR, 2024

Scaling Speech-Text Pre-training with Synthetic Interleaved Data.
CoRR, 2024

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents.
CoRR, 2024

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools.
CoRR, 2024

ChatGLM-RLHF: Practices of Aligning Large Language Models with Human Feedback.
CoRR, 2024

Understanding Emergent Abilities of Language Models from the Loss Perspective.
CoRR, 2024

APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding.
CoRR, 2024

xTrimoPGLM: Unified 100B-Scale Pre-trained Transformer for Deciphering the Language of Protein.
CoRR, 2024

AgentBench: Evaluating LLMs as Agents.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

AgentTuning: Enabling Generalized Agent Abilities for LLMs.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Revisiting Parallel Context Windows: A Frustratingly Simple Alternative and Chain-of-Thought Deterioration.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
CritiqueLLM: Scaling LLM-as-Critic for Effective and Explainable Evaluation of Large Language Model Generation.
CoRR, 2023

CogDL: A Comprehensive Library for Graph Deep Learning.
Proceedings of the ACM Web Conference 2023, 2023

WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

GLM-130B: An Open Bilingual Pre-trained Model.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
GLM-130B: An Open Bilingual Pre-trained Model.
CoRR, 2022

BaGuaLu: targeting brain scale pretrained models with over 37 million cores.
Proceedings of the PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2, 2022

2021
FastMoE: A Fast Mixture-of-Expert Training System.
CoRR, 2021

CogDL: An Extensive Toolkit for Deep Learning on Graphs.
CoRR, 2021


  Loading...