Zehan Qi

According to our database, Zehan Qi authored at least 15 papers between 2023 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.


Bibliography

2024
AutoGLM: Autonomous Foundation Agents for GUIs.
CoRR, 2024

Long²RAG: Evaluating Long-Context & Long-Form Retrieval-Augmented Generation with Key Point Recall.
CoRR, 2024

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents.
CoRR, 2024

DebateQA: Evaluating Question Answering on Debatable Knowledge.
CoRR, 2024

MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models.
CoRR, 2024

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools.
CoRR, 2024

NaturalCodeBench: Examining Coding Performance Mismatch on HumanEval and Natural User Prompts.
CoRR, 2024

Knowledge Conflicts for LLMs: A Survey.
CoRR, 2024

Prejudice and Caprice: A Statistical Framework for Measuring Social Discrimination in Large Language Models.
CoRR, 2024

Walking in Others' Shoes: How Perspective-Taking Guides Large Language Models in Reducing Toxicity and Bias.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Knowledge Conflicts for LLMs: A Survey.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

LONG²RAG: Evaluating Long-Context & Long-Form Retrieval-Augmented Generation with Key Point Recall.
Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

NaturalCodeBench: Examining Coding Performance Mismatch on HumanEval and Natural User Queries.
Findings of the Association for Computational Linguistics, 2024

Preemptive Answer "Attacks" on Chain-of-Thought Reasoning.
Findings of the Association for Computational Linguistics, 2024

2023
Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity.
CoRR, 2023