Zhiheng Xi
According to our database1,
Zhiheng Xi
authored at least 38 papers
between 2022 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
Predicting Large Language Model Capabilities on Closed-Book QA Tasks Using Only Information Available Prior to Training.
CoRR, February, 2025
CoRR, January, 2025
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use.
CoRR, January, 2025
2024
Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision.
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback.
CoRR, 2024
Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Improving Generalization of Alignment with Human Preferences through Group Invariant Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Inverse-Q*: Token Level Reinforcement Learning for Aligning Large Language Models Without Preference Data.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Improving Discriminative Capability of Reward Models in RLHF Using Contrastive Learning.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Subspace Defense: Discarding Adversarial Perturbations by Learning a Subspace for Clean Signals.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
LoRAMoE: Alleviating World Knowledge Forgetting in Large Language Models via MoE-Style Plugin.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
StepCoder: Improving Code Generation with Reinforcement Learning from Compiler Feedback.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment.
CoRR, 2023
Improving Generalization of Alignment with Human Preferences through Group Invariant Learning.
CoRR, 2023
CoRR, 2023
Towards Understanding the Capability of Large Language Models on Code Clone Detection: A Survey.
CoRR, 2023
CoRR, 2023
RealBehavior: A Framework for Faithfully Characterizing Foundation Models' Human-like Behavior Mechanisms.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022