Shihan Dou
ORCID: 0009-0002-6013-3035
According to our database, Shihan Dou authored at least 47 papers between 2020 and 2025.
Bibliography
2025
ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios.
Proceedings of the 31st International Conference on Computational Linguistics, 2025
Revisiting Jailbreaking for Large Language Models: A Representation Engineering Perspective.
Proceedings of the 31st International Conference on Computational Linguistics, 2025
2024
COCL: An Intelligent Framework for Enhancing Deep Learning-Based Vulnerability Detection.
IEEE Trans. Ind. Informatics, March, 2024
CC2Vec: Combining Typed Tokens with Contrastive Learning for Effective Code Clone Detection.
Proc. ACM Softw. Eng., 2024
Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision.
CoRR, 2024
CoRR, 2024
SafeAligner: Safety Alignment against Jailbreak Attacks via Response Disparity Guidance.
CoRR, 2024
Aligning Large Language Models from Self-Reference AI Feedback with one General Principle.
CoRR, 2024
CoRR, 2024
CodeChameleon: Personalized Encryption Framework for Jailbreaking Large Language Models.
CoRR, 2024
Advancing Translation Preference Modeling with RLHF: A Step Towards Cost-Effective Solution.
CoRR, 2024
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback.
CoRR, 2024
Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback.
CoRR, 2024
ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios.
CoRR, 2024
Proceedings of the Natural Language Processing and Chinese Computing, 2024
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Improving Generalization of Alignment with Human Preferences through Group Invariant Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
LoRAMoE: Alleviating World Knowledge Forgetting in Large Language Models via MoE-Style Plugin.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
StepCoder: Improving Code Generation with Reinforcement Learning from Compiler Feedback.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment.
CoRR, 2023
Improving Generalization of Alignment with Human Preferences through Group Invariant Learning.
CoRR, 2023
Towards Understanding the Capability of Large Language Models on Code Clone Detection: A Survey.
CoRR, 2023
Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023
Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
On the Universal Adversarial Perturbations for Efficient Data-free Adversarial Detection.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
Decorrelate Irrelevant, Purify Relevant: Overcome Textual Spurious Correlations from a Feature Perspective.
CoRR, 2022
Proceedings of the 44th IEEE/ACM International Conference on Software Engineering, 2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Decorrelate Irrelevant, Purify Relevant: Overcome Textual Spurious Correlations from a Feature Perspective.
Proceedings of the 29th International Conference on Computational Linguistics, 2022
MINER: Improving Out-of-Vocabulary Named Entity Recognition from an Information Theoretic Perspective.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
2021
ACM Trans. Softw. Eng. Methodol., 2021
Boosting the Capability of Intelligent Vulnerability Detection by Training in a Human-Learning Manner.
CoRR, 2021
CoRR, 2021
2020
Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering, 2020