Muling Wu
According to our database, Muling Wu authored at least 11 papers between 2023 and 2025.
Bibliography
2025
Revisiting Jailbreaking for Large Language Models: A Representation Engineering Perspective.
Proceedings of the 31st International Conference on Computational Linguistics, 2025
2024
Tell Me What You Don't Know: Enhancing Refusal Capabilities of Role-Playing Agents via Representation Space Analysis and Editing.
CoRR, 2024
CoRR, 2024
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Aligning Large Language Models with Human Preferences through Representation Engineering.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
CoRR, 2023
Parameter Efficient Multi-task Fine-tuning by Learning to Transfer Token-wise Prompts.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Watermarking PLMs on Classification Tasks by Combining Contrastive Learning with Weight Perturbation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023