Junlin Wu

Orcid: 0009-0006-1037-1827

Affiliations:
  • Washington University, Saint Louis, MO, USA


According to our database1, Junlin Wu authored at least 11 papers between 2021 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Verified Safe Reinforcement Learning for Neural Network Dynamic Models.
CoRR, 2024

Axioms for AI Alignment from Human Feedback.
CoRR, 2024

Preference Poisoning Attacks on Reward Model Learning.
CoRR, 2024

Certifying Safety in Reinforcement Learning under Adversarial Perturbation Attacks.
Proceedings of the IEEE Security and Privacy, 2024

RLHFPoison: Reward Poisoning Attack for Reinforcement Learning with Human Feedback in Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
On the Exploitability of Reinforcement Learning with Human Feedback for Large Language Models.
CoRR, 2023

Exact Verification of ReLU Neural Control Barrier Functions.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Neural Lyapunov Control for Discrete-Time Systems.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
Manipulating Elections by Changing Voter Perceptions.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum.
Proceedings of the International Conference on Machine Learning, 2022

2021
Learning Generative Deception Strategies in Combinatorial Masking Games.
Proceedings of the Decision and Game Theory for Security - 12th International Conference, 2021


  Loading...