Jan Wehner

Orcid: 0009-0008-8581-819X

According to our database, Jan Wehner authored at least 7 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Timeline

[Chart omitted: publications per year, 2012 to 2024, by type (book, in proceedings, article, PhD thesis, dataset, other).]

Bibliography

2024
Representation noising effectively prevents harmful fine-tuning on LLMs.
CoRR, 2024

Immunization against harmful fine-tuning attacks.
CoRR, 2024

Explaining Learned Reward Functions with Counterfactual Trajectories.
CoRR, 2024

Representation Noising: A Defence Mechanism Against Harmful Finetuning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Immunization against harmful fine-tuning attacks.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
On robust vs fast solving of qualitative constraints.
J. Heuristics, December, 2023

2012
T.F.O.: tangible flying objects.
Proceedings of the 6th International Conference on Tangible and Embedded Interaction 2012, 2012

