Judd Rosenblatt

According to our database1, Judd Rosenblatt authored at least 3 papers in 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Towards Safe and Honest AI Agents with Neural Self-Other Overlap.
CoRR, 2024

Unexpected Benefits of Self-Modeling in Neural Systems.
CoRR, 2024

Rethinking harmless refusals when fine-tuning foundation models.
CoRR, 2024


  Loading...