Suraj Anand

According to our database, Suraj Anand authored at least 4 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
Are PPO-ed Language Models Hackable?
CoRR, 2024

Dual Process Learning: Controlling Use of In-Context vs. In-Weights Strategies with Weight Forgetting.
CoRR, 2024

Suppressing Pink Elephants with Direct Principle Feedback.
CoRR, 2024

2019
Springer Nature and online courses with iversity.
Proceedings of Work in Progress Papers of the Research, 2019
