Danny Halawi

According to our database1, Danny Halawi authored at least 6 papers between 2022 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Dominion: A New Frontier for AI Research.
CoRR, 2024

Approaching Human-Level Forecasting with Language Models.
CoRR, 2024

Covert Malicious Finetuning: Challenges in Safeguarding LLM Adaptation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Overthinking the Truth: Understanding how Language Models Process False Demonstrations.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Eliciting Latent Predictions from Transformers with the Tuned Lens.
CoRR, 2023

2022
Trophic analysis of a historical network reveals temporal information.
Appl. Netw. Sci., 2022


  Loading...