Cindy Wu

According to our database, Cindy Wu authored at least 3 papers between 2023 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
Targeted Latent Adversarial Training Improves Robustness to Persistent Harmful Behaviors in LLMs.
CoRR, 2024

Using Degeneracy in the Loss Landscape for Mechanistic Interpretability.
CoRR, 2024

2023
What Mechanisms Does Knowledge Distillation Distill?
Proceedings of UniReps: the First Workshop on Unifying Representations in Neural Models, 2023

