Udari Madhushani Sehwag

According to our database1, Udari Madhushani Sehwag authored at least 4 papers in 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Can LLMs be Scammed? A Baseline Measurement Study.
CoRR, 2024

AdvBDGen: Adversarially Fortified Prompt-Specific Fuzzy Backdoor Generator Against LLM Alignment.
CoRR, 2024

GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment.
CoRR, 2024

SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors.
CoRR, 2024


  Loading...