Sean Hendryx

According to our database1, Sean Hendryx authored at least 8 papers between 2023 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Refusal-Trained LLMs Are Easily Jailbroken As Browser Agents.
CoRR, 2024

Revisiting the Superficial Alignment Hypothesis.
CoRR, 2024

Planning In Natural Language Improves LLM Search For Code Generation.
CoRR, 2024

Pre-Training Multimodal Hallucination Detectors with Corrupted Grounding Data.
CoRR, 2024

Learning Goal-Conditioned Representations for Language Reward Models.
CoRR, 2024

A Careful Examination of Large Language Model Performance on Grade School Arithmetic.
CoRR, 2024

Out-of-Distribution Detection & Applications With Ablated Learned Temperature Energy.
CoRR, 2024

2023
A Baseline Analysis of Reward Models' Ability To Accurately Analyze Foundation Models Under Distribution Shift.
CoRR, 2023


  Loading...