Tu Trinh

Orcid: 0000-0002-2373-1469

According to our database, Tu Trinh authored at least 7 papers between 2022 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Getting By Goal Misgeneralization With a Little Help From a Mentor.
CoRR, 2024

Refusal-Trained LLMs Are Easily Jailbroken As Browser Agents.
CoRR, 2024

Softmax Probabilities (Mostly) Predict Large Language Model Correctness on Multiple-Choice Q&A.
CoRR, 2024

A StrongREJECT for Empty Jailbreaks.
CoRR, 2024

Autonomous Assessment of Demonstration Sufficiency via Bayesian Inverse Reinforcement Learning.
Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, 2024

2022
Efficient Game-Theoretic Planning With Prediction Heuristic for Socially-Compliant Autonomous Driving.
IEEE Robotics Autom. Lett., 2022

Autonomous Assessment of Demonstration Sufficiency via Bayesian Inverse Reinforcement Learning.
CoRR, 2022

