Tu Trinh

Orcid: 0000-0002-2373-1469

According to our database, Tu Trinh authored at least 12 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
HiL-Bench (Human-in-Loop Benchmark): Do Agents Know When to Ask for Help?
CoRR, April, 2026

2025
Learning to Coordinate with Experts.
CoRR, February, 2025

Probabilities of Chat LLMs Are Miscalibrated but Still Predict Correctness on Multiple-Choice Q&A.
Trans. Mach. Learn. Res., 2025

YRC-Bench: A Benchmark for Learning to Coordinate with Experts.
Trans. Mach. Learn. Res., 2025

Aligned LLMs Are Not Aligned Browser Agents.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Getting By Goal Misgeneralization With a Little Help From a Mentor.
CoRR, 2024

Refusal-Trained LLMs Are Easily Jailbroken As Browser Agents.
CoRR, 2024

Softmax Probabilities (Mostly) Predict Large Language Model Correctness on Multiple-Choice Q&A.
CoRR, 2024

A StrongREJECT for Empty Jailbreaks.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Autonomous Assessment of Demonstration Sufficiency via Bayesian Inverse Reinforcement Learning.
Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, 2024

2022
Efficient Game-Theoretic Planning With Prediction Heuristic for Socially-Compliant Autonomous Driving.
IEEE Robotics Autom. Lett., 2022

Autonomous Assessment of Demonstration Sufficiency via Bayesian Inverse Reinforcement Learning.
CoRR, 2022

