Sheldon Yu

According to our database1, Sheldon Yu authored at least 6 papers between 2025 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
OLIVIA: Online Learning via Inference-time Action Adaptation for Decision Making in LLM ReAct Agents.
CoRR, May, 2026

MASS-DPO: Multi-negative Active Sample Selection for Direct Policy Optimization.
CoRR, May, 2026

Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning.
CoRR, May, 2026

A Low-Latency Fraud Detection Layer for Detecting Adversarial Interaction Patterns in LLM-Powered Agents.
CoRR, May, 2026

WS-GRPO: Weakly-Supervised Group-Relative Policy Optimization for Rollout-Efficient Reasoning.
CoRR, February, 2026

2025
Explainable Chain-of-Thought Reasoning: An Empirical Analysis on State-Aware Reasoning Dynamics.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025


  Loading...