Derek Shi
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
Oracle-RLAIF: An Improved Fine-Tuning Framework for Multi-modal Video Models through Reinforcement Learning from Ranking Feedback.
CoRR, October, 2025
VERIRAG: Healthcare Claim Verification via Statistical Audit in Retrieval-Augmented Generation.
CoRR, July, 2025