Amy Zhang

This is a disambiguation page: it lists papers by multiple people with the same or a similar name.

Bibliography

2026
Reinforcement Learning via Value Gradient Flow.
CoRR, April, 2026

Filtered Reasoning Score: Evaluating Reasoning Quality on a Model's Most-Confident Traces.
CoRR, April, 2026

The PokeAgent Challenge: Competitive and Long-Context Learning at Scale.
CoRR, March, 2026

A Recipe for Stable Offline Multi-agent Reinforcement Learning.
CoRR, March, 2026

Self-Refining Vision Language Model for Robotic Failure Detection and Reasoning.
CoRR, February, 2026

Learning Robust Reasoning through Guided Adversarial Self-Play.
CoRR, February, 2026

CARE-RFT: Confidence-Anchored Reinforcement Finetuning for Reliable Reasoning in Large Language Models.
CoRR, February, 2026

2025
Multi-agent Coordination via Flow Matching.
CoRR, November, 2025

Learning to Interact in World Latent for Team Coordination.
CoRR, September, 2025

Efficient RL for optimizing conversation level outcomes with an LLM-based tutor.
CoRR, July, 2025

ExPO: Unlocking Hard Reasoning with Self-Explanation-Guided Reinforcement Learning.
CoRR, July, 2025

Approximate Cross-Validated Mean Estimates for Bayesian Hierarchical Regression Models.
J. Comput. Graph. Stat., 2025

2024
Fairness-Sensitive Policy-Gradient Reinforcement Learning for Reducing Bias in Robotic Assistance.
Proceedings of the 33rd IEEE International Conference on Robot and Human Interactive Communication, 2024

2023
Dissecting IoT Device Provisioning Process.
CoRR, 2023

Fairness-Sensitive Policy-Gradient Reinforcement Learning for Reducing Bias in Robotic Assistance.
CoRR, 2023

2022
Predicting the Influence of Fake and Real News Spreaders (Student Abstract).
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2020
Approximate Cross-validated Mean Estimates for Bayesian Hierarchical Regression Models.
CoRR, 2020

2019
Whole genomes define concordance of matched primary, xenograft, and organoid models of pancreas cancer.
PLoS Comput. Biol., 2019

2018
A user-centered design and analysis of an electrostatic haptic touchscreen system for students with visual impairments.
Int. J. Hum. Comput. Stud., 2018

2013
Using Machine Learning and HL7 LOINC DO for Classification of Clinical Documents.
Proceedings of AMIA 2013, 2013

2010
Knowledge transmission and engineering teaching.
Learning in the Disciplines: Proceedings of the 9th International Conference of the Learning Sciences, 2010

2006
Improving Traffic Locality in BitTorrent via Biased Neighbor Selection.
Proceedings of the 26th IEEE International Conference on Distributed Computing Systems (ICDCS 2006), 2006

