Junchi Yao

According to our database1, Junchi Yao authored at least 18 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Beyond State Consistency: Behavior Consistency in Text-Based World Models.
CoRR, April, 2026

UGID: Unified Graph Isomorphism for Debiasing Large Language Models.
CoRR, March, 2026

Functional Subspace Watermarking for Large Language Models.
CoRR, March, 2026

FaithSteer-BENCH: A Deployment-Aligned Stress-Testing Benchmark for Inference-Time Steering.
CoRR, March, 2026

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads.
CoRR, February, 2026

Hearing is Believing? Evaluating and Analyzing Audio Language Model Sycophancy with SYAUDIO.
CoRR, January, 2026

2025
Towards Reasoning-Preserving Unlearning in Multimodal Large Language Models.
CoRR, December, 2025

P1: Mastering Physics Olympiads with Reinforcement Learning.
CoRR, November, 2025

POLIS-Bench: Towards Multi-Dimensional Evaluation of LLMs for Bilingual Policy Tasks in Governmental Scenarios.
CoRR, November, 2025

PhysicsMinions: Winning Gold Medals in the Latest Physics Olympiads with a Coevolutionary Multimodal Multi-Agent System.
CoRR, September, 2025

SCI-Verifier: Scientific Verifier with Thinking.
CoRR, September, 2025

HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
CoRR, September, 2025

Mitigating Behavioral Hallucination in Multimodal Large Language Models for Sequential Images.
CoRR, June, 2025

Is Your LLM-Based Multi-Agent a Reliable Real-World Planner? Exploring Fraud Detection in Travel Planning.
CoRR, May, 2025

Scaling Physical Reasoning with the PHYSICS Dataset.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Understanding the Repeat Curse in Large Language Models from a Feature Perspective.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Fraud-R1 : A Multi-Round Benchmark for Assessing the Robustness of LLM Against Augmented Fraud and Phishing Inducements.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Fusing Dynamics Equation: A Social Opinions Prediction Algorithm with LLM-based Agents.
CoRR, 2024


  Loading...