Yihang Yao

Orcid: 0009-0005-6093-853X

According to our database, Yihang Yao authored at least 15 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Behavior Injection: Preparing Language Models for Reinforcement Learning.
CoRR, May, 2025

Signal, Image, or Symbolic: Exploring the Best Input Representation for Electrocardiogram-Language Models Through a Unified Framework.
CoRR, May, 2025

CrashAgent: Crash Scenario Generation via Multi-modal Reasoning.
CoRR, May, 2025

QuietPaw: Learning Quadrupedal Locomotion with Versatile Noise Preference Alignment.
CoRR, March, 2025

Safety is Not Only About Refusal: Reasoning-Enhanced Fine-tuning for Interpretable LLM Safety.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Your Language Model May Think Too Rigidly: Achieving Reasoning Consistency with Symmetry-Enhanced Training.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
OASIS: Conditional Distribution Shaping for Offline Safe Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Gradient shaping for multi-constraint safe reinforcement learning.
Proceedings of the 6th Annual Learning for Dynamics & Control Conference, 2024

Feasibility Consistent Representation Learning for Safe Reinforcement Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Learning from Sparse Offline Datasets via Conservative Density Estimation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
An Integrated in Situ Image Acquisition and Annotation Scheme for Instance Segmentation Models in Open Scenes With a Human-Robot Interaction Approach.
IEEE Trans. Hum. Mach. Syst., October, 2023

Datasets and Benchmarks for Offline Safe Reinforcement Learning.
CoRR, 2023

Constraint-Conditioned Policy Optimization for Versatile Safe Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Constrained Decision Transformer for Offline Safe Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023

Towards Robust and Safe Reinforcement Learning with Benign Off-policy Data.
Proceedings of the International Conference on Machine Learning, 2023

