We stand with Ukraine

We stand with Ukraine

Yihang Yao

Orcid: 0009-0005-6093-853X

According to our database¹, Yihang Yao authored at least 15 papers between 2023 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Behavior Injection: Preparing Language Models for Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, May, 2025

Signal, Image, or Symbolic: Exploring the Best Input Representation for Electrocardiogram-Language Models Through a Unified Framework.

[BibT_eX]

[DOI]

,

,

,

,

,

Atharva Mhaskar

,

,

Michael A. Rosenberg

,

,

CoRR, May, 2025

CrashAgent: Crash Scenario Generation via Multi-modal Reasoning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, May, 2025

QuietPaw: Learning Quadrupedal Locomotion with Versatile Noise Preference Alignment.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, March, 2025

Safety is Not Only About Refusal: Reasoning-Enhanced Fine-tuning for Interpretable LLM Safety.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics, 2025

Your Language Model May Think Too Rigidly: Achieving Reasoning Consistency with Symmetry-Enhanced Training.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024

OASIS: Conditional Distribution Shaping for Offline Safe Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Gradient shaping for multi-constraint safe reinforcement learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 6th Annual Learning for Dynamics & Control Conference, 2024

Feasibility Consistent Representation Learning for Safe Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Learning from Sparse Offline Datasets via Conservative Density Estimation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

An Integrated in Situ Image Acquisition and Annotation Scheme for Instance Segmentation Models in Open Scenes With a Human-Robot Interaction Approach.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

IEEE Trans. Hum. Mach. Syst., October, 2023

Datasets and Benchmarks for Offline Safe Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, 2023

Constraint-Conditioned Policy Optimization for Versatile Safe Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Constrained Decision Transformer for Offline Safe Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the International Conference on Machine Learning, 2023

Towards Robust and Safe Reinforcement Learning with Benign Off-policy Data.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the International Conference on Machine Learning, 2023

Loading...