Yi-Chen Li

Orcid: 0009-0004-9908-5303

Affiliations:

Nanjing University, School of Artificial Intelligence, China

According to our database¹, Yi-Chen Li authored at least 27 papers between 2023 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2026

RMGAP: Benchmarking the Generalization of Reward Models across Diverse Preferences.

[BibT_eX]

[DOI]

Yangyang Zhou

Yi-Chen Li

CoRR, May, 2026

Off-Policy Value-Based Reinforcement Learning for Large Language Models.

[BibT_eX]

[DOI]

CoRR, March, 2026

Non-Adversarial Imitation Learning Provably Free of Compounding Errors: The Role of Bellman Constraints.

[BibT_eX]

[DOI]

CoRR, March, 2026

Multi-agent In-context Coordination via Decentralized Memory Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Generalizable Offline Multiobjective Reinforcement Learning via Preference-Conditioned Diffuser.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., December, 2025

Improving Sample Efficiency of Reinforcement Learning With Background Knowledge From Large Language Models.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., November, 2025

Generalizable Multi-Modal Adversarial Imitation Learning for Non-Stationary Dynamics.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., July, 2025

Generalist Reward Models: Found Inside Large Language Models.

[BibT_eX]

[DOI]

CoRR, June, 2025

Controlling Large Language Model with Latent Actions.

[BibT_eX]

[DOI]

CoRR, March, 2025

Sentence-level Reward Model can Generalize Better for Aligning LLM from Human Preference.

[BibT_eX]

[DOI]

CoRR, March, 2025

Constraining an Unconstrained Multi-agent Policy with offline data.

[BibT_eX]

[DOI]

Neural Networks, 2025

Learning to Reuse Policies in State Evolvable Environments.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Controlling Large Language Model with Latent Action.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Q-Adapter: Training Your LLM Adapter as a Residual Q-Function.

[BibT_eX]

[DOI]

CoRR, 2024

BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation.

[BibT_eX]

[DOI]

CoRR, 2024

Dynamics Adaptive Safe Reinforcement Learning with a Misspecified Simulator.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning and Knowledge Discovery in Databases. Research Track, 2024

Continual Multi-Objective Reinforcement Learning via Reward Model Rehearsal.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Deep Demonstration Tracing: Learning Generalizable Imitator Policy for Runtime Imitation from a Single Demonstration.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation.

[BibT_eX]

[DOI]

Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Cost-aware Offline Safe Meta Reinforcement Learning with Robust In-Distribution Online Task Adaptation.

[BibT_eX]

[DOI]

Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

2023

Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments.

[BibT_eX]

[DOI]

CoRR, 2023

Policy Regularization with Dataset Constraint for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Learning Generalizable Batch Active Learning Strategies via Deep Q-networks (Student Abstract).

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Yi-Chen Li

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...