Yi-Chen Li

Orcid: 0009-0004-9908-5303

Affiliations:
  • Nanjing University, School of Artificial Intelligence, China


According to our database1, Yi-Chen Li authored at least 27 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
RMGAP: Benchmarking the Generalization of Reward Models across Diverse Preferences.
CoRR, May, 2026

Off-Policy Value-Based Reinforcement Learning for Large Language Models.
CoRR, March, 2026

Non-Adversarial Imitation Learning Provably Free of Compounding Errors: The Role of Bellman Constraints.
CoRR, March, 2026

Multi-agent In-context Coordination via Decentralized Memory Retrieval.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Generalizable Offline Multiobjective Reinforcement Learning via Preference-Conditioned Diffuser.
IEEE Trans. Neural Networks Learn. Syst., December, 2025

Improving Sample Efficiency of Reinforcement Learning With Background Knowledge From Large Language Models.
IEEE Trans. Neural Networks Learn. Syst., November, 2025

Generalizable Multi-Modal Adversarial Imitation Learning for Non-Stationary Dynamics.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2025

Generalist Reward Models: Found Inside Large Language Models.
CoRR, June, 2025

Controlling Large Language Model with Latent Actions.
CoRR, March, 2025

Sentence-level Reward Model can Generalize Better for Aligning LLM from Human Preference.
CoRR, March, 2025

Constraining an Unconstrained Multi-agent Policy with offline data.
Neural Networks, 2025

Learning to Reuse Policies in State Evolvable Environments.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Controlling Large Language Model with Latent Action.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Q-Adapter: Training Your LLM Adapter as a Residual Q-Function.
CoRR, 2024

BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation.
CoRR, 2024

Dynamics Adaptive Safe Reinforcement Learning with a Misspecified Simulator.
Proceedings of the Machine Learning and Knowledge Discovery in Databases. Research Track, 2024

Continual Multi-Objective Reinforcement Learning via Reward Model Rehearsal.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Deep Demonstration Tracing: Learning Generalizable Imitator Policy for Runtime Imitation from a Single Demonstration.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Cost-aware Offline Safe Meta Reinforcement Learning with Robust In-Distribution Online Task Adaptation.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

2023
Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments.
CoRR, 2023

Policy Regularization with Dataset Constraint for Offline Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023

Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Learning Generalizable Batch Active Learning Strategies via Deep Q-networks (Student Abstract).
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023


  Loading...