Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning.

Zeyuan Liu

Kai Yang

Jiafei Lyu

Xiu Li

Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

Leveraging Score-based Models for Generating Penalization in Model-based Offline Reinforcement Learning.

Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

Cross-Domain Offline Policy Adaptation with Optimal Transport and Dataset Constraint.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement Learning.

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Novelty-Guided Data Reuse for Efficient and Diversified Multi-Agent Reinforcement Learning.

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Enhancing visual reinforcement learning with State-Action Representation.

Mengbei Yan

Jiafei Lyu

Xiu Li

Knowl. Based Syst., 2024

Understanding What Affects the Generalization Gap in Visual Reinforcement Learning: Theory and Empirical Evidence.

J. Artif. Intell. Res., 2024

Off-policy RL algorithms can be sample-efficient for continuous control via sample multiple reuse.

Inf. Sci., 2024

A two-stage reinforcement learning-based approach for multi-entity task allocation.

Eng. Appl. Artif. Intell., 2024

ODRL: A Benchmark for Off-Dynamics Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Exploration and Anti-Exploration with Distributional Random Network Distillation.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Cross-Domain Policy Adaptation by Capturing Representation Mismatch.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

SEABO: A Simple Search-Based Method for Offline Imitation Learning.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Enhancing Visual Generalization in Reinforcement Learning with Cycling Augmentation.

Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2024, 2024

Mind the Model, Not the Agent: The Primacy Bias in Model-Based RL.

Zhongjian Qiao

Jiafei Lyu

Xiu Li

Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Towards Understanding How to Reduce Generalization Gap in Visual Reinforcement Learning.

Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Normalization Enhances Generalization in Visual Reinforcement Learning.

Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

2023

Value activation for bias alleviation: Generalized-activated deep double deterministic policy gradients.

Neurocomputing, 2023

Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model.

CoRR, 2023

The primacy bias in Model-based RL.

Zhongjian Qiao

Jiafei Lyu

Xiu Li

CoRR, 2023

Zero-shot Preference Learning for Offline RL via Optimal Transport.

CoRR, 2023

Uncertainty-driven Trajectory Truncation for Model-based Offline Reinforcement Learning.

CoRR, 2023

State Advantage Weighting for Offline RL.

Proceedings of the First Tiny Papers Track at ICLR 2023, 2023

Uncertainty-Driven Trajectory Truncation for Data Augmentation in Offline Reinforcement Learning.

Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

2022

PRAG: Periodic Regularized Action Gradient for Efficient Continuous Control.

Proceedings of the PRICAI 2022: Trends in Artificial Intelligence, 2022

Mildly Conservative Q-Learning for Offline Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination.

Jiafei Lyu

Xiu Li

Zongqing Lu

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Efficient Continuous Control with Double Actors and Regularized Critics.

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Bias-reduced multi-step hindsight experience replay.

CoRR, 2021

2020

Nuclear Power Plants With Artificial Intelligence in Industry 4.0 Era: Top-Level Design and Current Applications - A Systemic Review.

IEEE Access, 2020

Jiafei Lyu
