Jiafei Lyu

Orcid: 0000-0001-6616-417X

According to our database1, Jiafei Lyu authored at least 38 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
ADG: Ambient Diffusion-Guided Dataset Recovery for Corruption-Robust Offline Reinforcement Learning.
CoRR, May, 2025

Exploration by Random Distribution Distillation.
CoRR, May, 2025

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning.
CoRR, April, 2025

VLP: Vision-Language Preference Learning for Embodied Manipulation.
CoRR, February, 2025

A large language model-driven reward design framework via dynamic feedback for reinforcement learning.
Knowl. Based Syst., 2025

World Models with Hints of Large Language Models for Goal Achieving.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning.
Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

Leveraging Score-based Models for Generating Penalization in Model-based Offline Reinforcement Learning.
Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

Cross-Domain Offline Policy Adaptation with Optimal Transport and Dataset Constraint.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement Learning.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Novelty-Guided Data Reuse for Efficient and Diversified Multi-Agent Reinforcement Learning.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Enhancing visual reinforcement learning with State-Action Representation.
Knowl. Based Syst., 2024

Understanding What Affects the Generalization Gap in Visual Reinforcement Learning: Theory and Empirical Evidence.
J. Artif. Intell. Res., 2024

Off-policy RL algorithms can be sample-efficient for continuous control via sample multiple reuse.
Inf. Sci., 2024

A two-stage reinforcement learning-based approach for multi-entity task allocation.
Eng. Appl. Artif. Intell., 2024

ODRL: A Benchmark for Off-Dynamics Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Exploration and Anti-Exploration with Distributional Random Network Distillation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Cross-Domain Policy Adaptation by Capturing Representation Mismatch.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

SEABO: A Simple Search-Based Method for Offline Imitation Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Enhancing Visual Generalization in Reinforcement Learning with Cycling Augmentation.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2024, 2024

Mind the Model, Not the Agent: The Primacy Bias in Model-Based RL.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Towards Understanding How to Reduce Generalization Gap in Visual Reinforcement Learning.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Normalization Enhances Generalization in Visual Reinforcement Learning.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

2023
Value activation for bias alleviation: Generalized-activated deep double deterministic policy gradients.
Neurocomputing, 2023

Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model.
CoRR, 2023

The primacy bias in Model-based RL.
CoRR, 2023

Zero-shot Preference Learning for Offline RL via Optimal Transport.
CoRR, 2023

Uncertainty-driven Trajectory Truncation for Model-based Offline Reinforcement Learning.
CoRR, 2023

State Advantage Weighting for Offline RL.
Proceedings of the First Tiny Papers Track at ICLR 2023, 2023

Uncertainty-Driven Trajectory Truncation for Data Augmentation in Offline Reinforcement Learning.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

2022
PRAG: Periodic Regularized Action Gradient for Efficient Continuous Control.
Proceedings of the PRICAI 2022: Trends in Artificial Intelligence, 2022

Mildly Conservative Q-Learning for Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Efficient Continuous Control with Double Actors and Regularized Critics.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Bias-reduced multi-step hindsight experience replay.
CoRR, 2021

2020
Nuclear Power Plants With Artificial Intelligence in Industry 4.0 Era: Top-Level Design and Current Applications - A Systemic Review.
IEEE Access, 2020


  Loading...