Jinyi Liu

Orcid: 0000-0002-4537-348X

Affiliations:

Tianjin University, College of Intelligence and Computing, China

According to our database¹, Jinyi Liu authored at least 26 papers between 2021 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2025

Squeeze the Soaked Sponge: Efficient Off-policy Reinforcement Finetuning for Large Language Model.

[BibT_eX]

[DOI]

CoRR, July, 2025

From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation.

[BibT_eX]

[DOI]

CoRR, May, 2025

From Chaos to Order: The Atomic Reasoner Framework for Fine-grained Reasoning in Large Language Models.

[BibT_eX]

[DOI]

CoRR, March, 2025

SheetAgent: Towards a Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the ACM on Web Conference 2025, 2025

Multi-Reward Fusion: Learning from Other Policies through Distillation.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2025

DualRAG: A Dual-Process Approach to Integrate Reasoning and Retrieval for Multi-Hop Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

War of Thoughts: Competition Stimulates Stronger Reasoning in Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024

Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., July, 2024

CellAgent: An LLM-driven Multi-Agent Framework for Automated Single-cell Data Analysis.

[BibT_eX]

[DOI]

CoRR, 2024

SheetAgent: A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Enhancing Robotic Manipulation with AI Feedback from Multimodal Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

PERIA: Perceive, Reason, Imagine, Act via Holistic Language and Vision Planning for Manipulation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

KISA: A Unified Keyframe Identifier and Skill Annotator for Long-Horizon Robotics Demonstrations.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

vMFER: von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement of Actor-Critic Algorithms.

[BibT_eX]

[DOI]

Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

A Trajectory Perspective on the Role of Data Sampling Techniques in Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2023

Ensemble-based Offline-to-Online Reinforcement Learning: From Pessimistic Learning to Optimistic Exploration.

[BibT_eX]

[DOI]

CoRR, 2023

HIPODE: Enhancing Offline Reinforcement Learning with High-Quality Synthetic Data from a Policy-Decoupled Approach.

[BibT_eX]

[DOI]

CoRR, 2023

EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2021

ED2: An Environment Dynamics Decomposition Framework for World Model Construction.

[BibT_eX]

[DOI]

CoRR, 2021

Exploration in Deep Reinforcement Learning: A Comprehensive Survey.

[BibT_eX]

[DOI]

CoRR, 2021

FIGCPS: Effective Failure-inducing Input Generation for Cyber-Physical Systems with Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 36th IEEE/ACM International Conference on Automated Software Engineering, 2021

Jinyi Liu

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...