Jinyi Liu
Orcid: 0000-0002-4537-348XAffiliations:
- Tianjin University, College of Intelligence and Computing, China
According to our database1,
Jinyi Liu
authored at least 26 papers
between 2021 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
Squeeze the Soaked Sponge: Efficient Off-policy Reinforcement Finetuning for Large Language Model.
CoRR, July, 2025
CoRR, May, 2025
From Chaos to Order: The Atomic Reasoner Framework for Fine-grained Reasoning in Large Language Models.
CoRR, March, 2025
SheetAgent: Towards a Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models.
Proceedings of the ACM on Web Conference 2025, 2025
Proceedings of the International Joint Conference on Neural Networks, 2025
DualRAG: A Dual-Process Approach to Integrate Reasoning and Retrieval for Multi-Hop Question Answering.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Proceedings of the Findings of the Association for Computational Linguistics, 2025
2024
IEEE Trans. Neural Networks Learn. Syst., July, 2024
CellAgent: An LLM-driven Multi-Agent Framework for Automated Single-cell Data Analysis.
CoRR, 2024
SheetAgent: A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models.
CoRR, 2024
Enhancing Robotic Manipulation with AI Feedback from Multimodal Large Language Models.
CoRR, 2024
PERIA: Perceive, Reason, Imagine, Act via Holistic Language and Vision Planning for Manipulation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
KISA: A Unified Keyframe Identifier and Skill Annotator for Long-Horizon Robotics Demonstrations.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
vMFER: von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement of Actor-Critic Algorithms.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024
A Trajectory Perspective on the Role of Data Sampling Techniques in Offline Reinforcement Learning.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning.
CoRR, 2023
Ensemble-based Offline-to-Online Reinforcement Learning: From Pessimistic Learning to Optimistic Exploration.
CoRR, 2023
HIPODE: Enhancing Offline Reinforcement Learning with High-Quality Synthetic Data from a Policy-Decoupled Approach.
CoRR, 2023
EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
2021
CoRR, 2021
FIGCPS: Effective Failure-inducing Input Generation for Cyber-Physical Systems with Deep Reinforcement Learning.
Proceedings of the 36th IEEE/ACM International Conference on Automated Software Engineering, 2021