Jie Liu

ORCID: 0000-0002-1782-2081

Affiliations:
  • Chinese University of Hong Kong, CUHK, MMLab, Hong Kong
  • Shanghai AI Laboratory, Intelligence Laboratory, China


According to our database, Jie Liu authored at least 37 papers between 2020 and 2026.

Bibliography

2026
Leveraging Verifier-Based Reinforcement Learning in Image Editing.
CoRR, April, 2026

UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation.
CoRR, March, 2026

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2026, 2026

2025
Global 1km Resolution Solar Photovoltaic Resource, Technical, and Economic Potential: Raster Datasets, Country-Level Statistics and Google Earth Engine Code.
Dataset, November, 2025

GRPO-Guard: Mitigating Implicit Over-Optimization in Flow Matching via Regulated Clipping.
CoRR, October, 2025

VR-Thinker: Boosting Video Reward Models through Thinking-with-Image Reasoning.
CoRR, October, 2025

HiPO: Hybrid Policy Optimization for Dynamic Reasoning in LLMs.
CoRR, September, 2025

RewardDance: Reward Scaling in Visual Generation.
CoRR, September, 2025

Global 1km Resolution Solar Photovoltaic Resource, Technical, and Economic Potential: Raster Datasets, Country-Level Statistics and Google Earth Engine Code.
Dataset, August, 2025

Semantic-Based Resource Management Based on D2D Multicast Content Delivery: A Game-Theoretic Approach.
IEEE Trans. Veh. Technol., May, 2025

Flow-GRPO: Training Flow Matching Models via Online RL.
CoRR, May, 2025

Improving Video Generation with Human Feedback.
CoRR, January, 2025

Improving Video Generation with Human Feedback.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

2024
MaskMA: Towards Zero-Shot Multi-Agent Decision Making with Mask-Based Collaborative Learning.
Trans. Mach. Learn. Res., 2024

Adaptive pessimism via target Q-value for offline reinforcement learning.
Neural Networks, 2024

Iterative Length-Regularized Direct Preference Optimization: A Case Study on Improving 7B Language Models to GPT-4 Level.
CoRR, 2024

MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series.
CoRR, 2024

Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

DDK: Distilling Domain Knowledge for Efficient Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Masked Pretraining for Multi-Agent Decision Making.
CoRR, 2023

Distance-rank Aware Sequential Reward Learning for Inverse Reinforcement Learning with Sub-optimal Demonstrations.
CoRR, 2023

Beyond One-Preference-for-All: Multi-Objective Direct Preference Optimization for Language Models.
CoRR, 2023

Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors.
Proceedings of the Robotics: Science and Systems XIX, Daegu, 2023

Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

ACE: Cooperative Multi-Agent Q-learning with Bidirectional Action-Dependency.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2021
Truncation-Free Matching System for Display Advertising at Alibaba.
CoRR, 2021

Inception Convolution With Efficient Dilation Search.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Adaptive Gradient Method with Resilience and Momentum.
CoRR, 2020

