Rui Liu

ORCID: 0000-0003-2115-8491

Affiliations:
  • Huawei, Hong Kong
  • Chinese University of Hong Kong, Department of Electrical Engineering, CUHK-SenseTime Joint Laboratory, Hong Kong
  • University of Electronic Science and Technology of China, Chengdu, China (former)


According to our database, Rui Liu authored at least 29 papers between 2017 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
UI-KOBE: Knowledge-Oriented Behavior Exploration for Lightweight Graph-Guided GUI Agents.
CoRR, May, 2026

OmniInteract: Benchmarking Real-World Streaming Interaction for Real-Time Omnimodal Assistants.
CoRR, May, 2026

Beyond Text Prompts: Visual-to-Visual Generation as A Unified Paradigm.
CoRR, May, 2026

Self-Distilled Trajectory-Aware Boltzmann Modeling: Bridging the Training-Inference Discrepancy in Diffusion Language Models.
CoRR, May, 2026

PIRA-Bench: A Transition from Reactive GUI Agents to GUI-based Proactive Intent Recommendation Agents.
CoRR, March, 2026

CoVe: Training Interactive Tool-Use Agents via Constraint-Guided Verification.
CoRR, March, 2026


MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025
DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation.
CoRR, December, 2025

Beyond Confidence: Adaptive and Coherent Decoding for Diffusion Language Models.
CoRR, December, 2025

MMA-ASIA: A Multilingual and Multimodal Alignment Framework for Culturally-Grounded Evaluation.
CoRR, October, 2025

Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer.
CoRR, October, 2025

LM-Searcher: Cross-domain Neural Architecture Search with LLMs via Unified Numerical Encoding.
CoRR, September, 2025

MMSearch-Plus: A Simple Yet Challenging Benchmark for Multimodal Browsing Agents.
CoRR, August, 2025

PUSA V1.0: Surpassing Wan-I2V with $500 Training Cost by Vectorized Timestep Adaptation.
CoRR, July, 2025

GHPO: Adaptive Guidance for Stable and Efficient LLM Reinforcement Learning.
CoRR, July, 2025

Ming-Omni: A Unified Multimodal Model for Perception and Generation.
CoRR, June, 2025

Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal Interaction.
CoRR, May, 2025

Disentangling Instruction Influence in Diffusion Transformers for Parallel Multi-Instruction-Guided Image Editing.
CoRR, April, 2025

LM-Searcher: Cross-domain Neural Architecture Search with LLMs via Unified Numerical Encoding.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2022

2021
Decoupled Spatial-Temporal Transformer for Video Inpainting.
CoRR, 2021

FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
StereoGAN: Bridging Synthetic-to-Real Domain Gap by Joint Optimization of Domain Translation and Stereo Matching.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Conditional Adversarial Generative Flow for Controllable Image Synthesis.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Discrete Factorization Machines for Fast Feature-based Recommendation.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

2017
Discrete Content-aware Matrix Factorization.
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13-17, 2017

