Runtao Liu

According to our database1, Runtao Liu authored at least 33 papers between 2017 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
AvatarPointillist: AutoRegressive 4D Gaussian Avatarization.
CoRR, April, 2026

Multi-granularity cross-modal representation for occlusion-invariant group re-identification.
Vis. Comput., March, 2026

RenderFlow: Single-Step Neural Rendering via Flow Matching.
CoRR, January, 2026

LongVideoAgent: Multi-Agent Reasoning with Long Videos.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Follow-Your-Instruction: A Comprehensive MLLM Agent for World Data Synthesis.
CoRR, August, 2025

VL-GenRM: Enhancing Vision-Language Verification via Vision Experts and Iterative Training.
CoRR, June, 2025

Fake it till You Make it: Reward Modeling as Discriminative Prediction.
CoRR, June, 2025

Expert-scoring guided global information interaction network for lightweight image super-resolution.
Image Vis. Comput., 2025

I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

AlignGuard: Scalable Safety Alignment for Text-to-Image Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

UTMath: A Benchmark for Math Evaluation with Unit Test.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Pointing to a Llama and Call it a Camel: On the Sycophancy of Multimodal Large Language Models.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

VideoDPO: Omni-Preference Alignment for Video Diffusion Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Bridge-Coder: Transferring Model Capabilities from High-Resource to Low-Resource Programming Language.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
ModelGrow: Continual Text-to-Video Pre-training with Model Expansion and Language Understanding Enhancement.
CoRR, 2024

SafetyDPO: Scalable Safety Alignment for Text-to-Image Generation.
CoRR, 2024

UTMath: Math Evaluation with Unit Test via Reasoning-to-Coding Thoughts.
CoRR, 2024

Bridge-Coder: Unlocking LLMs' Potential to Overcome Language Gaps in Low-Resource Code.
CoRR, 2024

LLMs Meet Multimodal Generation and Editing: A Survey.
CoRR, 2024

Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization.
Proceedings of the Computer Vision - ECCV 2024, 2024

Latent Guard: A Safety Framework for Text-to-Image Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
SketchInverter: Multi-Class Sketch-Based Image Generation via GAN Inversion.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

2022
3D Shape Reconstruction from Free-Hand Sketches.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

2021
The Emergence of Objectness: Learning Zero-shot Segmentation from Videos.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
Unsupervised Sketch to Photo Synthesis.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
An Unpaired Sketch-to-Photo Translation Model.
CoRR, 2019

CLEVR-Ref+: Diagnosing Visual Reasoning With Referring Expressions.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2017
Using Deep Learning Method for Classification: A Proposed Algorithm for the ISIC 2017 Skin Lesion Classification Challenge.
CoRR, 2017

Automatic Document Metadata Extraction Based on Deep Networks.
Proceedings of the Natural Language Processing and Chinese Computing, 2017

A Symbol Dominance Based Formulae Recognition Approach for PDF Documents.
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017

CNN Based Page Object Detection in Document Images.
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017

Citation Metadata Extraction via Deep Neural Network-based Segment Sequence Labeling.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017


  Loading...