Ruichuan An

Orcid: 0009-0000-3758-4335

According to our database¹, Ruichuan An authored at least 38 papers between 2023 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Rethinking VLM Representation for VLA Initialization.

[BibT_eX]

[DOI]

CoRR, May, 2026

TaskGround: Structured Executable Task Inference for Full-Scene Household Reasoning.

[BibT_eX]

[DOI]

CoRR, May, 2026

VGGT-Edit: Feed-forward Native 3D Scene Editing with Residual Field Prediction.

[BibT_eX]

[DOI]

CoRR, May, 2026

Uni-Synergy: Bridging Understanding and Generation for Personalized Reasoning via Co-operative Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, May, 2026

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models.

[BibT_eX]

[DOI]

CoRR, April, 2026

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models.

[BibT_eX]

[DOI]

CoRR, March, 2026

PEARL: Personalized Streaming Video Understanding Model.

[BibT_eX]

[DOI]

CoRR, March, 2026

MME-CoF-Pro: Evaluating Reasoning Coherence in Video Generative Models with Text and Visual Hints.

[BibT_eX]

[DOI]

CoRR, March, 2026

GENIUS: Generative Fluid Intelligence Evaluation Suite.

[BibT_eX]

[DOI]

CoRR, February, 2026

GEBench: Benchmarking Image Generation Models as GUI Environments.

[BibT_eX]

[DOI]

CoRR, February, 2026

QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining.

[BibT_eX]

[DOI]

CoRR, February, 2026

How Well Do Models Follow Visual Instructions? VIBE: A Systematic Benchmark for Visual Instruction-Driven Image Editing.

[BibT_eX]

[DOI]

CoRR, February, 2026

Research on World Models Is Not Merely Injecting World Knowledge into Specific Tasks.

[BibT_eX]

[DOI]

CoRR, February, 2026

Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning.

[BibT_eX]

[DOI]

CoRR, January, 2026

CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation.

[BibT_eX]

[DOI]

CoRR, January, 2026

LoVR: A Benchmark for Long Video Retrieval in Multimodal Contexts.

[BibT_eX]

[DOI]

Proceedings of the ACM Web Conference 2026, 2026

2025

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI.

[BibT_eX]

[DOI]

CoRR, December, 2025

GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models.

[BibT_eX]

[DOI]

CoRR, December, 2025

Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark.

[BibT_eX]

[DOI]

CoRR, October, 2025

Jarvis: Towards Personalized AI Assistant via Personal KV-Cache Retrieval.

[BibT_eX]

[DOI]

CoRR, October, 2025

Robobench: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models as Embodied Brain.

[BibT_eX]

[DOI]

CoRR, October, 2025

MorphoBench: A Benchmark with Difficulty Adaptive to Model Reasoning.

[BibT_eX]

[DOI]

CoRR, October, 2025

CapGeo: A Caption-Assisted Approach to Geometric Reasoning.

[BibT_eX]

[DOI]

CoRR, October, 2025

CodeRankEval: Benchmarking and Analyzing LLM Performance for Code Ranking.

[BibT_eX]

[DOI]

J. Comput. Sci. Technol., September, 2025

WoW: Towards a World omniscient World model Through Embodied Interaction.

[BibT_eX]

[DOI]

CoRR, September, 2025

Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos.

[BibT_eX]

[DOI]

CoRR, June, 2025

Adaptive Classifier-Free Guidance via Dynamic Low-Confidence Masking.

[BibT_eX]

[DOI]

CoRR, May, 2025

SpikeGen: Generative Framework for Visual Spike Stream Processing.

[BibT_eX]

[DOI]

CoRR, May, 2025

LoVR: A Benchmark for Long Video Retrieval in Multimodal Contexts.

[BibT_eX]

[DOI]

CoRR, May, 2025

Concept-as-Tree: Synthetic Data is All You Need for VLM Personalization.

[BibT_eX]

[DOI]

CoRR, March, 2025

UniCTokens: Boosting Personalized Understanding and Generation via Unified Concept Tokens.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

MC-LLaVA: Multi-Concept Personalized Vision-Language Model.

[BibT_eX]

[DOI]

CoRR, 2024

Can Modifying Data Address Graph Domain Adaptation?

[BibT_eX]

[DOI]

Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language Model.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

2023

Split & Merge: Unlocking the Potential of Visual Adapters via Sparse Training.

[BibT_eX]

[DOI]

CoRR, 2023

MoEC: Mixture of Experts Implicit Neural Compression.

[BibT_eX]

[DOI]

CoRR, 2023

Ruichuan An

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...