Yiran Qin

ORCID: 0009-0008-4561-0685

According to our database, Yiran Qin authored at least 36 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
ComSim: Building Scalable Real-World Robot Data Generation via Compositional Simulation.
CoRR, April, 2026

Select-then-Solve: Paradigm Routing as Inference-Time Optimization for LLM Agents.
CoRR, April, 2026

Referring-Aware Visuomotor Policy Learning for Closed-Loop Manipulation.
CoRR, April, 2026

CoEnv: Driving Embodied Multi-Agent Collaboration via Compositional Environment.
CoRR, April, 2026

Ego to World: Collaborative Spatial Reasoning in Embodied Systems via Reinforcement Learning.
CoRR, March, 2026

Reading ≠ Seeing: Diagnosing and Closing the Typography Gap in Vision-Language Models.
CoRR, March, 2026

State Rank Dynamics in Linear Attention LLMs.
CoRR, February, 2026

TouchGuide: Inference-Time Steering of Visuomotor Policies via Touch Guidance.
CoRR, January, 2026

Advances and Innovations in the Multi-Agent Robotic System (MARS) Challenge.
CoRR, January, 2026

Toward Efficient Agents: Memory, Tool learning, and Planning.
CoRR, January, 2026

2025
LiveSearchBench: An Automatically Constructed Benchmark for Retrieval and Reasoning over Dynamic Knowledge.
CoRR, November, 2025

GauDP: Reinventing Multi-Agent Collaboration through Gaussian-Image Synergy in Diffusion Policies.
CoRR, November, 2025

BEAR: Benchmarking and Enhancing Multimodal Language Models for Atomic Embodied Capabilities.
CoRR, October, 2025

CDP: Towards Robust Autoregressive Visuomotor Policy Learning via Causal Diffusion.
CoRR, June, 2025

Boosting 3D Object Detection via Self-Distilling Introspective Data.
IEEE Trans. Intell. Transp. Syst., May, 2025

Towards Robust Evaluation of STEM Education: Leveraging MLLMs in Project-Based Learning.
CoRR, May, 2025

A Survey of Interactive Generative Video.
CoRR, April, 2025

Position: Interactive Generative Video as Next-Generation Game Engine.
CoRR, March, 2025

GameFactory: Creating New Games with Generative Interactive Videos.
CoRR, January, 2025

Context as Memory: Scene-Consistent Interactive Long Video Generation with Memory Retrieval.
Proceedings of the SIGGRAPH Asia 2025 Conference Papers, 2025

VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Chain-of-Imagination for Reliable Instruction Following in Decision Making.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

NavigateDiff: Visual Predictors are Zero-Shot Navigation Assistants.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025

WorldSimBench: Towards Video Generation Models as World Simulators.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

High-Dynamic Radar Sequence Prediction for Weather Nowcasting Using Spatiotemporal Coherent Gaussian Representation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

GameFactory: Creating New Games with Generative Interactive Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

ReSo: A Reward-driven Self-organizing LLM-based Multi-Agent System for Reasoning Tasks.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
WorldSimBench: Towards Video Generation Models as World Simulators.
CoRR, 2024

Story3D-Agent: Exploring 3D Storytelling Visualization with Large Language Models.
CoRR, 2024

MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control.
CoRR, 2024

Toward Accurate Camera-based 3D Object Detection via Cascade Depth Estimation and Calibration.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Impact of monetary and non-monetary social functions on users' knowledge-sharing intentions in online social Q&A communities.
Internet Res., 2023

SupFusion: Supervised LiDAR-Camera Fusion for 3D Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
