Jiabing Yang

Orcid: 0009-0009-9406-5545

According to our database, Jiabing Yang authored at least 16 papers between 2025 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Multi-View Video Diffusion Policy: A 3D Spatio-Temporal-Aware Video Action Model.
CoRR, April, 2026

UAOR: Uncertainty-aware Observation Reinjection for Vision-Language-Action Models.
CoRR, February, 2026

Beyond Closed-Pool Video Retrieval: A Benchmark and Agent Framework for Real-World Video Search and Moment Localization.
CoRR, February, 2026

PaperX: A Unified Framework for Multimodal Academic Presentation Generation with Scholar DAG.
CoRR, February, 2026

BridgeV2W: Bridging Video Generation Models to Embodied World Models via Embodiment Masks.
CoRR, February, 2026

ShotFinder: Imagination-Driven Open-Domain Video Shot Retrieval via Web Search.
CoRR, January, 2026

ToolWeaver: Weaving Collaborative Semantics for Scalable Tool Use in Large Language Models.
CoRR, January, 2026

BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions.
Trans. Mach. Learn. Res., 2026

Privacy preserving person re-identification via anonymizing diffusion model.
Pattern Recognit., 2026

2025
AudioMarathon: A Comprehensive Benchmark for Long-Context Audio Understanding and Efficiency in Audio LLMs.
CoRR, October, 2025

EgoDemoGen: Novel Egocentric Demonstration Generation Enables Viewpoint-Robust Manipulation.
CoRR, September, 2025

Prune2Drive: A Plug-and-Play Framework for Accelerating Vision-Language Models in Autonomous Driving.
CoRR, August, 2025

DTPA: Dynamic Token-level Prefix Augmentation for Controllable Text Generation.
CoRR, August, 2025

IKOD: Mitigating Visual Attention Degradation in Large Vision-Language Models.
CoRR, August, 2025

EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow.
CoRR, July, 2025

EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

