Xiyang Wu

According to our database, Xiyang Wu authored at least 23 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks.
CoRR, April, 2026

Graph of Skills: Dependency-Aware Structural Retrieval for Massive Agent Skills.
CoRR, April, 2026

SABER: A Stealthy Agentic Black-Box Attack Framework for Vision-Language-Action Models.
CoRR, March, 2026

MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data.
CoRR, March, 2026

2025
MASS: Motion-Aware Spatial-Temporal Grounding for Physics Reasoning and Comprehension in Vision-Language Models.
CoRR, November, 2025

First Frame Is the Place to Go for Video Content Customization.
CoRR, November, 2025

CaughtCheating: Is Your MLLM a Good Cheating Detective? Exploring the Boundary of Visual Perception and Reasoning.
CoRR, July, 2025

Semantically-Aware Rewards for Open-Ended R1 Training in Free-Form Generation.
CoRR, June, 2025

Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey.
CoRR, January, 2025

VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations on Synthetic Video Understanding.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

On the Vulnerability of LLM/VLM-Controlled Robotics.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

A Survey of State of the Art Large Vision Language Models: Benchmark Evaluations and Challenges.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

2024
SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining.
CoRR, 2024

On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities.
CoRR, 2024

LANCAR: Leveraging Language for Context-Aware Robot Locomotion in Unstructured Environments.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

AGL-Net: Aerial-Ground Cross-Modal Global Localization with Varying Scales.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

AutoHallusion: Automatic Generation of Hallucination Benchmarks for Vision-Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
A novel image registration-based dynamic photometric stereo method for online defect detection in aluminum alloy castings.
Digit. Signal Process., September, 2023

LANCAR: Leveraging Language for Context-Aware Robot Locomotion in Unstructured Environments.
CoRR, 2023

iPLAN: Intent-Aware Planning in Heterogeneous Traffic via Distributed Multi-Agent Reinforcement Learning.
CoRR, 2023

Intent-Aware Planning in Heterogeneous Traffic via Distributed Multi-Agent Reinforcement Learning.
Proceedings of the Conference on Robot Learning, 2023

2020
FireCommander: An Interactive, Probabilistic Multi-agent Environment for Joint Perception-Action Tasks.
CoRR, 2020

