Yuwei Niu

According to our database1, Yuwei Niu authored at least 20 papers between 2024 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
From Pixels to Words - Towards Native One-Vision Models at Scale.
CoRR, May, 2026

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture.
CoRR, May, 2026

ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents.
CoRR, April, 2026

iFSQ: Improving FSQ for Image Generation with 1 Line of Code.
CoRR, January, 2026

Look-Back: Implicit Visual Re-focusing in MLLM Reasoning.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

OmniDPO: A Preference Optimization Framework to Address Omni-Modal Hallucination.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Memory in the Age of AI Agents.
CoRR, December, 2025

Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward.
CoRR, November, 2025

Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback.
CoRR, October, 2025

SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models.
CoRR, October, 2025

GIR-Bench: Versatile Benchmark for Generating Images with Reasoning.
CoRR, October, 2025

OmniDPO: A Preference Optimization Framework to Address Omni-Modal Hallucination.
CoRR, September, 2025

UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation.
CoRR, June, 2025

WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation.
CoRR, March, 2025

LanP: Rethinking the Impact of Language Priors in Large Vision-Language Models.
CoRR, February, 2025

Test-Time Multimodal Backdoor Detection by Contrastive Prompting.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

LangBridge: Interpreting Image as a Combination of Language Embeddings.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Tuning Vision-Language Models with Candidate Labels by Prompt Alignment.
Proceedings of the Database Systems for Advanced Applications, 2025

ICT: Image-Object Cross-Level Trusted Intervention for Mitigating Object Hallucination in Large Vision-Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
BDetCLIP: Multimodal Prompting Contrastive Test-Time Backdoor Detection.
CoRR, 2024


  Loading...