Yuwei Niu

According to our database¹, Yuwei Niu authored at least 20 papers between 2024 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

From Pixels to Words - Towards Native One-Vision Models at Scale.

[BibT_eX]

[DOI]

CoRR, May, 2026

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture.

[BibT_eX]

[DOI]

CoRR, May, 2026

ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents.

[BibT_eX]

[DOI]

CoRR, April, 2026

iFSQ: Improving FSQ for Image Generation with 1 Line of Code.

[BibT_eX]

[DOI]

CoRR, January, 2026

Look-Back: Implicit Visual Re-focusing in MLLM Reasoning.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

OmniDPO: A Preference Optimization Framework to Address Omni-Modal Hallucination.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Memory in the Age of AI Agents.

[BibT_eX]

[DOI]

CoRR, December, 2025

Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward.

[BibT_eX]

[DOI]

CoRR, November, 2025

Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback.

[BibT_eX]

[DOI]

CoRR, October, 2025

SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models.

[BibT_eX]

[DOI]

CoRR, October, 2025

GIR-Bench: Versatile Benchmark for Generating Images with Reasoning.

[BibT_eX]

[DOI]

CoRR, October, 2025

OmniDPO: A Preference Optimization Framework to Address Omni-Modal Hallucination.

[BibT_eX]

[DOI]

CoRR, September, 2025

UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation.

[BibT_eX]

[DOI]

CoRR, June, 2025

WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation.

[BibT_eX]

[DOI]

CoRR, March, 2025

LanP: Rethinking the Impact of Language Priors in Large Vision-Language Models.

[BibT_eX]

[DOI]

CoRR, February, 2025

Test-Time Multimodal Backdoor Detection by Contrastive Prompting.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

LangBridge: Interpreting Image as a Combination of Language Embeddings.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Tuning Vision-Language Models with Candidate Labels by Prompt Alignment.

[BibT_eX]

[DOI]

Proceedings of the Database Systems for Advanced Applications, 2025

ICT: Image-Object Cross-Level Trusted Intervention for Mitigating Object Hallucination in Large Vision-Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

BDetCLIP: Multimodal Prompting Contrastive Test-Time Backdoor Detection.

[BibT_eX]

[DOI]

CoRR, 2024

Yuwei Niu

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...