We stand with Ukraine

We stand with Ukraine

Yueze Wang

According to our database¹, Yueze Wang authored at least 24 papers between 2022 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Emu3.5: Native Multimodal Models are World Learners.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, October, 2025

MR<sup>2</sup>-Bench: Going Beyond Matching to Reasoning in Multimodal Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, September, 2025

OmniGen2: Exploration to Advanced Multimodal Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, June, 2025

Fine-Grained Visual Text Prompting.

[BibT_eX]

[DOI]

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., March, 2025

MomentSeeker: A Comprehensive Benchmark and A Strong Baseline For Moment Retrieval Within Long Videos.

[BibT_eX]

[DOI]

,

,

,

,

Zhengyang Liang

,

,

,

,

CoRR, February, 2025

EVEv2: Improved Baselines for Encoder-Free Vision-Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, February, 2025

OmniGen: Unified Image Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Unveiling the Ignorance of MLLMs: Seeing Clearly, Answering Incorrectly.

[BibT_eX]

[DOI]

,

Zhengyang Liang

,

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MegaPairs: Massive Data Synthesis for Universal Multimodal Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Chen Jason Zhang

,

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Emu3: Next-Token Prediction is All You Need.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

OmniGen: Unified Image Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2024

Seeing Clearly, Answering Incorrectly: A Multimodal Robustness Benchmark for Evaluating MLLMs on Leading Questions.

[BibT_eX]

[DOI]

,

Zhengyang Liang

,

,

,

,

CoRR, 2024

Efficient Multimodal Learning from Data-centric Perspective.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2024

DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Unveiling Encoder-Free Vision-Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Universal Prompt Optimizer for Safe Text-to-Image Generation.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Emu: Generative Pretraining in Multimodality.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Generative Multimodal Models are In-Context Learners.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

DSMENet: Detail and Structure Mutually Enhancing Network for under-sampled MRI reconstruction.

[BibT_eX]

[DOI]

,

,

Comput. Biol. Medicine, March, 2023

Generative Multimodal Models are In-Context Learners.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, 2023

Generative Pretraining in Multimodality.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, 2023

Fine-Grained Visual Prompting.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2023

Fine-Grained Visual Prompting.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022

HIWDNet: A hybrid image-wavelet domain network for fast magnetic resonance image reconstruction.

[BibT_eX]

[DOI]

,

,

Comput. Biol. Medicine, 2022

Loading...