Yueze Wang

According to our database, Yueze Wang authored at least 22 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
OmniGen2: Exploration to Advanced Multimodal Generation.
CoRR, June, 2025

Fine-Grained Visual Text Prompting.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2025

MomentSeeker: A Comprehensive Benchmark and A Strong Baseline For Moment Retrieval Within Long Videos.
CoRR, February, 2025

EVEv2: Improved Baselines for Encoder-Free Vision-Language Models.
CoRR, February, 2025

OmniGen: Unified Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Unveiling the Ignorance of MLLMs: Seeing Clearly, Answering Incorrectly.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MegaPairs: Massive Data Synthesis for Universal Multimodal Retrieval.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Emu3: Next-Token Prediction is All You Need.
CoRR, 2024

OmniGen: Unified Image Generation.
CoRR, 2024

Seeing Clearly, Answering Incorrectly: A Multimodal Robustness Benchmark for Evaluating MLLMs on Leading Questions.
CoRR, 2024

Efficient Multimodal Learning from Data-centric Perspective.
CoRR, 2024

DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Unveiling Encoder-Free Vision-Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Universal Prompt Optimizer for Safe Text-to-Image Generation.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Emu: Generative Pretraining in Multimodality.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Generative Multimodal Models are In-Context Learners.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
DSMENet: Detail and Structure Mutually Enhancing Network for under-sampled MRI reconstruction.
Comput. Biol. Medicine, March, 2023

Generative Multimodal Models are In-Context Learners.
CoRR, 2023

Generative Pretraining in Multimodality.
CoRR, 2023

Fine-Grained Visual Prompting.
CoRR, 2023

Fine-Grained Visual Prompting.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
HIWDNet: A hybrid image-wavelet domain network for fast magnetic resonance image reconstruction.
Comput. Biol. Medicine, 2022

