Bohan Zeng

Orcid: 0009-0009-0999-6231

According to our database¹, Bohan Zeng authored at least 30 papers between 2022 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Rethinking Driving World Model as Synthetic Data Generator for Perception Tasks.

[BibT_eX]

[DOI]

CoRR, October, 2025

MorphoBench: A Benchmark with Difficulty Adaptive to Model Reasoning.

[BibT_eX]

[DOI]

CoRR, October, 2025

Implicit Diffusion Models for Continuous Super-Resolution.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., September, 2025

RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark.

[BibT_eX]

[DOI]

CoRR, September, 2025

Multimodal Reasoning for Science: Technical Report and 1st Place Solution to the ICML 2025 SeePhys Challenge.

[BibT_eX]

[DOI]

CoRR, September, 2025

Spatio-Temporal Energy-Guided Diffusion Model for Zero-Shot Video Synthesis and Editing.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., June, 2025

Native Visual Understanding: Resolving Resolution Dilemmas in Vision-Language Models.

[BibT_eX]

[DOI]

CoRR, June, 2025

VersaVid-R1: A Versatile Video Understanding and Reasoning Model from Question Answering to Captioning Tasks.

[BibT_eX]

[DOI]

CoRR, June, 2025

Multi-Step Visual Reasoning with Visual Tokens Scaling and Verification.

[BibT_eX]

[DOI]

CoRR, June, 2025

MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios.

[BibT_eX]

[DOI]

CoRR, May, 2025

Let's Verify Math Questions Step by Step.

[BibT_eX]

[DOI]

CoRR, May, 2025

Mavors: Multi-granularity Video Representation for Multimodal Large Language Model.

[BibT_eX]

[DOI]

CoRR, April, 2025

WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes.

[BibT_eX]

[DOI]

CoRR, March, 2025

Any2AnyTryon: Leveraging Adaptive Position Embeddings for Versatile Virtual Clothing Tasks.

[BibT_eX]

[DOI]

CoRR, January, 2025

IPDreamer: Appearance-Controllable 3D Object Generation with Complex Image Prompts.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Detection of Military Targets on Ground and Sea by UAVs with Low-Altitude Oblique Perspective.

[BibT_eX]

[DOI]

Remote. Sens., April, 2024

Semantic Score Distillation Sampling for Compositional Text-to-3D Generation.

[BibT_eX]

[DOI]

CoRR, 2024

Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis.

[BibT_eX]

[DOI]

CoRR, 2024

EditWorld: Simulating World Dynamics for Instruction-Following Image Editing.

[BibT_eX]

[DOI]

CoRR, 2024

LaDiffGAN: Training GANs with Diffusion Supervision in Latent Spaces.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

ZONE: Zero-Shot Instruction-Guided Local Editing.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

UV-IDM: Identity-Conditioned Latent Diffusion Model for Face UV-Texture Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Controllable Mind Visual Diffusion Model.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

ZONE: Zero-Shot Instruction-Guided Local Editing.

[BibT_eX]

[DOI]

CoRR, 2023

IPDreamer: Appearance-Controllable 3D Object Generation with Image Prompts.

[BibT_eX]

[DOI]

CoRR, 2023

Face Animation with an Attribute-Guided Diffusion Model.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Implicit Diffusion Models for Continuous Super-Resolution.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

TerViT: An Efficient Ternary Vision Transformer.

[BibT_eX]

[DOI]

CoRR, 2022

FNeVR: Neural Volume Rendering for Face Animation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

IDa-Det: An Information Discrepancy-Aware Distillation for 1-Bit Detectors.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Bohan Zeng

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...