Bohan Zeng

Orcid: 0009-0009-0999-6231

According to our database1, Bohan Zeng authored at least 25 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Spatio-Temporal Energy-Guided Diffusion Model for Zero-Shot Video Synthesis and Editing.
IEEE Trans. Circuits Syst. Video Technol., June, 2025

Native Visual Understanding: Resolving Resolution Dilemmas in Vision-Language Models.
CoRR, June, 2025

VersaVid-R1: A Versatile Video Understanding and Reasoning Model from Question Answering to Captioning Tasks.
CoRR, June, 2025

Multi-Step Visual Reasoning with Visual Tokens Scaling and Verification.
CoRR, June, 2025

MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios.
CoRR, May, 2025

Let's Verify Math Questions Step by Step.
CoRR, May, 2025

Mavors: Multi-granularity Video Representation for Multimodal Large Language Model.
CoRR, April, 2025

WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes.
CoRR, March, 2025

Any2AnyTryon: Leveraging Adaptive Position Embeddings for Versatile Virtual Clothing Tasks.
CoRR, January, 2025

IPDreamer: Appearance-Controllable 3D Object Generation with Complex Image Prompts.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Detection of Military Targets on Ground and Sea by UAVs with Low-Altitude Oblique Perspective.
Remote. Sens., April, 2024

Semantic Score Distillation Sampling for Compositional Text-to-3D Generation.
CoRR, 2024

Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis.
CoRR, 2024

EditWorld: Simulating World Dynamics for Instruction-Following Image Editing.
CoRR, 2024

LaDiffGAN: Training GANs with Diffusion Supervision in Latent Spaces.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

ZONE: Zero-Shot Instruction-Guided Local Editing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

UV-IDM: Identity-Conditioned Latent Diffusion Model for Face UV-Texture Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Controllable Mind Visual Diffusion Model.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
ZONE: Zero-Shot Instruction-Guided Local Editing.
CoRR, 2023

IPDreamer: Appearance-Controllable 3D Object Generation with Image Prompts.
CoRR, 2023

Face Animation with an Attribute-Guided Diffusion Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Implicit Diffusion Models for Continuous Super-Resolution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
TerViT: An Efficient Ternary Vision Transformer.
CoRR, 2022

FNeVR: Neural Volume Rendering for Face Animation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

IDa-Det: An Information Discrepancy-Aware Distillation for 1-Bit Detectors.
Proceedings of the Computer Vision - ECCV 2022, 2022


  Loading...