Bohan Li

ORCID: 0000-0002-6959-7517

Affiliations:
  • Shanghai Jiao Tong University, Shanghai, China


According to our database, Bohan Li authored at least 30 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Hierarchical Context Alignment With Disentangled Geometric and Temporal Modeling for Semantic Occupancy Prediction.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2026

From Articulated Kinematics to Routed Visual Control for Action-Conditioned Surgical Video Generation.
CoRR, May, 2026

PAM: A Pose-Appearance-Motion Engine for Sim-to-Real HOI Video Generation.
CoRR, March, 2026

MedVAR: Towards Scalable and Efficient Medical Image Generation via Next-scale Autoregressive Prediction.
CoRR, February, 2026

GRADRobot: Geometry-Aware Rendering with Articulation and Diffusion for Robot Modeling.
Proceedings of the International Conference on 3D Vision, 2026

2025
Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation.
CoRR, December, 2025

Light-X: Generative 4D Video Rendering with Camera and Illumination Control.
CoRR, December, 2025

Scaling Up Occupancy-centric Driving Scene Generation: Dataset and Method.
CoRR, October, 2025

OmniNWM: Omniscient Driving Navigation World Models.
CoRR, October, 2025

Stability Under Scrutiny: Benchmarking Representation Paradigms for Online HD Mapping.
CoRR, October, 2025

One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation.
CoRR, September, 2025

MapKD: Unlocking Prior Knowledge with Cross-Modal Distillation for Efficient Online HD Map Construction.
CoRR, August, 2025

Light of Normals: Unified Feature Representation for Universal Photometric Stereo.
CoRR, June, 2025

Unifying Appearance Codes and Bilateral Grids for Driving Scene Gaussian Splatting.
CoRR, June, 2025

ORV: 4D Occupancy-centric Robot Video Generation.
CoRR, June, 2025

Challenger: Affordable Adversarial Driving Video Generation.
CoRR, May, 2025

Hybrid-Grained Feature Aggregation with Coarse-to-Fine Language Guidance for Self-Supervised Monocular Depth Estimation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

UniScene: Unified Occupancy-centric Driving Scene Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene Generation.
CoRR, 2024

Bridging Stereo Geometry and BEV Representation with Reliable Mutual Interaction for Semantic Scene Completion.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Hierarchical Temporal Context Learning for Camera-Based Semantic Scene Completion.
Proceedings of the Computer Vision - ECCV 2024, 2024

Closed-Loop Unsupervised Representation Disentanglement with β-VAE Distillation and Diffusion Probabilistic Feedback.
Proceedings of the Computer Vision - ECCV 2024, 2024

One at a Time: Progressive Multi-Step Volumetric Probability Learning for Reliable 3D Scene Perception.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
One at A Time: Multi-step Volumetric Probability Distribution Diffusion for Depth Estimation.
CoRR, 2023

EMoG: Synthesizing Emotive Co-speech 3D Gesture with Diffusion Model.
CoRR, 2023

StereoScene: BEV-Assisted Stereo Matching Empowers 3D Semantic Scene Completion.
CoRR, 2023

NaviNeRF: NeRF-based 3D Representation Disentanglement by Latent Semantic Navigation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Robust Scale-Aware Stereo Matching Network.
IEEE Trans. Artif. Intell., 2022

Improved stereo matching framework with embedded multilevel attention.
J. Electronic Imaging, 2022
