Zhenbo Song

Orcid: 0000-0002-5020-4277

According to our database, Zhenbo Song authored at least 51 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
2K Retrofit: Entropy-Guided Efficient Sparse Refinement for High-Resolution 3D Geometry Prediction.
CoRR, March, 2026

MemFly: On-the-Fly Memory Optimization via Information Bottleneck.
CoRR, February, 2026

RRFormer: A new transformer-based method for ultra high-definition reflection removal.
Inf. Fusion, 2026

2025
APD-Agents: A Large Language Model-Driven Multi-Agents Collaborative Framework for Automated Page Design.
CoRR, November, 2025

MimicDreamer: Aligning Human and Robot Demonstrations for Scalable VLA Training.
CoRR, September, 2025

Efficient Active Training for Deep LiDAR Odometry.
CoRR, September, 2025

Generalizing Unsupervised Lidar Odometry Model from Normal to Snowy Weather Conditions.
CoRR, September, 2025

HieroAction: Hierarchically Guided VLM for Fine-Grained Action Analysis.
CoRR, August, 2025

SatDreamer360: Geometry Consistent Street-View Video Generation from Satellite Imagery.
CoRR, June, 2025

Toward Physically Stable Motion Generation: A New Paradigm of Human Pose Representation.
IEEE Trans. Circuits Syst. Video Technol., May, 2025

Multi-Scale Semantic-Guidance Networks: Robust Blind Face Restoration Against Adversarial Attacks.
IEEE Trans. Inf. Forensics Secur., 2025

V2DGS: Visual Voxel Map-Based 2D Gaussian Splatting for Accurate Outdoor Reconstruction.
Proceedings of the Pattern Recognition and Computer Vision - 8th Chinese Conference, 2025

UV-GA: UV-Guided Gaussian Avatar Reconstruction from Single Image.
Proceedings of the Pattern Recognition and Computer Vision - 8th Chinese Conference, 2025

Efficient 3D-Gaussian-Splatting-Based Path Planning for Ground Vehicles on Uneven Terrain.
Proceedings of the 28th IEEE International Conference on Intelligent Transportation Systems, 2025

MonoSC: Enhancing Monocular 3D Object Detection by 2D Segmentation and Completion.
Proceedings of the 28th IEEE International Conference on Intelligent Transportation Systems, 2025

Lightweight Yet High-Performance Defect Detector for UAV-Based Large-Scale Infrastructure Real-Time Inspection.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025

Gradient-Based Adversarial Attacks on Deep LiDAR Odometry.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025

Controllable Satellite-to-Street-View Synthesis with Precise Pose Alignment and Zero-Shot Environmental Control.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
MonoSIM: Simulating Learning Behaviors of Heterogeneous Point Cloud Object Detectors for Monocular 3-D Object Detection.
IEEE Trans. Instrum. Meas., 2024

Road Structure Inspired UGV-Satellite Cross-View Geo-Localization.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2024

AS-FIBA: Adaptive Selective Frequency-Injection for Backdoor Attack on Deep Face Restoration.
CoRR, 2024

AS-FIBA: Adaptive Selective Frequency-Injection for Backdoor Attack on Deep Face Restoration.
Proceedings of the 23rd IEEE International Conference on Trust, 2024

Harmonizing Stochasticity and Determinism: Scene-responsive Diverse Human Motion Prediction.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

On the Robustness of Deep Face Inpainting: An Adversarial Perspective.
Proceedings of the 6th ACM International Conference on Multimedia in Asia, 2024

STDG: Semi-Teacher-Student Training Paradigm for Depth-guided One-stage Scene Graph Generation.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

ACR-Pose: Adversarial Canonical Representation Reconstruction Network for Category Level 6D Object Pose Estimation.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

Universal Video Face Restoration Method Based on Vision-Language Model.
Proceedings of the Asian Conference on Machine Learning, 2024

2023
Deep semantic-aware remote sensing image deblurring.
Signal Process., October, 2023

Camera and LiDAR Fusion for Urban Scene Reconstruction and Novel View Synthesis via Voxel-Based Neural Radiance Fields.
Remote. Sens., September, 2023

MMCAN: Multi-Modal Cross-Attention Network for Free-Space Detection with Uncalibrated Hyperspectral Sensors.
Remote. Sens., February, 2023

STDG: Semi-Teacher-Student Training Paradigram for Depth-guided One-stage Scene Graph Generation.
CoRR, 2023

Benchmarking Ultra-High-Definition Image Reflection Removal.
CoRR, 2023

Learning Dense Flow Field for Highly-accurate Cross-view Camera Localization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

GIDP: Learning a Good Initialization and Inducing Descriptor Post-enhancing for Large-scale Place Recognition.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Incorporating Global Correlation and Local Aggregation for Efficient Visual Localization.
Proceedings of the Image and Graphics - 12th International Conference, 2023

EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Robust Single Image Reflection Removal Against Adversarial Attacks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Self-Supervised Depth Completion From Direct Visual-LiDAR Odometry in Autonomous Driving.
IEEE Trans. Intell. Transp. Syst., 2022

SHLE: Devices Tracking and Depth Filtering for Stereo-based Height Limit Estimation.
CoRR, 2022

FuRPE: Learning Full-body Reconstruction from Part Experts.
CoRR, 2022

MonoPCNS: Monocular 3D Object Detection via Point Cloud Network Simulation.
CoRR, 2022

PilotAttnNet: Multi-modal Attention Network for End-to-End Steering Control.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

RPR-Net: A Point Cloud-Based Rotation-Aware Large Scale Place Recognition Network.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Object Level Depth Reconstruction for Category Level 6D Object Pose Estimation from Monocular RGB Image.
Proceedings of the Computer Vision - ECCV 2022, 2022

Visual Localization Through Virtual Views.
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

SVT-Net: Super Light-Weight Sparse Voxel Transformer for Large Scale Place Recognition.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Attentive Rotation Invariant Convolution for Point Cloud-based Large Scale Place Recognition.
CoRR, 2021

SVT-Net: A Super Light-Weight Network for Large Scale Place Recognition using Sparse Voxel Transformers.
CoRR, 2021

Incorporating Orientations into End-to-end Driving Model for Steering Control.
CoRR, 2021

2020
End-to-end Learning for Inter-Vehicle Distance and Relative Velocity Estimation in ADAS with a Monocular Camera.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Deep Novel View Synthesis from Colored 3D Point Clouds.
Proceedings of the Computer Vision - ECCV 2020, 2020

