Zhenbo Song

Orcid: 0000-0002-5020-4277

According to our database1, Zhenbo Song authored at least 43 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
MimicDreamer: Aligning Human and Robot Demonstrations for Scalable VLA Training.
CoRR, September, 2025

Efficient Active Training for Deep LiDAR Odometry.
CoRR, September, 2025

Generalizing Unsupervised Lidar Odometry Model from Normal to Snowy Weather Conditions.
CoRR, September, 2025

HieroAction: Hierarchically Guided VLM for Fine-Grained Action Analysis.
CoRR, August, 2025

SatDreamer360: Geometry Consistent Street-View Video Generation from Satellite Imagery.
CoRR, June, 2025

Toward Physically Stable Motion Generation: A New Paradigm of Human Pose Representation.
IEEE Trans. Circuits Syst. Video Technol., May, 2025

Multi-Scale Semantic-Guidance Networks: Robust Blind Face Restoration Against Adversarial Attacks.
IEEE Trans. Inf. Forensics Secur., 2025

Lightweight Yet High-Performance Defect Detector for Uav-Based Large-Scale Infrastructure Real-Time Inspection.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025

Gradient-Based Adversarial Attacks on Deep LiDAR Odometry.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025

Controllable Satellite-to-Street-View Synthesis with Precise Pose Alignment and Zero-Shot Environmental Control.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
MonoSIM: Simulating Learning Behaviors of Heterogeneous Point Cloud Object Detectors for Monocular 3-D Object Detection.
IEEE Trans. Instrum. Meas., 2024

Road Structure Inspired UGV-Satellite Cross-View Geo-Localization.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2024

AS-FIBA: Adaptive Selective Frequency-Injection for Backdoor Attack on Deep Face Restoration.
CoRR, 2024

AS-FIBA: Adaptive Selective Frequency-Injection for Backdoor Attack on Deep Face Restoration.
Proceedings of the 23rd IEEE International Conference on Trust, 2024

Harmonizing Stochasticity and Determinism: Scene-responsive Diverse Human Motion Prediction.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

On the Robustness of Deep Face Inpainting: An Adversarial Perspective.
Proceedings of the 6th ACM International Conference on Multimedia in Asia, 2024

STDG: Semi-Teacher-Student Training Paradigm for Depth-guided One-stage Scene Graph Generation.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

ACR-Pose: Adversarial Canonical Representation Reconstruction Network for Category Level 6D Object Pose Estimation.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

Universal Video Face Restoration Method Based on Vision-Language Model.
Proceedings of the Asian Conference on Machine Learning, 2024

2023
Deep semantic-aware remote sensing image deblurring.
Signal Process., October, 2023

Camera and LiDAR Fusion for Urban Scene Reconstruction and Novel View Synthesis via Voxel-Based Neural Radiance Fields.
Remote. Sens., September, 2023

MMCAN: Multi-Modal Cross-Attention Network for Free-Space Detection with Uncalibrated Hyperspectral Sensors.
Remote. Sens., February, 2023

STDG: Semi-Teacher-Student Training Paradigram for Depth-guided One-stage Scene Graph Generation.
CoRR, 2023

Benchmarking Ultra-High-Definition Image Reflection Removal.
CoRR, 2023

Learning Dense Flow Field for Highly-accurate Cross-view Camera Localization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

GIDP: Learning a Good Initialization and Inducing Descriptor Post-enhancing for Large-scale Place Recognition.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Incorporating Global Correlation and Local Aggregation for Efficient Visual Localization.
Proceedings of the Image and Graphics - 12th International Conference, 2023

EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Robust Single Image Reflection Removal Against Adversarial Attacks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Self-Supervised Depth Completion From Direct Visual-LiDAR Odometry in Autonomous Driving.
IEEE Trans. Intell. Transp. Syst., 2022

SHLE: Devices Tracking and Depth Filtering for Stereo-based Height Limit Estimation.
CoRR, 2022

FuRPE: Learning Full-body Reconstruction from Part Experts.
CoRR, 2022

MonoPCNS: Monocular 3D Object Detection via Point Cloud Network Simulation.
CoRR, 2022

PilotAttnNet: Multi-modal Attention Network for End-to-End Steering Control.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

RPR-Net: A Point Cloud-Based Rotation-Aware Large Scale Place Recognition Network.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Object Level Depth Reconstruction for Category Level 6D Object Pose Estimation from Monocular RGB Image.
Proceedings of the Computer Vision - ECCV 2022, 2022

Visual Localization Through Virtual Views.
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

SVT-Net: Super Light-Weight Sparse Voxel Transformer for Large Scale Place Recognition.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Attentive Rotation Invariant Convolution for Point Cloud-based Large Scale Place Recognition.
CoRR, 2021

SVT-Net: A Super Light-Weight Network for Large Scale Place Recognition using Sparse Voxel Transformers.
CoRR, 2021

Incorporating Orientations into End-to-end Driving Model for Steering Control.
CoRR, 2021

2020
End-to-end Learning for Inter-Vehicle Distance and Relative Velocity Estimation in ADAS with a Monocular Camera.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Deep Novel View Synthesis from Colored 3D Point Clouds.
Proceedings of the Computer Vision - ECCV 2020, 2020


  Loading...