Zhizhong Su

Orcid: 0000-0003-2312-9985

According to our database1, Zhizhong Su authored at least 42 papers between 2015 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
HoloMotion-1 Technical Report.
CoRR, May, 2026

3D-Fixer: Coarse-to-Fine In-place Completion for 3D Scenes from a Single Image.
CoRR, April, 2026

Scaling Sim-to-Real Reinforcement Learning for Robot VLAs with Generative 3D Worlds.
CoRR, March, 2026

Spa3R: Predictive Spatial Field Modeling for 3D Visual Reasoning.
CoRR, February, 2026

IRIS-SLAM: Unified Geo-Instance Representations for Robust Semantic Localization and Mapping.
CoRR, February, 2026

HoloBrain-0 Technical Report.
CoRR, February, 2026

RISE: Self-Improving Robot Policy with Compositional World Model.
CoRR, February, 2026

MapDream: Task-Driven Map Learning for Vision-Language Navigation.
CoRR, February, 2026

MonoDream: Monocular Vision-Language Navigation with Panoramic Dreaming.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

IGFuse: Interactive 3D Gaussian Scene Reconstruction via Multi-Scans Fusion.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

H-RDT: Human Manipulation Enhanced Bimanual Robotic Manipulation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
RecurGS: Interactive Scene Modeling via Discrete-State Recurrent Gaussian Fusion.
CoRR, December, 2025

Motus: A Unified Latent Action World Model.
CoRR, December, 2025

Progress-Think: Semantic Progress Reasoning for Vision-Language Navigation.
CoRR, November, 2025

FSR-VLN: Fast and Slow Reasoning for Vision-Language Navigation with Hierarchical Multi-modal Scene Graph.
CoRR, September, 2025

DreamLifting: A Plug-in Module Lifting MV Diffusion Models for 3D Asset Generation.
CoRR, September, 2025

Uni3R: Unified 3D Reconstruction and Semantic Understanding via Generalizable Gaussian Splatting from Unposed Multi-View Images.
CoRR, August, 2025

Theoretical Analysis of Relative Errors in Gradient Computations for Adversarial Attacks with CE Loss.
CoRR, July, 2025

FineGrasp: Towards Robust Grasping for Delicate Objects.
CoRR, July, 2025

EmbodiedGen: Towards a Generative 3D World Engine for Embodied Intelligence.
CoRR, June, 2025

RoboTransfer: Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer.
CoRR, May, 2025

SEM: Enhancing Spatial Understanding for Robust Robot Manipulation.
CoRR, May, 2025

Aux-Think: Exploring Reasoning Strategies for Data-Efficient Vision-Language Navigation.
CoRR, May, 2025

GeoFlow-SLAM: A Robust Tightly-Coupled RGBD-Inertial Fusion SLAM for Dynamic Legged Robotics.
CoRR, March, 2025

DIPO: Dual-State Images Controlled Articulated Object Generation Powered by Diverse Data.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

GeoFlow-SLAM: A Robust Tightly-Coupled RGBD-Inertial and Legged Odometry Fusion SLAM for Dynamic Legged Robotics.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Gaussian Object Carver: Object-Compositional Gaussian Splatting with surfaces completion.
CoRR, 2024

GLS: Geometry-aware 3D Language Gaussian Splatting.
CoRR, 2024

2023
Dataset construction method of cross-lingual summarization based on filtering and text augmentation.
Dataset, March, 2023

Dataset construction method of cross-lingual summarization based on filtering and text augmentation.
PeerJ Comput. Sci., 2023

Sparse4D v3: Advancing End-to-End 3D Detection and Tracking.
CoRR, 2023

Sparse4D v2: Recurrent Temporal Fusion with Sparse Model.
CoRR, 2023

2022
Sparse4D: Multi-view 3D Object Detection with Sparse Spatial-Temporal Fusion.
CoRR, 2022

2021
HybridGazeNet: Geometric model guided Convolutional Neural Networks for gaze estimation.
CoRR, 2021

A Stance Detection Approach Based on Generalized Autoregressive pretrained Language Model in Chinese Microblogs.
Proceedings of the ICMLC 2021: 13th International Conference on Machine Learning and Computing, 2021

2020
Gaussian Vector: An Efficient Solution for Facial Landmark Detection.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019
A New Parallel Detection-Recognition Approach for End-to-End Scene Text Extraction.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

VarGFaceNet: An Efficient Variable Group Convolutional Neural Network for Lightweight Face Recognition.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

2016
STAR-Net: A SpaTial Attention Residue Network for Scene Text Recognition.
Proceedings of the British Machine Vision Conference 2016, 2016

2015
Conditional Random Fields as Recurrent Neural Networks.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015


  Loading...