Zheng Ge
ORCID: 0009-0008-4457-0781
According to our database, Zheng Ge authored at least 76 papers between 2018 and 2026.
Bibliography
2026
SpatialEvo: Self-Evolving Spatial Intelligence via Deterministic Geometric Environments.
CoRR, April, 2026
WebVR: Benchmarking Multimodal LLMs for WebPage Recreation from Videos via Human-Aligned Visual Rubrics.
CoRR, March, 2026
CoRR, February, 2026
PRIME: A Process-Outcome Alignment Benchmark for Verifiable Reasoning in Mathematics and Engineering.
CoRR, February, 2026
CoRR, February, 2026
CoRR, February, 2026
CoRR, January, 2026
Reliability in Statistically Dependent Networks: Bounds, Linear Programming, and Scalability.
IEEE Trans. Commun., 2026
2025
GUI Exploration Lab: Enhancing Screen Navigation in Agents via Multi-Turn Reinforcement Learning.
CoRR, December, 2025
Thinking by Doing: Building Efficient World Model Reasoning in LLMs via Multi-turn Interaction.
CoRR, November, 2025
CoRR, August, 2025
CoRR, August, 2025
Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning.
CoRR, July, 2025
CoRR, June, 2025
CoRR, June, 2025
CoRR, April, 2025
M-DocSum: Do LVLMs Genuinely Comprehend Interleaved Image-Text in Document Summarization?
CoRR, March, 2025
Step-Video-TI2V Technical Report: A State-of-the-Art Text-Driven Image-to-Video Generation Model.
CoRR, March, 2025
CoRR, February, 2025
CoRR, January, 2025
Proceedings of the 102nd IEEE Vehicular Technology Conference, 2025
DistTrain: Addressing Model and Data Heterogeneity with Disaggregated Training for Multimodal Large Language Models.
Proceedings of the ACM SIGCOMM 2025 Conference, 2025
Proceedings of the Forty-second International Conference on Machine Learning, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the IEEE International Conference on Communications, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Reliable Connectivity Optimization in Wireless Networks with Dependent Link Failures.
Proceedings of the 59th Asilomar Conference on Signals, 2025
2024
IEEE Robotics Autom. Lett., November, 2024
IEEE Robotics Autom. Lett., July, 2024
Fourier-Transform-Based Unmixing Method for Fusion of Multiresolution Satellite Images.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2024
DistTrain: Addressing Model and Data Heterogeneity with Disaggregated Training for Multimodal Large Language Models.
CoRR, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the 35th British Machine Vision Conference, 2024
2023
GMM: Delving into Gradient Aware and Model Perceive Depth Mining for Monocular 3D Detection.
CoRR, 2023
The 1st-place Solution for CVPR 2023 OpenLane Topology in Autonomous Driving Challenge.
CoRR, 2023
BEVStereo++: Accurate Depth Estimation in Multi-view 3D Object Detection via Dynamic Temporal Stereo.
CoRR, 2023
Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining.
Proceedings of the International Conference on Machine Learning, 2023
Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Implicit Identity Leakage: The Stumbling Block to Improving Deepfake Detection Generalization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
BEVStereo: Enhancing Depth Estimation in Multi-View 3D Object Detection with Temporal Stereo.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
CoRR, 2022
BEVStereo: Enhancing Depth Estimation in Multi-view 3D Object Detection with Dynamic Temporal Stereo.
CoRR, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
2021
Neurocomputing, 2021
Neurocomputing, 2021
Workshop on Autonomous Driving at CVPR 2021: Technical Report for Streaming Perception Challenge.
CoRR, 2021
Premium Power Value-Added Service Product Decision-Making Method Based on Multi-Index Two-Sided Matching.
IEEE Access, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
2020
CoRR, 2020
DualBox: Generating BBox Pair with Strong Correspondence via Occlusion Pattern Clustering and Proposal Refinement.
Proceedings of the 25th International Conference on Pattern Recognition, 2020
PS-RCNN: Detecting Secondary Human Instances in a Crowd via Primary Object Suppression.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020
NMS by Representative Region: Towards Crowded Pedestrian Detection by Proposal Pairing.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
2018
Proceedings of the Neural Information Processing - 25th International Conference, 2018