Xinzhu Ma

Orcid: 0000-0003-0504-0186

According to our database, Xinzhu Ma authored at least 44 papers between 2016 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Intern-S1: A Scientific Multimodal Foundation Model.
CoRR, August, 2025

Propagating Sparse Depth via Depth Foundation Model for Out-of-Distribution Depth Completion.
CoRR, August, 2025

Fitness aligned structural modeling enables scalable virtual screening with AuroBind.
CoRR, August, 2025

PhysUniBench: An Undergraduate-Level Physics Reasoning Benchmark for Multimodal Models.
CoRR, June, 2025

SP-VLA: A Joint Model Scheduling and Token Pruning Approach for VLA Model Acceleration.
CoRR, June, 2025

CPRet: A Dataset, Benchmark, and Model for Retrieval in Competitive Programming.
CoRR, May, 2025

Point2Primitive: CAD Reconstruction from Point Cloud by Direct Primitive Prediction.
CoRR, May, 2025

CMT: A Cascade MAR with Topology Predictor for Multimodal Conditional CAD Generation.
CoRR, April, 2025

GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., February, 2025

Propagating Sparse Depth via Depth Foundation Model for Out-of-Distribution Depth Completion.
IEEE Trans. Image Process., 2025

3DAxisPrompt: Promoting the 3D grounding and reasoning in GPT-4o.
Neurocomputing, 2025

Point2skh: End-to-end Parametric Primitive Inference from Point Clouds with Improved Denoising Transformer.
Comput. Aided Des., 2025

Revisiting Convolution Architecture in the Realm of DNA Foundation Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Push-and-Pull: A General Training Framework With Differential Augmentor for Domain Generalized Point Cloud Classification.
IEEE Trans. Circuits Syst. Video Technol., August, 2024

3D Object Detection From Images for Autonomous Driving: A Survey.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Learning Pixel-Wise Continuous Depth Representation via Clustering for Depth Completion.
IEEE Trans. Circuits Syst. Video Technol., 2024

COMET: Benchmark for Comprehensive Biological Multi-omics Evaluation Tasks and Language Models.
CoRR, 2024

EMS: Adaptive Evict-then-Merge Strategy for Head-wise KV Cache Compression Based on Global-Local Importance.
CoRR, 2024

PRANCE: Joint Token-Optimization and Structural Channel-Pruning for Adaptive ViT Inference.
CoRR, 2024

Evaluating the Generalization Ability of Quantized LLMs: Benchmark, Analysis, and Toolbox.
CoRR, 2024

TMPQ-DM: Joint Timestep Reduction and Quantization Precision Selection for Efficient Diffusion Models.
CoRR, 2024

BEACON: Benchmark for Comprehensive RNA Tasks and Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Model Decides How to Tokenize: Adaptive DNA Sequence Tokenization with MxDNA.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

ProSST: Protein Language Modeling with Quantized Structure and Disentangled Attention.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Retraining-free Model Quantization via One-Shot Weight-Coupling Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Rethinking the BERT-like Pretraining for DNA Sequences.
CoRR, 2023

Better Teacher Better Student: Dynamic Prior Knowledge for Knowledge Distillation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Towards Fair and Comprehensive Comparisons for Image-Based 3D Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
An Empirical Study of Pseudo-Labeling for Image-based 3D Object Detection.
CoRR, 2022

Better Teacher Better Student: Dynamic Prior Knowledge for Knowledge Distillation.
CoRR, 2022

MonoDistill: Learning Spatial Features for Monocular 3D Object Detection.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Scale-Prior Deformable Convolution for Exemplar-Guided Class-Agnostic Counting.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021
Learning Geometry-Guided Depth via Projective Modeling for Monocular 3D Object Detection.
CoRR, 2021

Geometry Uncertainty Projection Network for Monocular 3D Object Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Delving Into Localization Errors for Monocular 3D Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Rethinking Pseudo-LiDAR Representation.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Learning to Segment Unseen Category Objects using Gradient Gaussian Attention.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Self-Adaption Multi-classifier Fusion Networks for Image Recognition.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Accurate Monocular 3D Object Detection via Color-Embedded 3D Reconstruction for Autonomous Driving.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018
Disparity-Based Robust Unstructured Terrain Segmentation.
Proceedings of the Pattern Recognition and Computer Vision - First Chinese Conference, 2018

User-Guided Deep Anime Line Art Colorization with Conditional Adversarial Networks.
Proceedings of the 2018 ACM Multimedia Conference, 2018

2016
An Efficient Protocol With Bidirectional Verification for Storage Security in Cloud Computing.
IEEE Access, 2016
