Qi Zhu

Orcid: 0000-0002-1545-1854

Affiliations:
  • University of Science and Technology of China, Hefei, China


According to our database, Qi Zhu authored at least 28 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Claw AI Lab: An Autonomous Multi-Agent Research Team.
CoRR, May, 2026

4DVGGT-D: 4D Visual Geometry Transformer with Improved Dynamic Depth Estimation.
CoRR, May, 2026

The First Challenge on Mobile Real-World Image Super-Resolution at NTIRE 2026: Benchmark Results and Method Overview.
CoRR, April, 2026

StreamCacheVGGT: Streaming Visual Geometry Transformers with Robust Scoring and Hybrid Cache Compression.
CoRR, April, 2026

Robust 4D Visual Geometry Transformer with Uncertainty-Aware Priors.
CoRR, April, 2026

HD-VGGT: High-Resolution Visual Geometry Transformer.
CoRR, March, 2026

PulseMind: A Multi-Modal Medical Model for Real-World Clinical Diagnosis.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
WaterWave: Bridging Underwater Image Enhancement into Video Streams via Wavelet-based Temporal Consistency Field.
CoRR, December, 2025

SAM3-Adapter: Efficient Adaptation of Segment Anything 3 for Camouflage Object Segmentation, Shadow Detection, and Medical Image Segmentation.
CoRR, November, 2025

Breaking the Box: Enhancing Remote Sensing Image Segmentation with Freehand Sketches.
CoRR, March, 2025

ADMIRE: ADaptive method to enhance Multiple Image REsolutions in text-rich multi-image understanding.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

FourierMamba: Fourier Learning Integration with State Space Models for Image Deraining.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

SkySense-O: Towards Open-World Remote Sensing Interpretation with Vision-Centric Visual-Language Modeling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Learning Spatio-Temporal Sharpness Map for Video Deblurring.
IEEE Trans. Circuits Syst. Video Technol., May, 2024

Prototype Clustered Diffusion Models for Versatile Inverse Problems.
CoRR, 2024

The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report.
CoRR, 2024

Empowering Resampling Operation for Ultra-High-Definition Image Enhancement with Model-Aware Guidance.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
FouriDown: Factoring Down-Sampling into Shuffling and Superposing.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning Non-Uniform-Sampling for Ultra-High-Definition Image Enhancement.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Exploring Temporal Frequency Spectrum in Deep Video Deblurring.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Frequency-consistent Optimization for Image Enhancement Networks.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

Learning Semantic Degradation-Aware Guidance for Recognition-Driven Unsupervised Low-Light Image Enhancement.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Enhancement by Your Aesthetic: An Intelligible Unsupervised Personalized Enhancer for Low-Light Images.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Source-Free Domain Adaptation for Real-World Image Dehazing.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Dast-Net: Depth-Aware Spatio-Temporal Network for Video Deblurring.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022


