Kai Zhao

Orcid: 0000-0002-2496-0829

Affiliations:
  • Shanghai University, School of Communication and Information Engineering, Shanghai, China
  • University of California, Department of Radiological Sciences, Los Angeles, CA (2022 - 2025)
  • Nankai University, College of Computer Science, Tianjin, China (PhD 2020)


According to our database1, Kai Zhao authored at least 47 papers between 2016 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Sub-JEPA: Subspace Gaussian Regularization for Stable End-to-End World Models.
CoRR, May, 2026

SeGPruner: Semantic-Geometric Visual Token Pruner for 3D Question Answering.
CoRR, March, 2026

MIST: A Benchmark and Baseline for Multi-Frame Infrared Small Target Detection in Complex Motion.
IEEE Trans. Image Process., 2026

PCa-Mamba: Spatiotemporal state space models for prostate cancer detection in multi-parametric MRI.
Medical Image Anal., 2026

Open-Vocabulary Camouflaged Object Segmentation with Cascaded Vision Language Models.
Comput. Vis. Media, 2026

AICM: An Anatomical-Prior Integrated Conditional Consistency Model for Self-Supervised MRI Through-Plane Super-Resolution.
Proceedings of the 23rd IEEE International Symposium on Biomedical Imaging, 2026

HiMaC: Histogram-Driven and Mask-Aware Consistency Models for MRI Reconstruction.
Proceedings of the 23rd IEEE International Symposium on Biomedical Imaging, 2026

Beyond Predictive Resampling: Learning Input-Agnostic Downsampling for Efficient Aligned Vision Recognition.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Vision-Motion-Reference Alignment for Referring Multi-Object Tracking via Multi-Modal Large Language Models.
CoRR, November, 2025

PoRe: Position-Reweighted Visual Token Pruning for Vision Language Models.
CoRR, August, 2025

MRI Super-Resolution With Partial Diffusion Models.
IEEE Trans. Medical Imaging, March, 2025

NExpR: Neural Explicit Representation for fast arbitrary-scale medical image super-resolution.
Comput. Biol. Medicine, 2025

MPR-Diff: A Self-Supervised Diffusion Model for Multi-Planar Reformation in Prostate Micro-Ultrasound Imaging.
Proceedings of the 22nd IEEE International Symposium on Biomedical Imaging, 2025

Minimally User-Guided 3D Micro-Ultrasound Prostate Segmentation.
Proceedings of the 22nd IEEE International Symposium on Biomedical Imaging, 2025

2024
Refining Uncertain Features With Self-Distillation for Face Recognition and Person Re-Identification.
IEEE Trans. Multim., 2024

Adaptive feature alignment for adversarial training.
Pattern Recognit. Lett., 2024

Dominant Shuffle: A Simple Yet Powerful Data Augmentation for Time-series Prediction.
CoRR, 2024

CSAM: A 2.5D Cross-Slice Attention Module for Anisotropic Volumetric Medical Image Segmentation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Cross-Slice Attention and Evidential Critical Loss for Uncertainty-Aware Prostate Cancer Detection.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

2023
PartDiff: Image Super-resolution with Partial Diffusion Models.
CoRR, 2023

High-Resolution 3D MRI With Deep Generative Networks via Novel Slice-Profile Transformation Super-Resolution.
IEEE Access, 2023

RPG-Palm: Realistic Pseudo-data Generation for Palmprint Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Adaptive Material Matching for Hyperspectral Imagery Destriping.
IEEE Trans. Geosci. Remote. Sens., 2022

Distribution alignment for cross-device palmprint recognition.
Pattern Recognit., 2022

Deep Hough Transform for Semantic Line Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Rethinking mask heads for partially supervised instance segmentation.
Neurocomputing, 2022

Video Polyp Segmentation: A Deep Learning Perspective.
Int. J. Autom. Comput., 2022

Video Polyp Segmentation: A Deep Learning Perspective.
CoRR, 2022

Geometric Synthesis: A Free lunch for Large-scale Palmprint Recognition Model Pretraining.
CoRR, 2022

BézierPalm: A Free Lunch for Palmprint Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022

ContrastMask: Contrastive Learning to Segment Every Thing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Deep Differentiable Random Forests for Age Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Res2Net: A New Multi-Scale Backbone Architecture.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Adaptive Feature Alignment for Adversarial Training.
CoRR, 2021

Structured sparsification with joint optimization of group convolution and channel shuffle.
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

2020
Dependency Aware Filter Pruning.
CoRR, 2020

Model-Agnostic Structured Sparsification with Learnable Channel Shuffle.
CoRR, 2020

Deep Hough Transform for Semantic Line Detection.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Optimizing the F-Measure for Threshold-Free Salient Object Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

RegularFace: Deep Face Recognition via Exclusive Regularization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
DHNet: working double hard to learn a convolutional neural network-based local descriptor.
J. Electronic Imaging, 2018

Optimizing the F-measure for Threshold-free Salient Object Detection.
CoRR, 2018

Hi-Fi: Hierarchical Feature Integration for Skeleton Detection.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Deep Regression Forests for Age Estimation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
DeepSkeleton: Learning Multi-Task Scale-Associated Deep Side Outputs for Object Skeleton Extraction in Natural Images.
IEEE Trans. Image Process., 2017

Label Distribution Learning Forests.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

2016
Object Skeleton Extraction in Natural Images by Fusing Scale-Associated Deep Side Outputs.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016


  Loading...