Kai Han

ORCID: 0000-0002-9761-2702

Affiliations:

Huawei Technologies, Noah's Ark Lab
Peking University, MOE Key Laboratory of Machine Perception / Cooperative Medianet Innovation Center, Beijing, China (former)

According to our database¹, Kai Han authored at least 131 papers between 2017 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2026

Near-Policy: Accelerating On-Policy Distillation via Asynchronous Generation and Selective Packing.

CoRR, May, 2026

Beyond Masks: Efficient, Flexible Diffusion Language Models via Deletion-Insertion Processes.

CoRR, March, 2026

Mask Is What DLLM Needs: A Masked Data Training Paradigm for Diffusion LLMs.

CoRR, March, 2026

DLLM Agent: See Farther, Run Faster.

CoRR, February, 2026

Multimodal Latent Reasoning via Hierarchical Visual Cues Injection.

CoRR, February, 2026

An Empirical Study of World Model Quantization.

CoRR, February, 2026

Toward Effective Knowledge Distillation: Navigating Beyond Small-Data Pitfall.

IEEE Trans. Pattern Anal. Mach. Intell., January, 2026

Top 10 Open Challenges Steering the Future of Diffusion Language Model and Its Variants.

CoRR, January, 2026

Diffusion In Diffusion: Reclaiming Global Coherence in Semi-Autoregressive Diffusion.

CoRR, January, 2026

MATCH: Modulating Attention via In-Context Retrieval for Long-Context Transformers.

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

PocketLLM: Ultimate Compression of Large Language Models via Meta Networks.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

VersatileFFN: Achieving Parameter Efficiency in LLMs via Adaptive Wide-and-Deep Reuse.

CoRR, December, 2025

From Next-Token to Next-Block: A Principled Adaptation Path for Diffusion LLMs.

CoRR, December, 2025

Nexus: Higher-Order Attention Mechanisms in Transformers.

CoRR, December, 2025

ROOT: Robust Orthogonalized Optimizer for Neural Network Training.

CoRR, November, 2025

Positional Preservation Embedding for Multimodal Large Language Models.

CoRR, October, 2025

ScaleNet: Scaling up Pretrained Neural Networks with Incremental Parameters.

CoRR, October, 2025

Revealing the Power of Post-Training for Small Language Models via Knowledge Distillation.

CoRR, September, 2025

OmniEval: A Benchmark for Evaluating Omni-modal Models with Visual, Auditory, and Textual Inputs.

CoRR, June, 2025

EAQuant: Enhancing Post-Training Quantization for MoE Models via Expert-Aware Optimization.

CoRR, June, 2025

Pangu Embedded: An Efficient Dual-system LLM Reasoner with Metacognition.

CoRR, May, 2025

Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity.

CoRR, May, 2025

A Physics-guided Multimodal Transformer Path to Weather and Climate Sciences.

CoRR, April, 2025

Transferable text data distillation by trajectory matching.

CoRR, April, 2025

Post-Training Quantization for Diffusion Transformer via Hierarchical Timestep Grouping.

CoRR, March, 2025

Rethinking Feature Reconstruction via Category Prototype in Semantic Segmentation.

IEEE Trans. Image Process., 2025

ScaleNet: Scaling up Pretrained Neural Networks With Incremental Parameters.

IEEE Trans. Image Process., 2025

GPT4Image: Large Pre-trained Models Help Vision Models Learn Better on Perception Task.

Proceedings of the Companion Proceedings of the ACM on Web Conference 2025, 2025

EMS-SD: Efficient Multi-sample Speculative Decoding for Accelerating Large Language Models.

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

DenseSSM: State Space Models with Dense Hidden Connection for Efficient Large Language Models.

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Mixture of Lookup Experts.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

SpeCache: Speculative Key-Value Caching for Efficient Generation of LLMs.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

LLM Data Selection and Utilization via Dynamic Bi-level Optimization.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

L-Man: A Large Multi-modal Model Unifying Human-centric Tasks.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Eve: Efficient Multimodal Vision Language Models with Elastic Visual Experts.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Local Means Binary Networks for Image Super-Resolution.

IEEE Trans. Neural Networks Learn. Syst., May, 2024

Free Video-LLM: Prompt-guided Visual Perception for Efficient Training-free Video LLMs.

CoRR, 2024

Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting.

CoRR, 2024

GhostNetV3: Exploring the Training Strategies for Compact Models.

CoRR, 2024

DenseMamba: State Space Models with Dense Hidden Connection for Efficient Large Language Models.

CoRR, 2024

SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution.

CoRR, 2024

A Survey on Transformer Compression.

CoRR, 2024

Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models.

CoRR, 2024

An Empirical Study of Scaling Law for OCR.

CoRR, 2024

Star-Agents: Automatic Data Optimization with LLM Agents for Instruction Tuning.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Kangaroo: Lossless Self-Speculative Decoding for Accelerating LLMs via Double Early Exiting.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

MemoryFormer: Minimize Transformer Computation by Removing Fully-Connected Layers.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Rethinking Optimization and Architecture for Tiny Language Models.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Data-efficient Large Vision Models through Sequential Autoregression.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

A Robust Audio Deepfake Detection System via Multi-View Feature.

Proceedings of the IEEE International Conference on Acoustics, 2024

Adapt Without Forgetting: Distill Proximity from Dual Teachers in Vision-Language Models.

Proceedings of the Computer Vision - ECCV 2024, 2024

Token Compensator: Altering Inference Cost of Vision Transformer Without Re-tuning.

Proceedings of the Computer Vision - ECCV 2024, 2024

An Empirical Study of Scaling Law for Scene Text Recognition.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

ParameterNet: Parameters are All You Need for Large-Scale Visual Pretraining of Mobile Networks.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Complementary Sparsity: Accelerating Sparse CNNs with High Accuracy on General-Purpose Computing Platforms.

Trans. Mach. Learn. Res., 2023

A Survey on Vision Transformer.

IEEE Trans. Pattern Anal. Mach. Intell., 2023

PanGu-π: Enhancing Language Model Architectures via Nonlinearity Compensation.

CoRR, 2023

LightCLIP: Learning Multi-Level Interaction for Lightweight Vision-Language Models.

CoRR, 2023

Category Feature Transformer for Semantic Segmentation.

CoRR, 2023

GPT4Image: Can Large Pre-trained Models Help Vision Models on Perception Tasks?

CoRR, 2023

VanillaKD: Revisit the Power of Vanilla Knowledge Distillation from Small Scale to Large Scale.

CoRR, 2023

Gold-YOLO: Efficient Object Detector via Gather-and-Distribute Mechanism.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Species196: A One-Million Semi-supervised Dataset for Fine-grained Species Recognition.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

One-for-All: Bridge the Gap Between Heterogeneous Architectures in Knowledge Distillation.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Revisit the Power of Vanilla Knowledge Distillation: from Small Scale to Large Scale.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

GhostRNN: Reducing State Redundancy in RNN with Cheap Operations.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Diffusion-Based 3D Human Pose Estimation with Multi-Hypothesis Aggregation.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Boosting Semantic Segmentation from the Perspective of Explicit Class Embeddings.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Masked Image Modeling with Local Multi-Scale Reconstruction.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Network Expansion For Practical Training Acceleration.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

GhostSR: Learning Ghost Features for Efficient Image Super-Resolution.

Trans. Mach. Learn. Res., 2022

Learning Versatile Convolution Filters for Efficient Visual Recognition.

IEEE Trans. Pattern Anal. Mach. Intell., 2022

GhostNets on Heterogeneous Devices via Cheap Operations.

Int. J. Comput. Vis., 2022

FastMIM: Expediting Masked Image Modeling Pre-training for Vision.

CoRR, 2022

PyramidTNT: Improved Transformer-in-Transformer Baselines with Pyramid Architecture.

CoRR, 2022

GhostNetV2: Enhance Cheap Operation with Long-Range Attention.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Accelerating Sparse Convolution with Column Vector-Wise Sparsity.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Redistribution of Weights and Activations for AdderNet Quantization.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A Transformer-Based Object Detector with Coarse-Fine Crossing Representations.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Vision GNN: An Image is Worth Graph of Nodes.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

An Image Patch is a Wave: Phase-Aware Vision MLP.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Patch Slimming for Efficient Vision Transformers.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Network Amplification with Efficient MACs Allocation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Hire-MLP: Vision MLP via Hierarchical Rearrangement.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

CMT: Convolutional Neural Networks Meet Vision Transformers.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Instance-Aware Dynamic Neural Network Quantization.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Greedy Network Enlarging.

CoRR, 2021

CMT: Convolutional Neural Networks Meet Vision Transformers.

CoRR, 2021

Efficient Vision Transformers via Fine-Grained Manifold Distillation.

CoRR, 2021

Post-Training Quantization for Vision Transformer.

CoRR, 2021

Visual Transformer Pruning.

CoRR, 2021

AdderNet and its Minimalist Hardware Design for Energy-Efficient Artificial Intelligence.

CoRR, 2021

GhostSR: Learning Ghost Features for Efficient Image Super-Resolution.

CoRR, 2021

Mining Neighbor Frames for Person Re-identification by Global Optimal Tracking.

Proceedings of the Advances in Swarm Intelligence - 12th International Conference, 2021

Dynamic Resolution Network.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning Frequency Domain Approximation for Binary Neural Networks.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Augmented Shortcuts for Vision Transformers.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Post-Training Quantization for Vision Transformer.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Transformer in Transformer.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

ReNAS: Relativistic Evaluation of Neural Architecture Search.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Positive-Unlabeled Data Purification in the Wild for Object Detection.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Distilling Object Detectors via Decoupled Features.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Multi-bit Adaptive Distillation for Binary Neural Networks.

Ying Nie, Kai Han, Yunhe Wang

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020

A Survey on Visual Transformer.

CoRR, 2020

Dynamic Feature Pyramid Networks for Object Detection.

CoRR, 2020

VEGA: Towards an End-to-End Configurable AutoML Pipeline.

CoRR, 2020

Widening and Squeezing: Towards Accurate and Efficient QNNs.

CoRR, 2020

Searching for Low-Bit Weights in Quantized Neural Networks.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Model Rubik's Cube: Twisting Resolution, Depth and Width for TinyNets.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Training Binary Neural Networks through Learning with Noisy Supervision.

Proceedings of the 37th International Conference on Machine Learning, 2020

Balanced Binary Neural Networks with Gated Residual.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

GhostNet: More Features From Cheap Operations.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Hit-Detector: Hierarchical Trinity Architecture Search for Object Detection.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

RNAS: Architecture Ranking for Powerful Networks.

CoRR, 2019

Balanced Binary Neural Networks with Gated Residual.

CoRR, 2019

Full-Stack Filters to Build Minimum Viable CNNs.

CoRR, 2019

Positive-Unlabeled Compression on the Cloud.

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Learning Instance-wise Sparsity for Accelerating Deep Models.

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Attribute Aware Pooling for Pedestrian Attribute Recognition.

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Searching for Accurate Binary Neural Architectures.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Co-Evolutionary Compression for Unpaired Image Translation.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Beyond Human Parts: Dual Part-Aligned Representations for Person Re-Identification.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Low-resolution Visual Recognition via Deep Feature Distillation.

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Greedy Hash: Towards Fast Optimization for Accurate Hash Coding in CNN.

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Attribute-Aware Attention Model for Fine-grained Representation Learning.

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Autoencoder Inspired Unsupervised Feature Selection.

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Autoencoder Feature Selector.

Kai Han, Chao Li, Xin Shi

CoRR, 2017
