Yifu Ding

Orcid: 0000-0002-3612-8757

Affiliations:
  • Beihang University, State Key Laboratory of Complex & Critical Software Environment, Beijing, China


According to our database, Yifu Ding authored at least 36 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
BWTA: Accurate and Efficient Binarized Transformer by Algorithm-Hardware Co-design.
CoRR, April, 2026

Diagonal-Tiled Mixed-Precision Attention for Efficient Low-Bit MXFP Inference.
CoRR, April, 2026

CMedBench: A Comprehensive Benchmark for Efficient Medical Large Language Models.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping.
CoRR, November, 2025

QVGen: Pushing the Limit of Quantized Video Generative Models.
CoRR, May, 2025

Dynamic Parallel Tree Search for Efficient LLM Reasoning.
CoRR, February, 2025

A survey of low-bit large language models: Basics, systems, and algorithms.
Neural Networks, 2025

DA-KD: Difficulty-Aware Knowledge Distillation for Efficient Large Language Models.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

ECLR'25: 2nd Workshop on Efficient Computing Under Limited Resources: Visual Computing.
Proceedings of the IEEE/CVF International Conference on Computer Vision, ICCV 2025, 2025

Low-Bit FlashAttention Accelerated Operator Design Based on Triton.
Proceedings of the IEEE/CVF International Conference on Computer Vision, ICCV 2025, 2025

Dynamic Parallel Tree Search for Efficient LLM Reasoning.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
BiFSMNv2: Pushing Binary Neural Networks for Keyword Spotting to Real-Network Performance.
IEEE Trans. Neural Networks Learn. Syst., August, 2024

A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms.
CoRR, 2024

PTQ4SAM: Post-Training Quantization for Segment Anything.
CoRR, 2024

LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Compressing Large Language Models by Joint Sparsification and Quantization.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

PTQ4SAM: Post-Training Quantization for Segment Anything.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Reg-PTQ: Regression-specialized Post-training Quantization for Fully Quantized Object Detector.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

DB-LLM: Accurate Dual-Binarization for Efficient LLMs.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Diverse Sample Generation: Pushing the Limit of Generative Data-Free Quantization.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Spatio-Temporal Adaptive Network With Bidirectional Temporal Difference for Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., September, 2023

Distribution-Sensitive Information Retention for Accurate Binary Neural Network.
Int. J. Comput. Vis., 2023

OHQ: On-chip Hardware-aware Quantization.
CoRR, 2023

QuantSR: Accurate Low-bit Quantization for Efficient Image Super-Resolution.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

BiBench: Benchmarking and Analyzing Network Binarization.
Proceedings of the International Conference on Machine Learning, 2023

2022
Towards Accurate Post-Training Quantization for Vision Transformer.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

BiFSMN: Binary Neural Network for Keyword Spotting.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

BiBERT: Accurate Fully Binarized BERT.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Exploring Endogenous Shift for Cross-domain Detection: A Large-scale Benchmark and Perturbation Suppression Network.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

An Empirical Study of Data-Free Quantization's Tuning Robustness.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021
Diverse Sample Generation: Pushing the Limit of Data-free Quantization.
CoRR, 2021

Over-sampling De-occlusion Attention Network for Prohibited Items Detection in Noisy X-ray Images.
CoRR, 2021

Improving Generalization of Deepfake Detection with Domain Adaptive Batch Normalization.
Proceedings of ADVM '21: The 1st International Workshop on Adversarial Learning for Multimedia, 2021

Multi-Pretext Attention Network For Few-Shot Learning With Self-Supervision.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

BiPointNet: Binary Neural Network for Point Clouds.
Proceedings of the 9th International Conference on Learning Representations, 2021

Diversifying Sample Generation for Accurate Data-Free Quantization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

