Junping Zhao

ORCID: 0009-0003-5268-2541

According to our database, Junping Zhao authored at least 18 papers between 2007 and 2026.

Bibliography

2026
Enhancing AI-Based Tropical Cyclone Track and Intensity Forecasting via Systematic Bias Correction.
CoRR, March, 2026

2025
FlexLink: Boosting your NVLink Bandwidth by 27% without accuracy concern.
CoRR, October, 2025

eLLM: Elastic Memory Management Framework for Efficient LLM Serving.
CoRR, June, 2025

LCD: Advancing Extreme Low-Bit Clustering for Large Language Models via Knowledge Distillation.
CoRR, June, 2025

FlexQuant: A Flexible and Efficient Dynamic Precision Switching Framework for LLM Quantization.
CoRR, June, 2025

Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs.
CoRR, March, 2025

ASTER: Adaptive Dynamic Layer-Skipping for Efficient Transformer Inference via Markov Decision Process.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

FlexQuant: A Flexible and Efficient Dynamic Precision Switching Framework for LLM Quantization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

EVASION: Efficient KV CAche CompreSsion vIa PrOduct QuaNtization.
Proceedings of the Design, Automation & Test in Europe Conference, 2025

2024
SearchQ: Search-Based Fine-Grained Quantization for Data-Free Model Compression.
IEEE Trans. Circuits Syst. Artif. Intell., December, 2024

LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache Management.
CoRR, 2024

vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving.
CoRR, 2024

GMLake: Efficient and Transparent GPU Memory Defragmentation for Large-scale DNN Training with Virtual Memory Stitching.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2020
GC-Net: Global context network for medical image segmentation.
Comput. Methods Programs Biomed., 2020

2016
Haar wavelets method for solving Poisson equations with jump conditions in irregular domain.
Adv. Comput. Math., 2016

2009
Research on Consumption Model about Urban Households Based on Keynes' Absolute Income Hypotheses in China.
Proceedings of the Second International Workshop on Knowledge Discovery and Data Mining, 2009

A Fast and Flexible Sorting Algorithm with CUDA.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2009

2007
Use of Open Access Electronic Journals by Chinese Scholars, and an Initiative to Facilitate Access to Chinese Journals.
Proceedings of the Openness in Digital Publishing: Awareness, Discovery and Access - Proceedings of the 11th International Conference on Electronic Publishing held in Vienna, 2007
