Juntao Zhao

Orcid: 0000-0003-3376-0607

According to our database1, Juntao Zhao authored at least 24 papers between 2011 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Sandwich: Separating Prefill-Decode Compilation for Efficient CPU LLM Serving.
CoRR, July, 2025

OVERLORD: Ultimate Scaling of DataLoader for Multi-Source Large Foundation Model Training.
CoRR, April, 2025

Efficient LLM Serving on Hybrid Real-time and Best-effort Requests.
CoRR, April, 2025

Reinforcement learning-enhanced variable neighborhood search strategies for the k-clustering minimum biclique completion problem.
Comput. Oper. Res., 2025

A population-based simulated annealing approach with adaptive mutation operator for solving the discounted {0-1} knapsack problem.
Appl. Soft Comput., 2025

2024
An adaptive evolutionary search-based method for efficiently tackling the set-union knapsack problem.
Inf. Sci., 2024

A Novel robot Path Planning Algorithm based on the Improved Wild Horse optimiser with Hybrid Strategies.
Int. J. Robotics Autom., 2024

QSpec: Speculative Decoding with Complementary Quantization Schemes.
CoRR, 2024

LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization.
CoRR, 2024

An incremental method-based machine learning approach for max-min knapsack with multiple scenarios.
Comput. Ind. Eng., 2024

A reinforcement learning-driven cooperative scatter search for the knapsack problem with forfeits.
Comput. Ind. Eng., 2024

POSTER: LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization.
Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2024

QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

CDMPP: A Device-Model Agnostic Framework for Latency Prediction of Tensor Programs.
Proceedings of the Nineteenth European Conference on Computer Systems, 2024

Tackling the Generalized Max-Mean Dispersion Problem with a Hybrid Population Method.
Proceedings of the 10th International Conference on Control, 2024

A Cooperative Method for Solving the Set-Union Knapsack Problem.
Proceedings of the 10th International Conference on Control, 2024

2023
CryptoArcade: A Cloud Gaming System With Blockchain-Based Token Economy.
IEEE Trans. Cloud Comput., 2023

Adaptive Message Quantization and Parallelization for Distributed Full-graph GNN Training.
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

A Research on the Application of Digital Twin in Ship Industry.
Proceedings of the 2023 3rd International Conference on Big Data, 2023

2022
Synchronization in games sound: an audiovisual study on player experience and performance.
Proceedings of the GameSys 2022: Proceedings of the 2nd Workshop on Games Systems, 2022

2020
CloudArcade: A Blockchain Empowered Cloud Gaming System.
Proceedings of the BSCI '20: Proceedings of the 2nd ACM International Symposium on Blockchain and Secure Critical Infrastructure, 2020

2015
Coordinated multi-user transmission in distributed antenna-based spectrum-sharing systems.
Proceedings of the International Conference on Wireless Communications & Signal Processing, 2015

Secrecy rate region of independent parallel multiple-carrier two-way secure communications with secure feedback.
Proceedings of the International Conference on Wireless Communications & Signal Processing, 2015

2011
High Performance LDPC Decoder on CELL BE for WiMAX System.
Proceedings of the Third International Conference on Communications and Mobile Computing, 2011


  Loading...