Shuzhang Zhong

Orcid: 0009-0006-5478-3604

According to our database1, Shuzhang Zhong authored at least 9 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
HD-MoE: Hybrid and Dynamic Parallelism for Mixture-of-Expert LLMs with 3D Near-Memory Processing.
CoRR, September, 2025

H2EAL: Hybrid-Bonding Architecture with Hybrid Sparse Attention for Efficient Long-Context LLM Inference.
CoRR, August, 2025

HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference.
Proceedings of the 62nd ACM/IEEE Design Automation Conference, 2025

SpecASR: Accelerating LLM-based Automatic Speech Recognition via Speculative Decoding.
Proceedings of the 62nd ACM/IEEE Design Automation Conference, 2025

2024
ProPD: Dynamic Token Tree Pruning and Generation for LLM Parallel Decoding.
Proceedings of the 43rd IEEE/ACM International Conference on Computer-Aided Design, 2024

AdapMoE: Adaptive Sensitivity-based Expert Gating and Management for Efficient MoE Inference.
Proceedings of the 43rd IEEE/ACM International Conference on Computer-Aided Design, 2024

PrivQuant: Communication-Efficient Private Inference with Quantized Network/Protocol Co-Optimization.
Proceedings of the 43rd IEEE/ACM International Conference on Computer-Aided Design, 2024

2023
Memory-aware Scheduling for Complex Wired Networks with Iterative Graph Optimization.
Proceedings of the IEEE/ACM International Conference on Computer Aided Design, 2023

2021
dgQuEST: Accelerating Large Scale Quantum Circuit Simulation through Hybrid CPU-GPU Memory Hierarchies.
Proceedings of the Network and Parallel Computing, 2021


  Loading...