Shuzhang Zhong

Orcid: 0009-0006-5478-3604

According to our database1, Shuzhang Zhong authored at least 12 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
DualSpec: Accelerating Deep Research Agents via Dual-Process Action Speculation.
CoRR, March, 2026

HyPER: Bridging Exploration and Exploitation for Scalable LLM Reasoning with Hypothesis Path Expansion and Reduction.
CoRR, February, 2026

2025
H2EAL: Hybrid-Bonding Architecture with Hybrid Sparse Attention for Efficient Long-Context LLM Inference.
CoRR, August, 2025

HD-MoE: Hybrid and Dynamic Parallelism for Mixture-of-Expert LLMs with 3D Near-Memory Processing.
Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2025

H<sup>2</sup>EAL: Hybrid-Bonding Architecture with Hybrid Sparse Attention for Efficient Long-Context LLM Inference.
Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2025

HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference.
Proceedings of the 62nd ACM/IEEE Design Automation Conference, 2025

SpecASR: Accelerating LLM-based Automatic Speech Recognition via Speculative Decoding.
Proceedings of the 62nd ACM/IEEE Design Automation Conference, 2025

2024
ProPD: Dynamic Token Tree Pruning and Generation for LLM Parallel Decoding.
Proceedings of the 43rd IEEE/ACM International Conference on Computer-Aided Design, 2024

AdapMoE: Adaptive Sensitivity-based Expert Gating and Management for Efficient MoE Inference.
Proceedings of the 43rd IEEE/ACM International Conference on Computer-Aided Design, 2024

PrivQuant: Communication-Efficient Private Inference with Quantized Network/Protocol Co-Optimization.
Proceedings of the 43rd IEEE/ACM International Conference on Computer-Aided Design, 2024

2023
Memory-aware Scheduling for Complex Wired Networks with Iterative Graph Optimization.
Proceedings of the IEEE/ACM International Conference on Computer Aided Design, 2023

2021
dgQuEST: Accelerating Large Scale Quantum Circuit Simulation through Hybrid CPU-GPU Memory Hierarchies.
Proceedings of the Network and Parallel Computing, 2021


  Loading...