Shuzhang Zhong
Orcid: 0009-0006-5478-3604
According to our database, Shuzhang Zhong authored at least 12 papers between 2021 and 2026.
Bibliography
2026
CoRR, March, 2026
HyPER: Bridging Exploration and Exploitation for Scalable LLM Reasoning with Hypothesis Path Expansion and Reduction.
CoRR, February, 2026
2025
H2EAL: Hybrid-Bonding Architecture with Hybrid Sparse Attention for Efficient Long-Context LLM Inference.
CoRR, August, 2025
HD-MoE: Hybrid and Dynamic Parallelism for Mixture-of-Expert LLMs with 3D Near-Memory Processing.
Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2025
H²EAL: Hybrid-Bonding Architecture with Hybrid Sparse Attention for Efficient Long-Context LLM Inference.
Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2025
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference.
Proceedings of the 62nd ACM/IEEE Design Automation Conference, 2025
SpecASR: Accelerating LLM-based Automatic Speech Recognition via Speculative Decoding.
Proceedings of the 62nd ACM/IEEE Design Automation Conference, 2025
2024
Proceedings of the 43rd IEEE/ACM International Conference on Computer-Aided Design, 2024
AdapMoE: Adaptive Sensitivity-based Expert Gating and Management for Efficient MoE Inference.
Proceedings of the 43rd IEEE/ACM International Conference on Computer-Aided Design, 2024
PrivQuant: Communication-Efficient Private Inference with Quantized Network/Protocol Co-Optimization.
Proceedings of the 43rd IEEE/ACM International Conference on Computer-Aided Design, 2024
2023
Memory-aware Scheduling for Complex Wired Networks with Iterative Graph Optimization.
Proceedings of the IEEE/ACM International Conference on Computer Aided Design, 2023
2021
dgQuEST: Accelerating Large Scale Quantum Circuit Simulation through Hybrid CPU-GPU Memory Hierarchies.
Proceedings of the Network and Parallel Computing, 2021