Shuzhang Zhong

Orcid: 0009-0006-5478-3604

According to our database¹, Shuzhang Zhong authored at least 14 papers between 2021 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

NASiC: 3D NAND-based CAM-Selected Multibit CIM Architecture for Efficient On-Device Mixture-of-Experts LLM Inference.

[BibT_eX]

[DOI]

CoRR, May, 2026

Breaking the Reward Barrier: Accelerating Tree-of-Thought Reasoning via Speculative Exploration.

[BibT_eX]

[DOI]

CoRR, May, 2026

DualSpec: Accelerating Deep Research Agents via Dual-Process Action Speculation.

[BibT_eX]

[DOI]

CoRR, March, 2026

HyPER: Bridging Exploration and Exploitation for Scalable LLM Reasoning with Hypothesis Path Expansion and Reduction.

[BibT_eX]

[DOI]

CoRR, February, 2026

2025

H2EAL: Hybrid-Bonding Architecture with Hybrid Sparse Attention for Efficient Long-Context LLM Inference.

[BibT_eX]

[DOI]

CoRR, August, 2025

HD-MoE: Hybrid and Dynamic Parallelism for Mixture-of-Expert LLMs with 3D Near-Memory Processing.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2025

H<sup>2</sup>EAL: Hybrid-Bonding Architecture with Hybrid Sparse Attention for Efficient Long-Context LLM Inference.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2025

HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference.

[BibT_eX]

[DOI]

Proceedings of the 62nd ACM/IEEE Design Automation Conference, 2025

SpecASR: Accelerating LLM-based Automatic Speech Recognition via Speculative Decoding.

[BibT_eX]

[DOI]

Proceedings of the 62nd ACM/IEEE Design Automation Conference, 2025

2024

ProPD: Dynamic Token Tree Pruning and Generation for LLM Parallel Decoding.

[BibT_eX]

[DOI]

Proceedings of the 43rd IEEE/ACM International Conference on Computer-Aided Design, 2024

AdapMoE: Adaptive Sensitivity-based Expert Gating and Management for Efficient MoE Inference.

[BibT_eX]

[DOI]

Proceedings of the 43rd IEEE/ACM International Conference on Computer-Aided Design, 2024

PrivQuant: Communication-Efficient Private Inference with Quantized Network/Protocol Co-Optimization.

[BibT_eX]

[DOI]

Proceedings of the 43rd IEEE/ACM International Conference on Computer-Aided Design, 2024

2023

Memory-aware Scheduling for Complex Wired Networks with Iterative Graph Optimization.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM International Conference on Computer Aided Design, 2023

2021

dgQuEST: Accelerating Large Scale Quantum Circuit Simulation through Hybrid CPU-GPU Memory Hierarchies.

[BibT_eX]

[DOI]

Proceedings of the Network and Parallel Computing, 2021

Shuzhang Zhong

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...