Yushu Zhao
Orcid: 0009-0008-7225-1366
According to our database1,
Yushu Zhao authored at least 6 papers
between 2021 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
MoBiLE: Efficient Mixture-of-Experts Inference on Consumer GPU with Mixture of Big Little Experts.
Proceedings of the 31st Asia and South Pacific Design Automation Conference, 2026
2025
PuzzleMoE: Efficient Compression of Large Mixture-of-Experts Models via Sparse Expert Merging and Bit-packed inference.
CoRR, November, 2025
From Quarter to All: Accelerating Speculative LLM Decoding via Floating-Point Exponent Remapping and Parameter Sharing.
CoRR, October, 2025
23.8 An 88.36TOPS/W Bit-Level-Weight-Compressed Large-Language-Model Accelerator with Cluster-Aligned INT-FP-GEMM and Bi-Dimensional Workflow Reformulation.
Proceedings of the IEEE International Solid-State Circuits Conference, 2025
2024
Co-DTC: Concentric Trench-Based Integrated Capacitors for Advanced Chiplet-Based Platforms.
Proceedings of the Great Lakes Symposium on VLSI 2024, 2024
2021
Proceedings of the 19th IEEE/ACIS International Conference on Computer and Information Science, 2021