Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Optimized Multi-Token Joint Decoding With Auxiliary Model for LLM Inference.

[BibT_eX]

[DOI]

Zongyue Qin

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Automated Design Space Exploration in High-Level Physical Synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2025

InTRRA: Inter-Task Resource-Repurposing Accelerator for Efficient Transformer Inference on FPGAs.

[BibT_eX]

[DOI]

Proceedings of the 2025 ACM/SIGDA International Symposium on Field Programmable Gate Arrays, 2025

NoH: NoC Compilation in High-Level Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 33rd IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2025

InTAR: Inter-Task Auto-Reconfigurable Accelerator Design for High Data Volume Variation in DNNs.

[BibT_eX]

[DOI]

Proceedings of the 33rd IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2025

Monolithic 3D FPGA Design and Synthesis with Back-End-of-Line Configuration Memories.

[BibT_eX]

[DOI]

Proceedings of the 62nd ACM/IEEE Design Automation Conference, 2025

Dynamic-Width Speculative Beam Decoding for LLM Inference.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Dynamic-Width Speculative Beam Decoding for Efficient LLM Inference.

[BibT_eX]

[DOI]

CoRR, 2024

Multi-Token Joint Speculative Decoding for Accelerating Large Language Model Inference.

[BibT_eX]

[DOI]

CoRR, 2024

HMT: Hierarchical Memory Transformer for Long Context Language Processing.

[BibT_eX]

[DOI]

CoRR, 2024

LevelST: Stream-based Accelerator for Sparse Triangular Solver.

[BibT_eX]

[DOI]

Proceedings of the 2024 ACM/SIGDA International Symposium on Field Programmable Gate Arrays, 2024

2022

Optimization of Assisted Search Over Server-Mediated Peer-to-peer Networks.

[BibT_eX]

[DOI]

Zifan He

Leonard Kleinrock

Proceedings of the IEEE Global Communications Conference, 2022

Zifan He

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...