Zifan He

Orcid: 0009-0008-3482-838X

According to our database1, Zifan He authored at least 14 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Genome-Anchored Foundation Model Embeddings Improve Molecular Prediction from Histology Images.
CoRR, June, 2025

Towards Hierarchical Multi-Step Reward Models for Enhanced Reasoning in Large Language Models.
CoRR, March, 2025

Monolithic 3D FPGAs Utilizing Back-End-of-Line Configuration Memories.
CoRR, January, 2025

HMT: Hierarchical Memory Transformer for Efficient Long Context Language Processing.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Optimized Multi-Token Joint Decoding With Auxiliary Model for LLM Inference.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

InTRRA: Inter-Task Resource-Repurposing Accelerator for Efficient Transformer Inference on FPGAs.
Proceedings of the 2025 ACM/SIGDA International Symposium on Field Programmable Gate Arrays, 2025

NoH: NoC Compilation in High-Level Synthesis.
Proceedings of the 33rd IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2025

InTAR: Inter-Task Auto-Reconfigurable Accelerator Design for High Data Volume Variation in DNNs.
Proceedings of the 33rd IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2025

Dynamic-Width Speculative Beam Decoding for LLM Inference.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Dynamic-Width Speculative Beam Decoding for Efficient LLM Inference.
CoRR, 2024

Multi-Token Joint Speculative Decoding for Accelerating Large Language Model Inference.
CoRR, 2024

HMT: Hierarchical Memory Transformer for Long Context Language Processing.
CoRR, 2024

LevelST: Stream-based Accelerator for Sparse Triangular Solver.
Proceedings of the 2024 ACM/SIGDA International Symposium on Field Programmable Gate Arrays, 2024

2022
Optimization of Assisted Search Over Server-Mediated Peer-to-peer Networks.
Proceedings of the IEEE Global Communications Conference, 2022


  Loading...