Zhichen Zeng

Orcid: 0009-0005-0023-2367

Affiliations:
  • University of Washington
  • University of Science and Technology of China, Hefei, China


According to our database1, Zhichen Zeng authored at least 8 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Tactic: Adaptive Sparse Attention with Clustering and Distribution Fitting for Long-Context LLMs.
CoRR, February, 2025

LUT Tensor Core: A Software-Hardware Co-Design for LUT-Based Low-Bit LLM Inference.
Proceedings of the 52nd Annual International Symposium on Computer Architecture, 2025

Exploring the Performance Improvement of Tensor Processing Engines through Transformation in the Bit-weight Dimension of MACs.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2025

2024
Allo: A Programming Model for Composable Accelerator Design.
Proc. ACM Program. Lang., 2024

SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs.
CoRR, 2024

LUT Tensor Core: Lookup Table Enables Efficient Low-Bit LLM Inference Acceleration.
CoRR, 2024

EN-TensorCore: Advancing TensorCores Performance through Encoder-Based Methodology.
CoRR, 2024

EN-T: Optimizing Tensor Computing Engines Performance via Encoder-Based Methodology.
Proceedings of the 42nd IEEE International Conference on Computer Design, 2024


  Loading...