Heng Liao

Orcid: 0009-0002-5992-5000

According to our database, Heng Liao authored at least 17 papers between 1996 and 2026.

Bibliography

2026
HiFloat4 Format for Language Model Inference.
CoRR, February, 2026

BAPS: A Fine-Grained Low-Precision Scheme for Softmax in Attention via Block-Aware Precision reScaling.
CoRR, February, 2026

2025
The application of FCM-based computer image segmentation technology in agricultural production.
Serv. Oriented Comput. Appl., December, 2025

Serving Large Language Models on Huawei CloudMatrix384.
CoRR, June, 2025

UB-Mesh: a Hierarchically Localized nD-FullMesh Datacenter Network Architecture.
CoRR, March, 2025

UB-Mesh: A Hierarchically Localized nD-FullMesh Data Center Network Architecture.
IEEE Micro, 2025

RICH Prefetcher: Storing Rich Information in Memory to Trade Capacity and Bandwidth for Latency Hiding.
Proceedings of the 58th IEEE/ACM International Symposium on Microarchitecture, 2025

UB-mesh: An New Interconnection Technology for Large AI SuperNode.
Proceedings of the IEEE Hot Chips 37 Symposium, 2025

Libra: A Hybrid-Sparse Attention Accelerator Featuring Multi-Level Workload Balance.
Proceedings of the 62nd ACM/IEEE Design Automation Conference, 2025

2024
Computational Graph Representation of Equations System Constructors in Hierarchical Circuit Simulation.
CoRR, 2024

MemoryFormer: Minimize Transformer Computation by Removing Fully-Connected Layers.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

2023
LEGO-Prover: Neural Theorem Proving with Growing Libraries.
CoRR, 2023

2021
Ascend: a Scalable and Unified Architecture for Ubiquitous Deep Neural Network Computing: Industry Track Paper.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2021

2019
DaVinci: A Scalable Architecture for Neural Network Computing.
Proceedings of the 2019 IEEE Hot Chips 31 Symposium (HCS), 2019

2006
Parallel Switch System with QoS Guarantee for Real-Time Traffic.
J. Comput. Sci. Technol., 2006

1997
Available Parallelism in Video Applications.
Proceedings of the Thirtieth Annual IEEE/ACM International Symposium on Microarchitecture, 1997

1996
DYNAMEM - A microarchitecture for improving memory disambiguation at run-time.
J. Comput. Sci. Technol., 1996

