Weiming Hu

Orcid: 0009-0003-5115-0498

Affiliations:
  • Shanghai Jiao Tong University, Shanghai, China


According to our database, Weiming Hu authored at least 4 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
eLLM: Elastic Memory Management Framework for Efficient LLM Serving.
CoRR, June, 2025

M-ANT: Efficient Low-bit Group Quantization for LLMs via Mathematically Adaptive Numerical Type.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2025

2024
vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving.
CoRR, 2024

2023
OliVe: Accelerating Large Language Models via Hardware-friendly Outlier-Victim Pair Quantization.
Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023
