Zunhai Su
Bibliography
  2025
KVSink: Understanding and Enhancing the Preservation of Attention Sinks in KV Cache Quantization for LLMs.
    
  
    CoRR, August, 2025
    
  
    CoRR, July, 2025
    
  
AKVQ-VL: Attention-Aware KV Cache Adaptive 2-Bit Quantization for Vision-Language Models.
    
  
    CoRR, January, 2025
    
  
RotateKV: Accurate and Robust 2-Bit KV Cache Quantization for LLMs via Outlier-Aware Adaptive Rotations.
    
  
    Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025