Yuechi Zhou

According to our database, Yuechi Zhou authored at least 10 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
RuPLaR: Efficient Latent Compression of LLM Reasoning Chains with Rule-Based Priors From Multi-Step to One-Step.
CoRR, May, 2026

LongFlow: Efficient KV Cache Compression for Reasoning M.
CoRR, March, 2026

ALD2: Adaptive layer-wise denoising decoding for hallucinations mitigation in large vision-language models.
Inf. Process. Manag., 2026

2025
A³: Attention-Aware Accurate KV Cache Fusion for Fast Large Language Model Serving.
CoRR, November, 2025

CaliDrop: KV Cache Compression with Calibration.
CoRR, July, 2025

ALW: Adaptive Layer-Wise contrastive decoding enhancing reasoning ability in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Accurate KV Cache Quantization with Outlier Tokens Tracing.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
OpenBA-V2: Reaching 77.3% High Compression Ratio with Fast Multi-Stage Pruning.
CoRR, 2024

The Benefits in Shallow: Merge Decoding Across Large Language Model Layers.
Proceedings of the Natural Language Processing and Chinese Computing, 2024

2022
Chinese grammatical error correction based on knowledge distillation.
CoRR, 2022

