Yerui Sun

According to our database, Yerui Sun authored at least 14 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
FG²-GDN: Enhancing Long-Context Gated Delta Networks with Doubly Fine-Grained Control.
CoRR, April, 2026

SparseBalance: Load-Balanced Long Context Training with Dynamic Sparse Attention.
CoRR, April, 2026

AsyncTLS: Efficient Generative LLM Inference with Asynchronous Two-level Sparse Attention.
CoRR, April, 2026

Scaling Embeddings Outperforms Scaling Experts in Language Models.
CoRR, January, 2026

HetAuto: Cross-Cluster Auto-Parallelism for Heterogeneous Distributed Training.
Proceedings of the 21st European Conference on Computer Systems, 2026

2025
Efficient Context Scaling with LongCat ZigZag Attention.
CoRR, December, 2025

AFA-LoRA: Enabling Non-Linear Adaptations in LoRA with Activation Function Annealing.
CoRR, December, 2025

Accelerate Speculative Decoding with Sparse Computation in Verification.
CoRR, December, 2025

Optimizing Native Sparse Attention with Latent Attention and Local Global Alternating Strategies.
CoRR, November, 2025

WISCA: A Lightweight Model Transition Method to Improve LLM Training via Weight Scaling.
CoRR, August, 2025

2024
Flash Communication: Reducing Tensor Parallelization Bottleneck for Fast Large Language Model Inference.
CoRR, 2024

Integer Scale: A Free Lunch for Faster Fine-grained Quantization of LLMs.
CoRR, 2024

2023
A Speed Odyssey for Deployable Quantization of LLMs.
CoRR, 2023

FPTQ: Fine-grained Post-Training Quantization for Large Language Models.
CoRR, 2023

