Ziwei He

Orcid: 0000-0001-5389-8457

According to our database1, Ziwei He authored at least 30 papers between 2017 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
VQKV: High-Fidelity and High-Ratio Cache Compression via Vector-Quantization.
CoRR, March, 2026

One Size Does Not Fit All: Token-Wise Adaptive Compression for KV Cache.
CoRR, March, 2026

CoDAR: Continuous Diffusion Language Models are More Powerful Than You Think.
CoRR, March, 2026

PonderLM-3: Adaptive Token-Wise Pondering with Differentiable Masking.
CoRR, March, 2026

AdaPonderLM: Gated Pondering Language Models with Token-Wise Adaptive Depth.
CoRR, March, 2026

MOVA: Towards Scalable and Synchronized Video-Audio Generation.
CoRR, February, 2026

FourierSampler: Unlocking Non-Autoregressive Potential in Diffusion Language Models via Frequency-Guided Generation.
CoRR, January, 2026

Sparse-dLLM: Accelerating Diffusion LLMs with Dynamic Cache Eviction.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
DiRL: An Efficient Post-Training Framework for Diffusion Language Models.
CoRR, December, 2025

Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs.
CoRR, December, 2025

AWM: Accurate Weight-Matrix Fingerprint for Large Language Models.
CoRR, October, 2025

Pretraining LLM with Latent Thoughts in Continuous Space.
CoRR, September, 2025

Fourier-VLM: Compressing Vision Tokens in the Frequency Domain for Large Vision-Language Models.
CoRR, August, 2025

LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs.
CoRR, June, 2025

Beyond Homogeneous Attention: Memory-Efficient LLMs via Fourier-Approximated KV Cache.
CoRR, June, 2025

Pretraining Language Models to Ponder in Continuous Space.
CoRR, May, 2025

FreqKV: Frequency Domain Key-Value Compression for Efficient Context Window Extension.
CoRR, May, 2025

TreeKV: Smooth Key-Value Cache Compression with Tree Structures.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

WeightedKV: Attention Scores Weighted Key-Value Cache Merging for Large Language Models.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
Fovea Transformer: Efficient Long-Context Modeling with Structured Fine-To-Coarse Attention.
Proceedings of the IEEE International Conference on Acoustics, 2024

Towards Controlled Table-to-Text Generation with Scientific Reasoning.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Cloud-Edge Collaboration-Based Distribution Network Reconfiguration for Voltage Preventive Control.
IEEE Trans. Ind. Informatics, December, 2023

Few-Shot Table-to-Text Generation with Prompt-based Adapter.
CoRR, 2023

Fourier Transformer: Fast Long Range Modeling by Removing Sequence Redundancy with FFT Operator.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
RASAT: Integrating Relational Structures into Pretrained Seq2Seq Model for Text-to-SQL.
CoRR, 2022

RASAT: Integrating Relational Structures into Pretrained Seq2Seq Model for Text-to-SQL.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
Evaluation of Achievement Effect on Visual Tools for Online Teaching-Learning Program in China During the Covid-19 Pandemic.
Proceedings of the Advances in Human Factors in Training, Education, and Learning Sciences, 2021

2019
Half&Half: New Tasks and Benchmarks for Studying Visual Common Sense.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

2017
A Comparative Study of Different Approaches for Tracking Communities in Evolving Social Networks.
Proceedings of the 2017 IEEE International Conference on Data Science and Advanced Analytics, 2017


  Loading...