Pingzhi Tang

Orcid: 0009-0001-7958-7144

According to our database1, Pingzhi Tang authored at least 10 papers between 2025 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
TEAM: Temporal-Spatial Consistency Guided Expert Activation for MoE Diffusion Language Model Acceleration.
CoRR, February, 2026

Breaking the Blocks: Continuous Low-Rank Decomposed Scaling for Unified LLM Quantization and Adaptation.
CoRR, January, 2026

TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill & Decode Inference.
Proceedings of the 31st ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2026

Knowledge is Not Enough: Injecting RL Skills for Continual Adaptation.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025
Orchestrating Dual-Boundaries: An Arithmetic Intensity Inspired Acceleration Framework for Diffusion Language Models.
CoRR, November, 2025

TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill and Decode Inference.
CoRR, August, 2025

HD-PiSSA: High-Rank Distributed Orthogonal Adaptation.
CoRR, May, 2025

TransMLA: Migrating GQA Models to MLA with Full DeepSeek Compatibility and Speedup.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

CLOVER: Cross-Layer Orthogonal Vectors Pruning.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

HD-PiSSA: High-Rank Distributed Orthogonal Adaptation.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025


  Loading...