Pingzhi Tang
Orcid: 0009-0001-7958-7144
According to our database1,
Pingzhi Tang authored at least 10 papers
between 2025 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
TEAM: Temporal-Spatial Consistency Guided Expert Activation for MoE Diffusion Language Model Acceleration.
CoRR, February, 2026
Breaking the Blocks: Continuous Low-Rank Decomposed Scaling for Unified LLM Quantization and Adaptation.
CoRR, January, 2026
TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill & Decode Inference.
Proceedings of the 31st ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2026
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026
2025
Orchestrating Dual-Boundaries: An Arithmetic Intensity Inspired Acceleration Framework for Diffusion Language Models.
CoRR, November, 2025
TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill and Decode Inference.
CoRR, August, 2025
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025
Proceedings of the Forty-second International Conference on Machine Learning, 2025
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025