Shizhi Tang

Orcid: 0000-0002-6543-0859

According to our database1, Shizhi Tang authored at least 14 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
ParDiff: Efficiently Parallelizing Reverse-Mode Automatic Differentiation with Direct Indexing.
Proceedings of the 31st ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2026

2025
IntelliGen: Instruction-Level Auto-tuning for Tensor Program with Monotonic Memory Optimization.
Proceedings of the 23rd ACM/IEEE International Symposium on Code Generation and Optimization, 2025

2023
Optimizing DNNs With Partially Equivalent Transformations and Automated Corrections.
IEEE Trans. Computers, December, 2023

Mat2Stencil: A Modular Matrix-Based DSL for Explicit and Implicit Matrix-Free PDE Solvers on Structured Grid.
Proc. ACM Program. Lang., October, 2023

Unified Programming Models for Heterogeneous High-Performance Computers.
J. Comput. Sci. Technol., February, 2023

PowerFusion: A Tensor Compiler with Explicit Data Movement Description and Instruction-level Graph IR.
CoRR, 2023

EINNET: Optimizing Tensor Programs with Derivation-Based Transformations.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

2022
OLLIE: Derivation-based Tensor Program Optimizer.
CoRR, 2022

Programming Matrices as Staged Sparse Rows to Generate Efficient Matrix-free Differential Equation Solver.
CoRR, 2022

BaGuaLu: targeting brain scale pretrained models with over 37 million cores.
Proceedings of the PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2, 2022

FreeTensor: a free-form DSL with holistic optimizations for irregular tensor programs.
Proceedings of the PLDI '22: 43rd ACM SIGPLAN International Conference on Programming Language Design and Implementation, San Diego, CA, USA, June 13, 2022

2021
PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections.
Proceedings of the 15th USENIX Symposium on Operating Systems Design and Implementation, 2021

2019
Student Cluster Competition 2018, Team Tsinghua University: Reproducing performance of multi-physics simulations of the Tsunamigenic 2004 Sumatra megathrust earthquake on the Intel Skylake Architecture.
Parallel Comput., 2019

Toward Edge-Assisted Video Content Intelligent Caching With Long Short-Term Memory Learning.
IEEE Access, 2019


  Loading...