Xing Su

Orcid: 0000-0002-7514-1495

Affiliations:
  • National University of Defense Technology, Changsha, China


According to our database, Xing Su authored at least 8 papers between 2015 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of five.

Bibliography

2024
Optimizing Full-Spectrum Matrix Multiplications on ARMv8 Multi-Core CPUs.
IEEE Trans. Parallel Distributed Syst., March, 2024

Optimizing Attention by Exploiting Data Reuse on ARM Multi-core CPUs.
Proceedings of the 38th ACM International Conference on Supercomputing, 2024

2023
Characterize and Optimize Dense Linear Solver on Multi-core CPUs.
Proceedings of the 29th IEEE International Conference on Parallel and Distributed Systems, 2023

2021
LIBSHALOM: optimizing small and irregular-shaped matrix multiplications on ARMv8 multi-cores.
Proceedings of the International Conference for High Performance Computing, 2021

2019
SCP: Shared Cache Partitioning for High-Performance GEMM.
ACM Trans. Archit. Code Optim., 2019

2017
Automatic generation of fast BLAS3-GEMM: a portable compiler approach.
Proceedings of the 2017 International Symposium on Code Generation and Optimization, 2017

2016
Galaxyfly: A Novel Family of Flexible-Radix Low-Diameter Topologies for Large-Scales Interconnection Networks.
Proceedings of the 2016 International Conference on Supercomputing, 2016

2015
Design and Implementation of a Highly Efficient DGEMM for 64-Bit ARMv8 Multi-core Processors.
Proceedings of the 44th International Conference on Parallel Processing, 2015

