Yuyang Jin

Orcid: 0000-0003-2358-3395

Affiliations:
  • Tsinghua University, Beijing, China


According to our database1, Yuyang Jin authored at least 14 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Leveraging Graph Analysis to Pinpoint Root Causes of Scalability Issues for Parallel Applications.
IEEE Trans. Parallel Distributed Syst., February, 2025

mTuner: Accelerating Parameter-Efficient Fine-Tuning on Multi-GPU Servers with Elastic Tensor.
Proceedings of the 2025 USENIX Annual Technical Conference, 2025

FlashTensor: Optimizing Tensor Programs by Leveraging Fine-grained Tensor Property.
Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2025

2024
Efficient Inference for Pruned CNN Models on Mobile Devices With Holistic Sparsity Alignment.
IEEE Trans. Parallel Distributed Syst., November, 2024

Graph-Centric Performance Analysis for Large-Scale Parallel Applications.
IEEE Trans. Parallel Distributed Syst., July, 2024

PUZZLE: Efficiently Aligning Large Language Models through Light-Weight Context Switch.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024

BoostN: Optimizing Imbalanced Neighborhood Communication on Homogeneous Many-Core System.
Proceedings of the 53rd International Conference on Parallel Processing, 2024

WiseGraph: Optimizing GNN with Joint Workload Partition of Graph and Operations.
Proceedings of the Nineteenth European Conference on Computer Systems, 2024

2023
Unified Programming Models for Heterogeneous High-Performance Computers.
J. Comput. Sci. Technol., February, 2023

2022
Detecting Performance Variance for Parallel Applications Without Source Code.
IEEE Trans. Parallel Distributed Syst., 2022

Vapro: performance variance detection and diagnosis for production-run parallel applications.
Proceedings of the PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2, 2022

PerFlow: a domain specific framework for automatic performance analysis of parallel applications.
Proceedings of the PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2, 2022

2020
ScalAna: automating scaling loss detection with graph analysis.
Proceedings of the International Conference for High Performance Computing, 2020

Identifying scalability bottlenecks for large-scale parallel programs with graph analysis.
Proceedings of the PPoPP '20: 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2020


  Loading...