Yang Shi

Orcid: 0000-0001-5786-3171

Affiliations:
  • National University of Defense Technology, College of Computer Science and Technology, Changsha, China


According to our database, Yang Shi authored at least 27 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
FAMS: A FrAmework of Memory-Centric Mapping for DNNs on Systolic Array Accelerators.
IEEE Trans. Very Large Scale Integr. Syst., April, 2025

HWPQ: Hessian-free Weight Pruning-Quantization For LLM Compression And Acceleration.
CoRR, January, 2025

ESCAN: Efficient GPU sharing for cascade neural network inference.
Neural Networks, 2025

SparSynergy: Unlocking Flexible and Efficient DNN Acceleration Through Multi-Level Sparsity.
Proceedings of the Design, Automation & Test in Europe Conference, 2025

WinAcc: Window-based Acceleration of Neural Networks Using Block Floating Point.
Proceedings of the Design, Automation & Test in Europe Conference, 2025

2024
Optimizing VLIW Instruction Scheduling via a Two-Dimensional Constrained Dynamic Programming.
ACM Trans. Design Autom. Electr. Syst., 2024

ESEN: Efficient GPU sharing of Ensemble Neural Networks.
Neurocomputing, 2024

HyFiSS: A Hybrid Fidelity Stall-Aware Simulator for GPGPUs.
Proceedings of the 57th IEEE/ACM International Symposium on Microarchitecture, 2024

MAP-SIM: A Performance Model for Shared-Memory Heterogeneous Systems with Mapping Awareness.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2024

2023
Releasing the Potential of Tensor Core for Unstructured SpMM using Tiled-CSR Format.
Proceedings of the 41st IEEE International Conference on Computer Design, 2023

Automatic End-to-End Joint Optimization for Kernel Compilation on DSPs.
Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

2022
Light: A Component Enhances Faster and More Accurate Traffic Measurement.
Proceedings of the IEEE International Conference on Communications, 2022

CORF: Bridging the Gap of Complex Operator Fusion for Faster DNN Inference.
Proceedings of the 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, 2022

Exploring ILP for VLIW Architecture by Quantified Modeling and Dynamic Programming-Based Instruction Scheduling.
Proceedings of the 27th Asia and South Pacific Design Automation Conference, 2022

2021
Automatic mapping and code optimization for OpenCL kernels on FT-matrix architecture (WIP paper).
Proceedings of the LCTES '21: 22nd ACM SIGPLAN/SIGBED International Conference on Languages, 2021

sRouting: Towards a Better Flow Size Estimation Performance through Routing and Sketch Configuration.
Proceedings of the ICPP 2021: 50th International Conference on Parallel Processing, Lemont, IL, USA, August 9, 2021

2020
Incremental Deployment of Programmable Switches for Sketch-based Network Measurement.
Proceedings of the IEEE Symposium on Computers and Communications, 2020

Towards High-Efficiency Data Centers via Job-Aware Network Scheduling.
Proceedings of the ICPP 2020: 49th International Conference on Parallel Processing, 2020

2019
Metaflow: A DAG-Based Network Abstraction for Distributed Applications.
CoRR, 2019

Application-Oriented Network Scheduling With Metaflow.
IEEE Access, 2019

Interleaved Sketch: Toward Consistent Network Telemetry for Commodity Programmable Switches.
IEEE Access, 2019

KVSwitch: An In-network Load Balancer for Key-Value Stores.
Proceedings of the 2019 IEEE Symposium on Computers and Communications, 2019

Metaflow: A Better Traffic Abstraction for Distributed Applications.
Proceedings of the 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, 2019

TBSW: Time-Based Sliding Window Algorithm for Network Traffic Measurement.
Proceedings of the 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, 2019

SACC: Configuring Application-Level Cache Intelligently for In-Memory Database Based on Long Short-Term Memory.
Proceedings of the 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, 2019

SWAP: a sliding window algorithm for in-network packet measurement.
Proceedings of the 3rd International Conference on High Performance Compilation, 2019

2018
Multiple CNN-based Tasks Scheduling across Shared GPU Platform in Research and Development Scenarios.
Proceedings of the 20th IEEE International Conference on High Performance Computing and Communications; 16th IEEE International Conference on Smart City; 4th IEEE International Conference on Data Science and Systems, 2018

