Zhen Zheng

Orcid: 0000-0002-8902-9357

According to our database1, Zhen Zheng authored at least 39 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Multi-task aided face recognition network with convolution kernel spatial collaboration.
Signal Image Video Process., June, 2024

FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design.
CoRR, 2024

2023
Magnetic Anomaly Detection Based on a Compound Tri-Stable Stochastic Resonance System.
Sensors, November, 2023

Expanding the Edge: Enabling Efficient Winograd CNN Inference With Deep Reuse on Edge Device.
IEEE Trans. Knowl. Data Eng., October, 2023

BladeDISC: Optimizing Dynamic Shape Machine Learning Workloads via Compiler Approach.
Proc. ACM Manag. Data, September, 2023

Flash-LLM: Enabling Low-Cost and Highly-Efficient Large Generative Model Inference With Unstructured Sparsity.
Proc. VLDB Endow., 2023

ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks.
CoRR, 2023

Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity.
CoRR, 2023

Auto-Parallelizing Large Models with Rhino: A Systematic Approach on Production AI Platform.
CoRR, 2023

RECom: A Compiler Approach to Accelerating Recommendation Model Inference with Massive Embedding Columns.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022
Optimizing DNN Compilation for Distributed Training With Joint OP and Tensor Fusion.
IEEE Trans. Parallel Distributed Syst., 2022

Single-Stage Adaptive Multi-Scale Point Cloud Noise Filtering Algorithm Based on Feature Information.
Remote. Sens., 2022

Research on localisation algorithm of large irregular workpiece for industrial robot.
Int. J. Comput. Sci. Math., 2022

DREW: Efficient Winograd CNN Inference with Deep Reuse.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Whale: Efficient Giant Model Training over Heterogeneous GPUs.
Proceedings of the 2022 USENIX Annual Technical Conference, 2022

AStitch: enabling a new multi-dimensional optimization space for memory-intensive ML training and inference on modern SIMT architectures.
Proceedings of the ASPLOS '22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022, 2022

2021
Research on point cloud generation algorithm of virtual depth camera.
Int. J. Wirel. Mob. Comput., 2021

Exploring deep reuse in winograd CNN inference.
Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021

Understanding and bridging the gaps in current GNN performance optimizations.
Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021

DAPPLE: a pipelined data parallel approach for training large models.
Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021

DISC: A Dynamic Shape Compiler for Machine Learning Workloads.
Proceedings of the EuroMLSys@EuroSys 2021, 2021

2020
FusionStitching: Boosting Memory Intensive Computations for Deep Learning Workloads.
CoRR, 2020

Auto-MAP: A DQN Framework for Exploring Distributed Execution Plans for DNN Workloads.
CoRR, 2020

Adaptive Edge Detection Algorithm Based on Improved Grey Prediction Model.
IEEE Access, 2020

Optimizing distributed training deployment in heterogeneous GPU clusters.
Proceedings of the CoNEXT '20: The 16th International Conference on emerging Networking EXperiments and Technologies, 2020

GOPipe: A Granularity-Oblivious Programming Framework for Pipelined Stencil Executions on GPU.
Proceedings of the PACT '20: International Conference on Parallel Architectures and Compilation Techniques, 2020

2019
The construction of fuzzy concept lattice based on weighted complete graph.
J. Intell. Fuzzy Syst., 2019

Adaptive Edge Detection Algorithm Based on Grey Entropy Theory and Textural Features.
IEEE Access, 2019

HiWayLib: A Software Framework for Enabling High Performance Communications for Heterogeneous Pipeline Computations.
Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, 2019

2018
Design of Combined Receiving Lens for Panoramic Laser Fuze Detection.
Proceedings of the IEEE International Conference on Information and Automation, 2018

The Influence of Cambered Optical Fairing on the Light Beam of Underwater Laser Fuze.
Proceedings of the IEEE International Conference on Information and Automation, 2018

Optimization design of multi-sided reflecting prism in laser line scanning imaging system.
Proceedings of the IEEE International Conference on Information and Automation, 2018

2017
Versapipe: a versatile programming framework for pipelined computing on GPU.
Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture, 2017

2016
Refactoring and optimizing the community atmosphere model (CAM) on the sunway taihulight supercomputer.
Proceedings of the International Conference for High Performance Computing, 2016

2014
Holistic Modeling and Performance Evaluation for Converged Network-Cloud Service Provisioning.
Proceedings of the 28th IEEE International Conference on Advanced Information Networking and Applications, 2014

2013
MIMO channel estimation based on distributed compressed sensing for LTE-advanced.
Proceedings of the 9th International Conference on Information, 2013

2012
Hybrid Synchronization of Two Delayed Systems with Uncertain Parameters.
Proceedings of the Advances in Neural Networks - ISNN 2012, 2012

A pilot study on simulation of Ultrasound from MRI using ray-based model.
Proceedings of 2012 IEEE-EMBS International Conference on Biomedical and Health Informatics, 2012

2006
Illumination Variation in Images in Independent Component Analysis and Principal Component Analysis Subspaces.
Proceedings of the Sixth International Conference on Intelligent Systems Design and Applications (ISDA 2006), 2006


  Loading...