Zizhong Chen

Orcid: 0000-0003-2578-4940

According to our database1, Zizhong Chen authored at least 159 papers between 2003 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
POSTER: Optimizing Collective Communications with Error-bounded Lossy Compression for GPU Clusters.
Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2024

2023
Toward Feature-Preserving Vector Field Compression.
IEEE Trans. Vis. Comput. Graph., December, 2023

FT-BLAS: A Fault Tolerant High Performance BLAS Implementation on x86 CPUs.
IEEE Trans. Parallel Distributed Syst., December, 2023

GRRS: Accurate and Efficient Neighborhood Rough Set for Feature Selection.
IEEE Trans. Knowl. Data Eng., September, 2023

GreenMD: Energy-efficient Matrix Decomposition on Heterogeneous Multi-GPU Systems.
ACM Trans. Parallel Comput., June, 2023

SZ3: A Modular Framework for Composing Prediction-Based Error-Bounded Lossy Compressors.
IEEE Trans. Big Data, April, 2023

cuSZ-I: High-Fidelity Error-Bounded Lossy Compression for Scientific Data on GPUs.
CoRR, 2023

High-performance Effective Scientific Error-bounded Lossy Compression with Auto-tuned Multi-component Interpolation.
CoRR, 2023

SRN-SZ: Deep Leaning-Based Scientific Error-bounded Lossy Compression with Super-resolution Neural Networks.
CoRR, 2023

gZCCL: Compression-Accelerated Collective Communication Framework for GPU Clusters.
CoRR, 2023

C-Coll: Introducing Error-bounded Lossy Compression into MPI Collectives.
CoRR, 2023

Improving Energy Saving of One-Sided Matrix Decompositions on CPU-GPU Heterogeneous Systems.
Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2023

ByteTransformer: A High-Performance Transformer Boosted for Variable-Length Inputs.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs.
Proceedings of the 37th International Conference on Supercomputing, 2023

FAZ: A flexible auto-tuned modular error-bounded compression framework for scientific data.
Proceedings of the 37th International Conference on Supercomputing, 2023

GBCloud: a Robust Cloud Model for Processing Multi-dimensional Data Using Granular-ball Computing.
Proceedings of the 8th International Conference on Mathematics and Artificial Intelligence, 2023

FT-GEMM: A Fault Tolerant High Performance GEMM Implementation on x86 CPUs.
Proceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing, 2023

Accelerating MPI Collectives with Process-in-Process-based Multi-object Techniques.
Proceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing, 2023

PiP-MColl: Process-in-Process-based Multi-object MPI Collectives.
Proceedings of the IEEE International Conference on Cluster Computing, 2023

KF K-means: A High Performance K-means Implementation using Kernel Fusion.
Proceedings of the IEEE International Conference on Big Data, 2023

Scientific Error-bounded Lossy Compression with Super-resolution Neural Networks.
Proceedings of the IEEE International Conference on Big Data, 2023

Exploring Wavelet Transform Usages for Error-bounded Scientific Data Compression.
Proceedings of the IEEE International Conference on Big Data, 2023

2022
mCRF and mRD: Two Classification Methods Based on a Novel Multiclass Label Noise Filtering Learning Framework.
IEEE Trans. Neural Networks Learn. Syst., 2022

GBNRS: A Novel Rough Set Algorithm for Fast Adaptive Attribute Reduction in Classification.
IEEE Trans. Knowl. Data Eng., 2022

Random Space Division Sampling for Label-Noisy Classification or Imbalanced Classification.
IEEE Trans. Cybern., 2022

Ball $k$k-Means: Fast Adaptive Clustering With No Bounds.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

An adaptive and general model for label noise detection using relative probabilistic density.
Knowl. Based Syst., 2022

Double embedding and bidirectional sentiment dependence detector for aspect sentiment triplet extraction.
Knowl. Based Syst., 2022

Multi-granularity Association Learning Framework for on-the-fly Fine-Grained Sketch-based Image Retrieval.
CoRR, 2022

An Efficient and Accurate Rough Set for Feature Selection, Classification and Knowledge Representation.
CoRR, 2022

Dynamic Quality Metric Oriented Error Bounded Lossy Compression for Scientific Datasets.
Proceedings of the SC22: International Conference for High Performance Computing, 2022

Accelerating Encrypted Computing on Intel GPUs.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

MDZ: An Efficient Error-bounded Lossy Compressor for Molecular Dynamics.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Efficient Soft-Error Detection for Low-precision Deep Learning Recommendation Models.
Proceedings of the IEEE International Conference on Big Data, 2022

2021
FT-CNN: Algorithm-Based Fault Tolerance for Convolutional Neural Networks.
IEEE Trans. Parallel Distributed Syst., 2021

RSMOTE: A self-adaptive robust SMOTE for imbalanced problems with label noise.
Inf. Sci., 2021

Resilient error-bounded lossy compressor for data transfer.
Proceedings of the International Conference for High Performance Computing, 2021

Locality-aware Thread Block Design in Single and Multi-GPU Graph Processing.
Proceedings of the IEEE International Conference on Networking, Architecture and Storage, 2021

A Fast Attribute Reduction Algorithm of Neighborhood Rough Set.
Proceedings of the 13th International Conference on Knowledge and Smart Technology, 2021

Fast k-means Clustering Based on the Neighbor Information.
Proceedings of the ISEEIE 2021: International Symposium on Electrical, Electronics and Information Engineering, Seoul Republic of Korea, February 19, 2021

An Improved Genetic Algorithm Based on k-means.
Proceedings of the ISEEIE 2021: International Symposium on Electrical, Electronics and Information Engineering, Seoul Republic of Korea, February 19, 2021

FT-BLAS: a high performance BLAS implementation with online fault tolerance.
Proceedings of the ICS '21: 2021 International Conference on Supercomputing, 2021

Optimizing Error-Bounded Lossy Compression for Scientific Data by Dynamic Spline Interpolation.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

Daps: A Dynamic Asynchronous Progress Stealing Model for MPI Communication.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

Exploring Autoencoder-based Error-bounded Compression for Scientific Data.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

Improving Lossy Compression for SZ by Exploring the Best-Fit Lossless Compression Techniques.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

2020
Weighted pseudometric approximation of 2-dimensional fuzzy numbers by fuzzy 2-cell prismoid numbers preserving the centroid.
Fuzzy Sets Syst., 2020

SDC Resilient Error-bounded Lossy Compressor.
CoRR, 2020

Ball k-means.
CoRR, 2020

Algorithm-Based Fault Tolerance for Convolutional Neural Networks.
CoRR, 2020

Normalization of Input-output Shared Embeddings in Text Generation Models.
CoRR, 2020

CAB-MPI: exploring interprocess work-stealing towards balanced MPI communication.
Proceedings of the International Conference for High Performance Computing, 2020

SAOU: safe adaptive overclocking and undervolting for energy-efficient GPU computing.
Proceedings of the ISLPED '20: ACM/IEEE International Symposium on Low Power Electronics and Design, 2020

Significantly Improving Lossy Compression for HPC Datasets with Second-Order Prediction and Parameter Optimization.
Proceedings of the HPDC '20: The 29th International Symposium on High-Performance Parallel and Distributed Computing, 2020

Towards End-to-end SDC Detection for HPC Applications Equipped with Lossy Compression.
Proceedings of the IEEE International Conference on Cluster Computing, 2020

SDRBench: Scientific Data Reduction Benchmark for Lossy Compressors.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

Toward Feature-Preserving 2D and 3D Vector Field Compression.
Proceedings of the 2020 IEEE Pacific Visualization Symposium, 2020

2019
Optimizing Lossy Compression Rate-Distortion from Automatic Online Selection between SZ and ZFP.
IEEE Trans. Parallel Distributed Syst., 2019

Complete Random Forest Based Class Noise Filtering Learning for Improving the Generalizability of Classifiers.
IEEE Trans. Knowl. Data Eng., 2019

Z-checker: A framework for assessing lossy compression of scientific data.
Int. J. High Perform. Comput. Appl., 2019

Transferring Ensemble Representations Using Deep Convolutional Neural Networks for Small-Scale Image Classification.
IEEE Access, 2019

Significantly improving lossy compression quality based on an optimized hybrid prediction model.
Proceedings of the International Conference for High Performance Computing, 2019

FT-iSort: efficient fault tolerance for introsort.
Proceedings of the International Conference for High Performance Computing, 2019

GreenMM: energy efficient GPU matrix multiplication through undervolting.
Proceedings of the ACM International Conference on Supercomputing, 2019

TSM2: optimizing tall-and-skinny matrix-matrix multiplication on GPUs.
Proceedings of the ACM International Conference on Supercomputing, 2019

A Multi-granularity Genetic Algorithm.
Proceedings of the 2019 IEEE International Conference on Big Knowledge, 2019

Improving Performance of Data Dumping with Lossy Compression for Scientific Simulation.
Proceedings of the 2019 IEEE International Conference on Cluster Computing, 2019

Data Transfer between Scientific Facilities - Bottleneck Analysis, Insights and Optimizations.
Proceedings of the 19th IEEE/ACM International Symposium on Cluster, 2019

Efficient concolic testing of MPI applications.
Proceedings of the 28th International Conference on Compiler Construction, 2019

2018
Scaling Up Parallel Computation of Tiled QR Factorizations by a Distributed Scheduling Runtime System and Analytical Modeling.
Parallel Process. Lett., 2018

Fault tolerant one-sided matrix decompositions on heterogeneous systems with GPUs.
Proceedings of the International Conference for High Performance Computing, 2018

COMPI: Concolic Testing for MPI Applications.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

Non-intrusively Avoiding Scaling Problems in and out of MPI Collectives.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

BeeFlow: A Workflow Management System for In Situ Processing across HPC and Cloud Systems.
Proceedings of the 38th IEEE International Conference on Distributed Computing Systems, 2018

The k-Means Forest Classifier for High Dimensional Data.
Proceedings of the 2018 IEEE International Conference on Big Knowledge, 2018

Improving performance of iterative methods by lossy checkponting.
Proceedings of the 27th International Symposium on High-Performance Parallel and Distributed Computing, 2018

Performance analysis and optimization of in-situ integration of simulation with data analysis: zipping applications up.
Proceedings of the 27th International Symposium on High-Performance Parallel and Distributed Computing, 2018

Fixed-PSNR Lossy Compression for Scientific Data.
Proceedings of the IEEE International Conference on Cluster Computing, 2018

An Efficient Transformation Scheme for Lossy Data Compression with Point-Wise Relative Error Bound.
Proceedings of the IEEE International Conference on Cluster Computing, 2018

Error-Controlled Lossy Compression Optimized for High Compression Ratios of Scientific Datasets.
Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

Optimizing Lossy Compression with Adjacent Snapshots for N-body Simulation Data.
Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

Build and Execution Environment (BEE): an Encapsulated Environment Enabling HPC Applications Running Everywhere.
Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

2017
Docker-Enabled Build and Execution Environment (BEE): an Encapsulated Environment Enabling HPC Applications Running Everywhere.
CoRR, 2017

Exploration of Pattern-Matching Techniques for Lossy Compression on Cosmology Simulation Data Sets.
Proceedings of the High Performance Computing, 2017

Correcting soft errors online in fast fourier transform.
Proceedings of the International Conference for High Performance Computing, 2017

Parastack: efficient hang detection for MPI programs at large scale.
Proceedings of the International Conference for High Performance Computing, 2017

Silent Data Corruption Resilient Two-sided Matrix Factorizations.
Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2017

Significantly Improving Lossy Compression for Scientific Data Sets Based on Multidimensional Prediction and Error-Controlled Quantization.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

HIPS Keynote.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

In-depth exploration of single-snapshot lossy compression techniques for N-body simulations.
Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017

2016
Scalable Energy Efficiency with Resilience for High Performance Computing Systems: A Quantitative Methodology.
ACM Trans. Archit. Code Optim., 2016

GreenLA: green linear algebra software for GPU-accelerated heterogeneous computing.
Proceedings of the International Conference for High Performance Computing, 2016

GPU-ABFT: Optimizing Algorithm-Based Fault Tolerance for Heterogeneous Systems with GPUs.
Proceedings of the IEEE International Conference on Networking, 2016

Online Algorithm-Based Fault Tolerance for Cholesky Decomposition on Heterogeneous Systems with GPUs.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

suCAQR: A Simplified Communication-Avoiding QR Factorization Solver Using the TBLAS Framework.
Proceedings of the 22nd IEEE International Conference on Parallel and Distributed Systems, 2016

Algorithm-Directed Data Placement in Explicitly Managed Non-Volatile Memory.
Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing, 2016

Towards Practical Algorithm Based Fault Tolerance in Dense Linear Algebra.
Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing, 2016

New-Sum: A Novel Online ABFT Scheme For General Iterative Methods.
Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing, 2016

2015
Fail-Stop Failure Algorithm-Based Fault Tolerance for Cholesky Decomposition.
IEEE Trans. Parallel Distributed Syst., 2015

Optimising MPI tree-based communication for NUMA architectures.
Int. J. Auton. Adapt. Commun. Syst., 2015

Slow Down or Halt: Saving the Optimal Energy for Scalable HPC Systems.
Proceedings of the 6th ACM/SPEC International Conference on Performance Engineering, Austin, TX, USA, January 31, 2015

Investigating the Interplay between Energy Efficiency and Resilience in High Performance Computing.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

Simulated Annealing to Generate Numerically Stable Real Number Error Correction Codes.
Proceedings of the 17th IEEE International Conference on High Performance Computing and Communications, 2015

Cholesky Factorization on Heterogeneous CPU and GPU Systems.
Proceedings of the Ninth International Conference on Frontier of Computer Science and Technology, 2015

2014
A survey of power and energy efficient techniques for high performance numerical linear algebra operations.
Parallel Comput., 2014

TX: algorithmic energy saving for distributed dense matrix factorizations.
Proceedings of the 5th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2014

TOUGH2-PETSc: A Parallel Solver for TOUGH2.
Proceedings of the 15th International Conference on Parallel and Distributed Computing, 2014

Extending checksum-based ABFT to tolerate soft errors online in iterative methods.
Proceedings of the 20th IEEE International Conference on Parallel and Distributed Systems, 2014

HP-DAEMON: High Performance Distributed Adaptive Energy-efficient Matrix-multiplicatiON.
Proceedings of the International Conference on Computational Science, 2014

FT-ScaLAPACK: correcting soft errors on-line for ScaLAPACK cholesky, QR, and LU factorization routines.
Proceedings of the 23rd International Symposium on High-Performance Parallel and Distributed Computing, 2014

2013
Multilevel Diskless Checkpointing.
IEEE Trans. Computers, 2013

On-line soft error correction in matrix-matrix multiplication.
J. Comput. Sci., 2013

Rethinking algorithm-based fault tolerance with a cooperative software-hardware approach.
Proceedings of the International Conference for High Performance Computing, 2013

Online-ABFT: an online algorithm based fault tolerance scheme for soft error detection in iterative methods.
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2013

A2E: Adaptively aggressive energy efficient DVFS scheduling for data intensive applications.
Proceedings of the IEEE 32nd International Performance Computing and Communications Conference, 2013

Correcting soft errors online in LU factorization.
Proceedings of the 22nd International Symposium on High-Performance Parallel and Distributed Computing, 2013

Energy-Efficient Scheduling for Multicore Systems with Bounded Resources.
Proceedings of the 2013 IEEE International Conference on Green Computing and Communications (GreenCom) and IEEE Internet of Things (iThings) and IEEE Cyber, 2013

Power and energy characteristics of MapReduce data movements.
Proceedings of the International Green Computing Conference, 2013

Improving performance and energy efficiency of matrix multiplication via pipeline broadcast.
Proceedings of the 2013 IEEE International Conference on Cluster Computing, 2013

2012
Reduced Data Communication for Parallel CMA-ES for REACTS.
Proceedings of the 20th Euromicro International Conference on Parallel, 2012

eTune: A Power Analysis Framework for Data-Intensive Computing.
Proceedings of the 41st International Conference on Parallel Processing Workshops, 2012

Energy Efficient Parallel Matrix-Matrix Multiplication for DVFS-enabled Clusters.
Proceedings of the 41st International Conference on Parallel Processing Workshops, 2012

Runtime Optimization of Broadcast Communications Using Dynamic Network Topology Information from MPI.
Proceedings of the 14th IEEE International Conference on High Performance Computing and Communication & 9th IEEE International Conference on Embedded Software and Systems, 2012

Energy consumption analysis of parallel sorting algorithms running on multicore systems.
Proceedings of the 2012 International Green Computing Conference, 2012

Optimizing Process-to-Core Mappings for Application Level Multi-dimensional MPI Communications.
Proceedings of the 2012 IEEE International Conference on Cluster Computing, 2012

2011
Fault tolerant matrix-matrix multiplication: correcting soft errors on-line.
Proceedings of the second workshop on Scalable algorithms for large-scale systems, 2011

Algorithm-based recovery for HPL.
Proceedings of the 16th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2011

Matrix Multiplication on GPUs with On-Line Fault Tolerance.
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2011

Algorithm-Based Recovery for Newton's Method without Checkpointing.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

High performance linpack benchmark: a fault tolerant implementation without checkpointing.
Proceedings of the 25th International Conference on Supercomputing, 2011, Tucson, AZ, USA, May 31, 2011

SRC: soft error detection and recovery for high performance linpack.
Proceedings of the 25th International Conference on Supercomputing, 2011, Tucson, AZ, USA, May 31, 2011

Optimizing Process-to-Core Mappings for Two Dimensional Broadcast/Reduce on Multicore Architectures.
Proceedings of the International Conference on Parallel Processing, 2011

Algorithm-based recovery for iterative methods without checkpointing.
Proceedings of the 20th ACM International Symposium on High Performance Distributed Computing, 2011

2010
Adaptive Checkpointing (Invited Paper).
J. Commun., 2010

Highly scalable checkpointing for exascale computing.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Algorithmic Cholesky factorization fault recovery.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Fault tolerant linear algebra: Recovering from fail-stop failures without checkpointing.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Constructing numerically stable real number codes using evolutionary computation.
Proceedings of the Genetic and Evolutionary Computation Conference, 2010

2009
Highly Scalable Self-Healing Algorithms for High Performance Scientific Computing.
IEEE Trans. Computers, 2009

Pipelining parallel image compositing and delivery for efficient remote visualization.
J. Parallel Distributed Comput., 2009

Optimal real number codes for fault tolerant matrix operations.
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2009

N-Level Diskless Checkpointing.
Proceedings of the 11th IEEE International Conference on High Performance Computing and Communications, 2009

2008
Algorithm-Based Fault Tolerance for Fail-Stop Failures.
IEEE Trans. Parallel Distributed Syst., 2008

Performance of MPI broadcast algorithms.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Extending algorithm-based fault tolerance to tolerate fail-stop failures in high performance distributed environments.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

A Scalable Checkpoint Encoding Algorithm for Diskless Checkpointing.
Proceedings of the 11th IEEE High Assurance Systems Engineering Symposium, 2008

2007
Recovery Patterns for Iterative Methods in a Parallel Unstable Environment.
SIAM J. Sci. Comput., 2007

An efficient packet loss recovery methodology for video-over-IP.
Proceedings of the Signal and Image Processing (SIP 2007), 2007

Self Adaptive Application Level Fault Tolerance for Parallel and Distributed Computing.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

An Efficient Audio-Video Synchronization Methodology.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

2006
Self-adapting numerical software (SANS) effort.
IBM J. Res. Dev., 2006

Algorithm-based checkpoint-free fault tolerance for parallel matrix computations on volatile resources.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

2005
Condition Numbers of Gaussian Random Matrices.
SIAM J. Matrix Anal. Appl., 2005

Process Fault Tolerance: Semantics, Design and Applications for High Performance Computing.
Int. J. High Perform. Comput. Appl., 2005

Fault tolerant high performance computing by a coding approach.
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2005

Numerically Stable Real Number Codes Based on Random Matrices.
Proceedings of the Computational Science, 2005

2004
The LAPACK for Clusters Project: An Example of Self Adapting Numerical Software.
Proceedings of the 37th Hawaii International Conference on System Sciences (HICSS-37 2004), 2004

2003
Self-adapting software for numerical linear algebra and LAPACK for clusters.
Parallel Comput., 2003

Self-Adapting Software for Numerical Linear Algebra Library Routines on Clusters.
Proceedings of the Computational Science - ICCS 2003, 2003


  Loading...