Proceedings of the 2021 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom), New York City, NY, USA, September 30, 2021

Unleashing the Low-Precision Computation Potential of Tensor Cores on GPUs.

Guangli Li

Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2021

2020

Fusion-Catalyzed Pruning for Optimizing Deep Learning on Intelligent Edge Devices.

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020

Visual field movement detection model based on low-resolution images.

Int. J. Embed. Syst., 2020

Compiler-Assisted Operator Template Library for DNN Accelerators.

Proceedings of the Network and Parallel Computing, 2020

Characterizing the I/O Pipeline in the Deployment of CNNs on Commercial Accelerators.

Proceedings of the IEEE International Conference on Parallel & Distributed Processing with Applications, 2020

Lance: efficient low-precision quantized Winograd convolution for neural networks based on graphics processing units.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Accelerating Deep Learning Inference with Cross-Layer Data Reuse on GPUs.

Proceedings of the Euro-Par 2020: Parallel Processing, 2020

2019

Cacheap: Portable and Collaborative I/O Optimization for Graph Processing.

J. Comput. Sci. Technol., 2019

ElasticActor: An Actor System with Automatic Granularity Adjustment.

Int. J. Parallel Program., 2019

Exploiting the input sparsity to accelerate deep neural networks: poster.

Proceedings of the 24th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2019

Accelerating GPU Computing at Runtime with Binary Optimization.

Guangli Li

Lei Liu

Xiaobing Feng

Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2019

XDN: Towards Efficient Inference of Residual Neural Networks on Cambricon Chips.

Proceedings of the Benchmarking, Measuring, and Optimizing, 2019

Acorns: A Framework for Accelerating Deep Neural Networks with Input Sparsity.

Proceedings of the 28th International Conference on Parallel Architectures and Compilation Techniques, 2019

2018

Background Subtraction on Depth Videos with Convolutional Neural Networks.

Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Auto-tuning Neural Network Quantization Framework for Collaborative Inference Between the Cloud and Edge.

Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2018, 2018

Fast CNN Pruning via Redundancy-Aware Training.

Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2018, 2018

2017

Redundancy checking algorithms based on parallel novel extension rule.

J. Exp. Theor. Artif. Intell., 2017

Two-Level Task Scheduling for Irregular Applications on GPU Platform.

Int. J. Parallel Program., 2017

SysMon: Monitoring Memory Behaviors via OS Approach.

Proceedings of the Advanced Parallel Processing Technologies, 2017

2016

Rethinking Memory Management in Modern Operating System: Horizontal, Vertical or Random?

IEEE Trans. Computers, 2016

Pragma Directed Shared Memory Centric Optimizations on GPUs.

J. Comput. Sci. Technol., 2016

Memos: A full hierarchy hybrid memory management framework.

Proceedings of the 34th IEEE International Conference on Computer Design, 2016

2015

WiseThrottling: a new asynchronous task scheduler for mitigating I/O bottleneck in large-scale datacenter servers.

J. Supercomput., 2015

2014

BPM/BPM+: Software-based dynamic memory partitioning mechanisms for mitigating DRAM bank-/channel-level interferences in multicore systems.

ACM Trans. Archit. Code Optim., 2014

Dynamic I/O-Aware Scheduling for Batch-Mode Applications on Chip Multiprocessor Systems of Cluster Platforms.

J. Comput. Sci. Technol., 2014

Going vertical in memory management: Handling multiplicity by multi-policy.

Proceedings of the ACM/IEEE 41st International Symposium on Computer Architecture, 2014

2013

Access Annotation for Safe Program Parallelization.

Chen Ding

Lei Liu

Proceedings of the Network and Parallel Computing - 10th IFIP International Conference, 2013

2012

A software memory partition approach for eliminating bank-level interference in multicore systems.

Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2012

2011

Safe parallel programming using dynamic dependence hints.

Proceedings of the 26th Annual ACM SIGPLAN Conference on Object-Oriented Programming, 2011

2010

Unified Parallel C for GPU Clusters: Language Extensions and Compiler Implementation.

Proceedings of the Languages and Compilers for Parallel Computing, 2010

2008

Automatic Implementation of Multi-partitioning Using Global Tiling.

Proceedings of the 14th International Conference on Parallel and Distributed Systems, 2008

Global Tiling for Communication Minimal Parallelization on Distributed Memory Systems.

Proceedings of the Euro-Par 2008, 2008

Lei Liu
