Zhiquan Lai

ORCID: 0000-0002-3458-4732

According to our database, Zhiquan Lai authored at least 41 papers between 2012 and 2024.

Bibliography

2024
A Memory-Efficient Hybrid Parallel Framework for Deep Neural Network Training.
IEEE Trans. Parallel Distributed Syst., April 2024

2023
Accelerating GNN Training by Adapting Large Graphs to Distributed Heterogeneous Architectures.
IEEE Trans. Computers, December 2023

A Survey on Auto-Parallelism of Large-Scale Deep Learning Training.
IEEE Trans. Parallel Distributed Syst., August 2023

Merak: An Efficient Distributed DNN Training Framework With Automated 3D Parallelism for Giant Foundation Models.
IEEE Trans. Parallel Distributed Syst., May 2023

Hierarchical Adaptive Pooling by Capturing High-Order Dependency for Graph Representation Learning.
IEEE Trans. Knowl. Data Eng., April 2023

Compressed Collective Sparse-Sketch for Distributed Data-Parallel Training of Deep Learning Models.
IEEE J. Sel. Areas Commun., April 2023

Automated Tensor Model Parallelism with Overlapped Communication for Efficient Foundation Model Training.
CoRR, 2023

CD-Sched: An Automated Scheduling Framework for Accelerating Neural Network Training on Shared Memory CPU-DSP Platforms.
Proceedings of the 2023 International Conference on Power, 2023

Efficient Large Models Fine-tuning on Commodity Servers via Memory-balanced Pipeline Parallelism.
Proceedings of the IEEE International Conference on High Performance Computing & Communications, 2023

Rethinking the Distributed DNN Training Cluster Design from the Cost-effectiveness View.
Proceedings of the IEEE International Conference on High Performance Computing & Communications, 2023

Communication Analysis for Multidimensional Parallel Training of Large-scale DNN Models.
Proceedings of the IEEE International Conference on High Performance Computing & Communications, 2023

Auto-Divide GNN: Accelerating GNN Training with Subgraph Division.
Proceedings of the Euro-Par 2023: Parallel Processing - 29th International Conference on Parallel and Distributed Computing, Limassol, Cyprus, August 28, 2023

Prophet: Fine-grained Load Balancing for Parallel Training of Large-scale MoE Models.
Proceedings of the IEEE International Conference on Cluster Computing, 2023

2022
DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation.
CoRR, 2022

BRGraph: An efficient graph neural network training system by reusing batch data on GPU.
Concurr. Comput. Pract. Exp., 2022

Accelerating Sample-based GNN Training by Feature Caching on GPUs.
Proceedings of the 7th IEEE International Conference on Smart Cloud (SmartCloud), 2022

SCGraph: Accelerating Sample-based GNN Training by Staged Caching of Features on GPUs.
Proceedings of the IEEE Intl Conf on Parallel & Distributed Processing with Applications, 2022

EmbRace: Accelerating Sparse Communication for Distributed Training of Deep Neural Networks.
Proceedings of the 51st International Conference on Parallel Processing, 2022

S2 Reducer: High-Performance Sparse Communication to Accelerate Distributed Deep Learning.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022

AutoPipe: A Fast Pipeline Parallelism Approach with Balanced Partitioning and Micro-batch Slicing.
Proceedings of the IEEE International Conference on Cluster Computing, 2022

HPH: Hybrid Parallelism on Heterogeneous Clusters for Accelerating Large-scale DNNs Training.
Proceedings of the IEEE International Conference on Cluster Computing, 2022

2021
Coordinative Scheduling of Computation and Communication in Data-Parallel Systems.
IEEE Trans. Computers, 2021

EmbRace: Accelerating Sparse Communication for Distributed Training of NLP Neural Networks.
CoRR, 2021

S2 Reducer: High-Performance Sparse Communication to Accelerate Distributed Deep Learning.
CoRR, 2021

PCGraph: Accelerating GNN Inference on Large Graphs via Partition Caching.
Proceedings of the 2021 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom), New York City, NY, USA, September 30, 2021

Accelerate Graph Neural Network Training by Reusing Batch Data on GPUs.
Proceedings of the IEEE International Performance, Computing, and Communications Conference (IPCCC), 2021

Hippie: A Data-Paralleled Pipeline Approach to Improve Memory-Efficiency and Scalability for Large DNN Training.
Proceedings of the ICPP 2021: 50th International Conference on Parallel Processing, Lemont, IL, USA, August 9, 2021

HMA: An Efficient Training Method for NLP Models.
Proceedings of the ICIAI 2021: 5th International Conference on Innovation in Artificial Intelligence, 2021

Prediction of the Cyanobacteria Coverage in Time-series Images based on Convolutional Neural Network.
Proceedings of the ICCCV 2021: 4th International Conference on Control and Computer Vision, Macau, SAR, China, August 13, 2021

2PGraph: Accelerating GNN Training over Large Graphs on GPU Clusters.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

CASQ: Accelerate Distributed Deep Learning with Sketch-Based Gradient Quantization.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

2020
ADMMiRNN: Training RNN with Stable Convergence via an Efficient ADMM Approach.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2020

Poster Abstract: Model Average-based Distributed Training for Sparse Deep Neural Networks.
Proceedings of the 39th IEEE Conference on Computer Communications, 2020

2019
HPDL: Towards a General Framework for High-performance Distributed Deep Learning.
Proceedings of the 39th IEEE International Conference on Distributed Computing Systems, 2019

2017
PoweRock: Power Modeling and Flexible Dynamic Power Management for Many-Core Architectures.
IEEE Syst. J., 2017

A Two-Tiered Defence of Techniques to Prevent SQL Injection Attacks.
Proceedings of the Innovative Mobile and Internet Services in Ubiquitous Computing, 2017

2015
Latency-aware DVFS for efficient power state transitions on many-core architectures.
J. Supercomput., 2015

2014
A Power Modelling Approach for Many-Core Architectures.
Proceedings of the 2014 10th International Conference on Semantics, Knowledge and Grids (SKG), 2014

Efficient DVFS to Prevent Hard Faults for Many-Core Architectures.
Proceedings of the Information and Communication Technology, 2014

Rhymes: A shared virtual memory system for non-coherent tiled many-core architectures.
Proceedings of the 20th IEEE International Conference on Parallel and Distributed Systems, 2014

2012
Mining of Attack Models in IDS Alerts from Network Backbone by a Two-stage Clustering Method.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012
