Hongwu Peng

ORCID: 0000-0003-2025-2195

According to our database, Hongwu Peng authored at least 29 papers between 2021 and 2024.

Bibliography

2024
Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate.
CoRR, 2024

Zero-Space Cost Fault Tolerance for Transformer-based Language Models on ReRAM.
CoRR, 2024

Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads.
CoRR, 2024

Evaluating Emerging AI/ML Accelerators: IPU, RDU, and NVIDIA/AMD GPUs.
Proceedings of the Companion of the 15th ACM/SPEC International Conference on Performance Engineering, 2024

MaxK-GNN: Extremely Fast GPU Kernel Design for Accelerating Graph Neural Networks Training.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023
MaxK-GNN: Towards Theoretical Speed Limits for Accelerating Graph Neural Networks Training.
CoRR, 2023

Advanced Large Language Model (LLM)-Driven Verilog Development: Enhancing Power, Performance, and Area Optimization in Code Synthesis.
CoRR, 2023

RRNet: Towards ReLU-Reduced Neural Network for Two-party Computation Based Private Inference.
CoRR, 2023

LinGCN: Structural Linearized Graph Convolutional Network for Homomorphically Encrypted Inference.
Advances in Neural Information Processing Systems 36 (NeurIPS), 2023

AQ2PNN: Enabling Two-party Privacy-Preserving Deep Neural Network Inference with Adaptive Quantization.
Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture, 2023

AutoReP: Automatic ReLU Replacement for Fast Private Network Inference.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Accel-GCN: High-Performance GPU Accelerator Design for Graph Convolution Networks.
Proceedings of the IEEE/ACM International Conference on Computer Aided Design, 2023

PASNet: Polynomial Architecture Search Framework for Two-party Computation-based Secure Neural Network Deployment.
Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

Dynamic Sparse Training via Balancing the Exploration-Exploitation Trade-off.
Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

2022
Aerial Manipulation Using a Novel Unmanned Aerial Vehicle Cyber-Physical System.
CoRR, 2022

An Automatic and Efficient BERT Pruning for Edge AI Systems.
Proceedings of the 23rd International Symposium on Quality Electronic Design, 2022

Towards Sparsification of Graph Neural Networks.
Proceedings of the IEEE 40th International Conference on Computer Design, 2022

CoDG-ReRAM: An Algorithm-Hardware Co-design to Accelerate Semi-Structured GNNs on ReRAM.
Proceedings of the IEEE 40th International Conference on Computer Design, 2022

A length adaptive algorithm-hardware co-design of transformer on FPGA through sparse attention and dynamic pipelining.
Proceedings of the 59th ACM/IEEE Design Automation Conference, 2022

2021
Detecting Gender Bias in Transformer-based Models: A Case Study on BERT.
CoRR, 2021

Optimizing FPGA-based Accelerator Design for Large-Scale Molecular Similarity Search.
CoRR, 2021

Binary Complex Neural Network Acceleration on FPGA.
CoRR, 2021

Improving DNN Fault Tolerance using Weight Pruning and Differential Crossbar Mapping for ReRAM-based Edge AI.
Proceedings of the 22nd International Symposium on Quality Electronic Design, 2021

Accelerating Transformer-based Deep Learning Models on FPGAs using Column Balanced Block Pruning.
Proceedings of the 22nd International Symposium on Quality Electronic Design, 2021

Accelerating Framework of Transformer by Hardware Design and Model Compression Co-Optimization.
Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2021

Optimizing FPGA-based Accelerator Design for Large-Scale Molecular Similarity Search (Special Session Paper).
Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2021

Accommodating Transformer onto FPGA: Coupling the Balanced Model Compression and FPGA-Implementation Optimization.
Proceedings of the Great Lakes Symposium on VLSI (GLSVLSI), 2021

HMC-TRAN: A Tensor-core Inspired Hierarchical Model Compression for Transformer-based DNNs on GPU.
Proceedings of the Great Lakes Symposium on VLSI (GLSVLSI), 2021

Binary Complex Neural Network Acceleration on FPGA (Invited Paper).
Proceedings of the 32nd IEEE International Conference on Application-specific Systems, Architectures and Processors, 2021

