Jinhui Wei

Orcid: 0000-0002-9850-8384

According to our database, Jinhui Wei authored at least 14 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
POLAR-PIC: A Holistic Framework for Matrixized PIC with Co-Designed Compute, Layout, and Communication.
CoRR, April, 2026

ASM-SpMM: Unleashing the Potential of Arm SME for Sparse Matrix Multiplication Acceleration.
Proceedings of the 31st ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2026

Matrix‑PIC: Harnessing Matrix Outer-product for High‑Performance Particle‑in‑Cell Simulations.
Proceedings of the 21st European Conference on Computer Systems, 2026

2025
Deep reinforcement learning for dynamic strategy interchange in financial markets.
Appl. Intell., January, 2025

IasRT: Interference-Aware and SLO-Driven GPU Scheduling for Real-Time DNN Inference.
Proceedings of the 43rd IEEE International Conference on Computer Design, 2025

Ghidorah: Fast LLM Inference on Edge with Speculative Decoding and Hetero-Core Parallelism.
Proceedings of the 43rd IEEE International Conference on Computer Design, 2025

2024
Community detection in attributed networks via adaptive deep nonnegative matrix factorization.
Neural Comput. Appl., January, 2024

Liger: Interleaving Intra- and Inter-Operator Parallelism for Distributed Large Model Inference.
Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2024

GCCR: GAT-Based Category-Aware Course Recommendation.
Proceedings of the Knowledge Science, Engineering and Management, 2024

Efficient Coupling Streaming AI and Ensemble Simulations on HPC Clusters.
Proceedings of the Euro-Par 2024: Parallel Processing, 2024

Enhancing Embedding and Hierarchical Reward Shaping for Multi-Hop Reasoning with Reinforcement Learning.
Proceedings of the Advanced Data Mining and Applications - 20th International Conference, 2024

2022
Attention-based bi-directional refinement network for salient object detection.
Appl. Intell., 2022

2020
Dynamic GMMU Bypass for Address Translation in Multi-GPU Systems.
Proceedings of the Network and Parallel Computing, 2020

A Research and Design of Lightweight Convolutional Neural Networks Accelerator Based on Systolic Array Structure.
Proceedings of the ICRAI 2020: 6th International Conference on Robotics and Artificial Intelligence, 2020

