Shaoqiang Lu

Orcid: 0009-0002-8624-8666

According to our database1, Shaoqiang Lu authored at least 11 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Harnessing Spatiotemporal Redundancy for Fast Diffusion Models on FPGA.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2026

DFVG: A Heterogeneous Architecture for Speculative Decoding with Draft-on-FPGA and Verify-on-GPU.
Proceedings of the 31st ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2026

dLLM-OPU: An FPGA Overlay Processor for Accelerated Diffusion Large Language Models.
Proceedings of the 31st Asia and South Pacific Design Automation Conference, 2026

Mixture-of-Trees: Learning to Select and Weigh Reasoning Paths for Efficient LLM Inference.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
MCoreOPU: An FPGA-based Multi-Core Overlay Processor for Transformer-based Models.
ACM Trans. Reconfigurable Technol. Syst., September, 2025

MoE-OPU: An FPGA Overlay Processor Leveraging Expert Parallelism for MoE-based Large Language Models.
Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2025

C2OPU: Hybrid Compute-in-Memory and Coarse-Grained Reconfigurable Architecture for Overlay Processing of Transformers.
Proceedings of the 33rd IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2025

MambaOPU: An FPGA Overlay Processor for State-space-duality-based Mamba Models.
Proceedings of the 62nd ACM/IEEE Design Automation Conference, 2025

METAL: A Memory-Efficient Transformer Architecture for Long-Context Inference on FPGA.
Proceedings of the 36th IEEE International Conference on Application-specific Systems, 2025

2024
ChatOPU: An FPGA-based Overlay Processor for Large Language Models with Unstructured Sparsity.
Proceedings of the 43rd IEEE/ACM International Conference on Computer-Aided Design, 2024

2023
Token Packing for Transformers with Variable-Length Inputs.
Proceedings of the 33rd International Conference on Field-Programmable Logic and Applications, 2023


  Loading...