Bingqiang Wang

Orcid: 0009-0004-4964-3258

According to our database1, Bingqiang Wang authored at least 31 papers between 2003 and 2025.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2025
NM-SpMM: Accelerating Matrix Multiplication Using N:M Sparsity with GPGPU.
CoRR, March, 2025

Accelerating Model Training on Ascend Chips: An Industrial System for Profiling, Analysis and Optimization.
Proceedings of the 2025 USENIX Annual Technical Conference, 2025

NM-SpMM: Accelerating Matrix Multiplication Using N: M Sparsity with GPGPU.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2025

AUE: A Normalized Energy Efficiency Metric for AI Servers Under LLM Workloads.
Proceedings of the 31th IEEE International Conference on Parallel and Distributed Systems, 2025

Improving the Energy Efficiency of AI Clusters Through Variability-Aware Frequency Scaling and Task Allocation.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2025

ParaCoder: Parallel Code Generation with Large Language Model.
Proceedings of the 1st FastCode Programming Challenge, 2025

Using Analytical Performance/Power Model and Fine-Grained DVFS to Enhance AI Accelerator Energy Efficiency.
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025

2024
DSO: A GPU Energy Efficiency Optimizer by Fusing Dynamic and Static Information.
Proceedings of the 32nd IEEE/ACM International Symposium on Quality of Service, 2024

Improving GPU Energy Efficiency through an Application-transparent Frequency Scaling Policy with Performance Assurance.
Proceedings of the Nineteenth European Conference on Computer Systems, 2024

2018
mSNP: A Massively Parallel Algorithm for Large-Scale SNP Detection.
IEEE Trans. Parallel Distributed Syst., 2018

K-mer Counting for Genomic Big Data.
Proceedings of the Big Data - BigData 2018, 2018

2017
异构集群上的宏基因组聚类优化 (Accelerating Gene Clustering on Heterogeneous Clusters).
计算机科学, 2017

Bloomfish: A Highly Scalable Distributed K-mer Counting Framework.
Proceedings of the 23rd IEEE International Conference on Parallel and Distributed Systems, 2017

Scalable Assembly for Massive Genomic Graphs.
Proceedings of the 17th IEEE/ACM International Symposium on Cluster, 2017

2016
SWAP-Assembler 2: Optimization of De Novo Genome Assembler at Extreme Scale.
Proceedings of the 45th International Conference on Parallel Processing, 2016

2015
Large-Scale Neo-Heterogeneous Programming and Optimization of SNP Detection on Tianhe-2.
Proceedings of the High Performance Computing - 30th International Conference, 2015

The Challenge of Scaling Genome Big Data Analysis Software on TH-2 Supercomputer.
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015

Accelerating large-scale biological database search on Xeon Phi-based neo-heterogeneous architectures.
Proceedings of the 2015 IEEE International Conference on Bioinformatics and Biomedicine, 2015

2014
SWAP-Assembler: scalable and efficient genome assembly towards thousands of cores.
BMC Bioinform., 2014

mBWA: A Massively Parallel Sequence Reads Aligner.
Proceedings of the 8th International Conference on Practical Applications of Computational Biology & Bioinformatics, 2014

Multi-task Parallel Algorithm for DSRC.
Proceedings of the Second International Conference on Information Technology and Quantitative Management, 2014

2013
GPU-accelerated adaptive compression framework for genomics data.
Proceedings of the 2013 IEEE International Conference on Big Data (IEEE BigData 2013), 2013

Improved Parallel Processing of Massive De Bruijn Graph for Genome Assembly.
Proceedings of the Web Technologies and Applications - 15th Asia-Pacific Web Conference, 2013

GPU-Accelerated Bidirected De Bruijn Graph Construction for Genome Assembly.
Proceedings of the Web Technologies and Applications - 15th Asia-Pacific Web Conference, 2013

2012
Gene set analysis in the cloud.
Bioinform., 2012

SOAP3: ultra-fast GPU-based parallel alignment tool for short reads.
Bioinform., 2012

Accelerating minor allele frequency computation with graphics processors.
Proceedings of the 1st International Workshop on Big Data, 2012

Improving Data Processing Time with Access Sequence Prediction.
Proceedings of the 18th IEEE International Conference on Parallel and Distributed Systems, 2012

2011
GSNP: A DNA Single-Nucleotide Polymorphism Detection System with GPU Acceleration.
Proceedings of the International Conference on Parallel Processing, 2011

2009
Formal Semantic of Component-Based Reconfiguration Router Unit's Software Model.
Proceedings of the International Forum on Information Technology and Applications, 2009

2003
The System for Computing of Molecule Structure on the Computational Grid Environment.
Proceedings of the Grid and Cooperative Computing, Second International Workshop, 2003


  Loading...