Haipeng Jia

Orcid: 0000-0002-9855-5367

According to our database, Haipeng Jia authored at least 36 papers between 2006 and 2024.

Bibliography

2024
Heter-Train: A Distributed Training Framework Based on Semi-Asynchronous Parallel Mechanism for Heterogeneous Intelligent Transportation Systems.
IEEE Trans. Intell. Transp. Syst., January, 2024

2023
An Android Malware Detection Method Using Multi-Feature and MobileNet.
J. Circuits Syst. Comput., November, 2023

Multiport Current Injection Hybrid DC Circuit Breaker With Simple Bridge Arm Circuit.
IEEE Trans. Ind. Electron., October, 2023

Gamify Stencil Dwarf on Cloud for Democratizing Scientific Computing.
CoRR, 2023

Generating Fast FFT Kernels on CPUs via FFT-Specific Intrinsics.
Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2023

OpenFFT: An Adaptive Tuning Framework for 3D FFT on ARM Multicore CPUs.
Proceedings of the 37th International Conference on Supercomputing, 2023

SA_TRSM: A Shape-Aware Auto-Tuning Framework for Small-Scale Irregular-Shaped TRSM.
Proceedings of the 29th IEEE International Conference on Parallel and Distributed Systems, 2023

2022
Publisher Correction: Smart scheduler: an adaptive NVM-aware thread scheduling approach on NUMA systems.
CCF Trans. High Perform. Comput., December, 2022

Smart scheduler: an adaptive NVM-aware thread scheduling approach on NUMA systems.
CCF Trans. High Perform. Comput., December, 2022

IATF: An Input-Aware Tuning Framework for Compact BLAS Based on ARMv8 CPUs.
Proceedings of the 51st International Conference on Parallel Processing, 2022

LBBGEMM: A Load-balanced Batch GEMM Framework on ARM CPUs.
Proceedings of the 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, 2022

EgpuIP: An Embedded GPU Accelerated Library for Image Processing.
Proceedings of the 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, 2022

2021
AutoTSMM: An Auto-tuning Framework for Building High-Performance Tall-and-Skinny Matrix-Matrix Multiplication on CPUs.
Proceedings of the 2021 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom), New York City, NY, USA, September 30, 2021

IAAT: A Input-Aware Adaptive Tuning framework for Small GEMM.
Proceedings of the 27th IEEE International Conference on Parallel and Distributed Systems, 2021

A Transpose-free Three-dimensional FFT Algorithm on ARM CPUs.
Proceedings of the 2021 IEEE 23rd Int Conf on High Performance Computing & Communications; 7th Int Conf on Data Science & Systems; 19th Int Conf on Smart City; 7th Int Conf on Dependability in Sensor, 2021

2020
Automatic Generation of High-Performance FFT Kernels on Arm and X86 CPUs.
IEEE Trans. Parallel Distributed Syst., 2020

一种偶数基Cooley-Tukey FFT高性能实现方法 (High-performance Implementation Method for Even Basis of Cooley-Tukey FFT).
计算机科学 (Computer Science), 2020

Accelerated LiDAR data processing algorithm for self-driving cars on the heterogeneous computing platform.
IET Comput. Digit. Tech., 2020

2019
Efficient parallel optimizations of a high-performance SIFT on GPUs.
J. Parallel Distributed Comput., 2019

AutoFFT: a template-based FFT codes auto-generation framework for ARM and X86 CPUs.
Proceedings of the International Conference for High Performance Computing, 2019

2018
DropPruning for Model Compression.
CoRR, 2018

Implementation and Optimization of Multi-dimensional Real FFT on ARMv8 Platform.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2018

2017
HartSift: A High-Accuracy and Real-Time SIFT Based on GPU.
Proceedings of the 23rd IEEE International Conference on Parallel and Distributed Systems, 2017

2016
Parallel Processing Systems for Big Data: A Survey.
Proc. IEEE, 2016

边缘海静力数值预报模式并行算法研究 (Parallelization of Hydrostatic Numerical Forecasting Model of Marginal Sea).
计算机科学 (Computer Science), 2016

2015
基于OpenCL的直方图生成算法优化方法研究 (Research on Histogram Generation Algorithm Optimization Based on OpenCL).
计算机科学 (Computer Science), 2015

Optimizing Image Sharpening Algorithm on GPU.
Proceedings of the 44th International Conference on Parallel Processing, 2015

Optimized Password Recovery for Encrypted RAR on GPUs.
Proceedings of the 17th IEEE International Conference on High Performance Computing and Communications, 2015

2014
Research on Mahalanobis Distance Algorithm Optimization Based on OpenCL.
Proceedings of the 2014 IEEE International Conference on High Performance Computing and Communications, 2014

2013
MPFFT: An Auto-Tuning FFT Library for OpenCL GPUs.
J. Comput. Sci. Technol., 2013

CLSIFT: An Optimization Study of the Scale Invariance Feature Transform on GPUs.
Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing, 2013

2012
An Insightful Program Performance Tuning Chain for GPU Computing.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2012

Accelerating Viola-Jones Face Detection Algorithm on GPUs.
Proceedings of the 14th IEEE International Conference on High Performance Computing and Communication & 9th IEEE International Conference on Embedded Software and Systems, 2012

GPURoofline: A Model for Guiding Performance Optimizations on GPUs.
Proceedings of the Euro-Par 2012 Parallel Processing - 18th International Conference, 2012

2011
Automatic FFT Performance Tuning on OpenCL GPUs.
Proceedings of the 17th IEEE International Conference on Parallel and Distributed Systems, 2011

2006
Evolutionary Based Intelligent Algorithm for Topology Optimization of Structure.
Proceedings of the Sixth International Conference on Intelligent Systems Design and Applications (ISDA 2006), 2006

