Hal Finkel

Orcid: 0000-0002-7551-7122

Affiliations:
  • Department of Energy, MA, USA
  • Argonne National Laboratory (former)


According to our database1, Hal Finkel authored at least 71 papers between 2012 and 2022.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2022
OpenMP application experiences: Porting to accelerated nodes.
Parallel Comput., 2022

Autotuning PolyBench benchmarks with LLVM Clang/Polly loop optimization pragmas using Bayesian optimization.
Concurr. Comput. Pract. Exp., 2022

2021
GRChombo: An adaptable numerical relativity code for fundamental physics.
J. Open Source Softw., 2021

Autotuning PolyBench Benchmarks with LLVM Clang/Polly Loop Optimization Pragmas Using Bayesian Optimization (extended version).
CoRR, 2021

Report of the Workshop on Program Synthesis for Scientific Computing.
CoRR, 2021

2020
Really Embedding Domain-Specific Languages into C++.
CoRR, 2020

Autotuning Search Space for Loop Transformations.
CoRR, 2020

Extending C++ for Heterogeneous Quantum-Classical Computing.
CoRR, 2020

Performance Evaluation of the Vectorizable Binary Search Algorithms on an FPGA Platform.
Proceedings of the 10th IEEE/ACM Workshop on Irregular Applications: Architectures and Algorithms, 2020

A Case Study on the HACCmk Routine in SYCL on Integrated Graphics.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

Analyzing Deep Learning Model Inferences for Image Classification using OpenVINO.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

Population Count on Intel® CPU, GPU and FPGA.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

2019
ClangJIT: Enhancing C++ with Just-in-Time Compilation.
CoRR, 2019

Performance Exploration Through Optimistic Static Program Annotations.
Proceedings of the High Performance Computing - 34th International Conference, 2019

Full-state quantum circuit simulation by using data compression.
Proceedings of the International Conference for High Performance Computing, 2019

ClangJIT: Enhancing C++ with Just-in-Time Compilation.
Proceedings of the 2019 IEEE/ACM International Workshop on Performance, 2019

Design and Use of Loop-Transformation Pragmas.
Proceedings of the OpenMP: Conquering the Full Hardware Spectrum, 2019

The TRegion Interface and Compiler Optimizations for OpenMP Target Regions.
Proceedings of the OpenMP: Conquering the Full Hardware Spectrum, 2019

Exploring Integer Sum Reduction using Atomics on Intel CPU.
Proceedings of the International Workshop on OpenCL, 2019

Performance of Floating-point Intensive Kernels on Low-power Processor - A Case Study with Geodesic Distance Kernel.
Proceedings of the Tenth International Green and Sustainable Computing Conference, 2019

Nuclear Reactor Simulations on OpenCL FPGA Platform.
Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2019

Base64 Encoding on OpenCL FPGA Platform.
Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2019

OpenCL Kernel Vectorization on the CPU, GPU, and FPGA: A Case Study with Frequent Pattern Compression.
Proceedings of the 27th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2019

Exploring the Random Network of Hodgkin and Huxley Neurons with Exponential Synaptic Conductances on OpenCL FPGA Platform.
Proceedings of the 27th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2019

Accelerating Hyperdimensional Classifier on Multiple GPUs.
Proceedings of the 2019 IEEE International Conference on Cluster Computing, 2019

A Case Study of k-means Clustering using SYCL.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

Exploration of OpenCL 2D Convolution Kernels on Intel FPGA, CPU, and GPU Platforms.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

Evaluation of Medical Imaging Applications using SYCL.
Proceedings of the 2019 IEEE International Conference on Bioinformatics and Biomedicine, 2019

Simulation of Random Network of Hodgkin and Huxley Neurons with Exponential Synaptic Conductances on an FPGA Platform.
Proceedings of the 10th ACM International Conference on Bioinformatics, 2019

Base64 Encoding on Heterogeneous Computing Platforms.
Proceedings of the 30th IEEE International Conference on Application-specific Systems, 2019

Evaluating LULESH Kernels on OpenCL FPGA.
Proceedings of the Applied Reconfigurable Computing - 15th International Symposium, 2019

2018
Memory-Efficient Quantum Circuit Simulation by Using Lossy Data Compression.
CoRR, 2018

Amplitude-Aware Lossy Compression for Quantum Circuit Simulation.
CoRR, 2018

Loop Optimization Framework.
CoRR, 2018

User-Directed Loop-Transformations in Clang.
CoRR, 2018

Evaluating Floating-point Intensive Applications on OpenCL FPGA Platforms: A Case Study on the SimpleMOC Kernel.
Proceedings of the 2018 International Conference on ReConFigurable Computing and FPGAs, 2018

Evaluating and Optimizing OpenCL Base64 Data Unpacking Kernel with FPGA.
Proceedings of the 26th Euromicro International Conference on Parallel, 2018

Compiler Optimizations for Parallel Programs.
Proceedings of the Languages and Compilers for Parallel Computing, 2018

Manage OpenMP GPU Data Environment Under Unified Address Space.
Proceedings of the Evolving OpenMP for Evolving Architectures, 2018

A Proposal for Loop-Transformation Pragmas.
Proceedings of the Evolving OpenMP for Evolving Architectures, 2018

Compiler Optimizations for OpenMP.
Proceedings of the Evolving OpenMP for Evolving Architectures, 2018

Distributed & Heterogeneous Programming in C++ for HPC at SC17.
Proceedings of the International Workshop on OpenCL, 2018

Nuclear Reactor Simulation on OpenCL FPGA: a Case Study of RSBench.
Proceedings of the International Workshop on OpenCL, 2018

Performance-oriented Optimizations for OpenCL Streaming Kernels on the FPGA.
Proceedings of the International Workshop on OpenCL, 2018

Evaluation of MD5Hash Kernel on OpenCL FPGA Platform.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

Power and Performance Tradeoff of a Floating-Point Intensive Kernel on OpenCL FPGA Platform.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

Optimizing an Atomics-Based Reduction Kernel on OpenCL FPGA Platform.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

Optimizing Parallel Reduction on OpenCL FPGA Platform - A Case Study of Frequent Pattern Compression.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

Evaluating an OpenCL FPGA Platform for HPC: a Case Study with the HACCmk Kernel.
Proceedings of the 2018 IEEE High Performance Extreme Computing Conference, 2018

A Case Study of Integer Sum Reduction using Atomics.
Proceedings of the 9th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies, 2018

Evaluating Radial Basis Function Kernel on OpenCL FPGA Platform.
Proceedings of the Ninth International Green and Sustainable Computing Conference, 2018

A Case Study of Complementary-multiply-with-carry Method on OpenCL FPGA.
Proceedings of the Ninth International Green and Sustainable Computing Conference, 2018

Evaluation of OpenCL Performance-oriented Optimizations for Streaming Kernels on the FPGA: (Abstract Only).
Proceedings of the 2018 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2018

Bob Jenkins Lookup3 Hash Function on OpenCL FPGA Platform.
Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

Optimizing Radial Basis Function Kernel on OpenCL FPGA Platform.
Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

2017
Trends in Data Locality Abstractions for HPC Systems.
IEEE Trans. Parallel Distributed Syst., 2017

HACC: extreme scaling and performance across diverse architectures.
Commun. ACM, 2017

Benchmarking and Evaluating Unified Memory for OpenMP GPU Offloading.
Proceedings of the Fourth Workshop on the LLVM Compiler Infrastructure in HPC, 2017

Evaluating irregular memory access on OpenCL FPGA platforms: A case study with XSBench.
Proceedings of the 27th International Conference on Field Programmable Logic and Applications, 2017

Evaluation of a Floating-Point Intensive Kernel on FPGA - A Case Study of Geodesic Distance Kernel.
Proceedings of the Euro-Par 2017: Parallel Processing Workshops, 2017

2016
Doing Moore with Less - Leapfrogging Moore's Law with Inexactness for Supercomputing.
CoRR, 2016

2015
High Energy Physics Forum for Computational Excellence: Working Group Reports (I. Applications Software II. Software Libraries and Tools III. Systems).
CoRR, 2015

Large-scale compute-intensive analysis via a combined in-situ and co-scheduling workflow approach.
Proceedings of the International Conference for High Performance Computing, 2015

Testing and debugging exascale applications by mocking MPI.
Proceedings of the 3rd International Workshop on Software Engineering for High Performance Computing in Computational Science and Engineering, 2015

Supporting Indirect Data Mapping in OpenMP.
Proceedings of the OpenMP: Heterogenous Execution and Data Movements, 2015

Data-dependence profiling to enable safe thread level speculation.
Proceedings of 25th Annual International Conference on Computer Science and Software Engineering, 2015

2014
Large-Scale Simulations of Sky Surveys.
Comput. Sci. Eng., 2014

Scalable Parallel I/O on a Blue Gene/Q Supercomputer Using Compression, Topology-Aware Data Aggregation, and Subfiling.
Proceedings of the 22nd Euromicro International Conference on Parallel, 2014

2013
HACC: extreme scaling and performance across diverse architectures.
Proceedings of the International Conference for High Performance Computing, 2013

2012
Meshing the Universe: Integrating Analysis in Cosmological Simulations.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

The universe at extreme scale: multi-petaflop sky simulation on the BG/Q.
Proceedings of the SC Conference on High Performance Computing Networking, 2012


  Loading...