Daniel Jiménez-González

Orcid: 0000-0001-6064-7883

Affiliations:
  • Polytechnic University of Catalonia, Barcelona, Spain
  • Barcelona Supercomputing Center, Spain


According to our database1, Daniel Jiménez-González authored at least 56 papers between 1997 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Enabling HW-Based Task Scheduling in Large Multicore Architectures.
IEEE Trans. Computers, January, 2024

2023
Improving the Discovery and Clustering of Three-Dimensional Protein Patterns with OpenMP.
Proceedings of the 35th IEEE International Symposium on Computer Architecture and High Performance Computing, 2023

FPGA Framework Improvements for HPC Applications.
Proceedings of the International Conference on Field Programmable Technology, 2023

Improving Performance of HPC Kernels on FPGAs Using High-Level Resource Management.
Proceedings of the 31st IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2023

2022
OmpSs@cloudFPGA: An FPGA Task-Based Programming Model with Message Passing.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

Towards Reconfigurable Accelerators in HPC: Designing a Multipurpose eFPGA Tile for Heterogeneous SoCs.
Proceedings of the 2022 Design, Automation & Test in Europe Conference & Exhibition, 2022

2021
OmpSs@FPGA Framework for High Performance FPGA Computing.
IEEE Trans. Computers, 2021

The AXIOM Project: IoT on Heterogeneous Embedded Platforms.
IEEE Des. Test, 2021

Task-Based Programming Models for Heterogeneous Recurrent Workloads.
Proceedings of the Applied Reconfigurable Computing. Architectures, Tools, and Applications, 2021

2020
Asynchronous runtime with distributed manager for task-based programming models.
Parallel Comput., 2020

Breaking master-slave model between host and FPGAs.
Proceedings of the PPoPP '20: 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2020

2019
A Hardware Runtime for Task-Based Programming Models.
IEEE Trans. Parallel Distributed Syst., 2019

2018
An approach to task-based parallel programming for undergraduate students.
J. Parallel Distributed Comput., 2018

LightDock: a new multi-scale approach to protein-protein docking.
Bioinform., 2018


Application Acceleration on FPGAs with OmpSs@FPGA.
Proceedings of the International Conference on Field-Programmable Technology, 2018


2017
The AXIOM platform for next-generation cyber physical systems.
Microprocess. Microsystems, 2017

Implementation of the K-Means Algorithm on Heterogeneous Devices: A Use Case Based on an Industrial Dataset.
Proceedings of the Parallel Computing is Everywhere, 2017

General Purpose Task-Dependence Management Hardware for Task-Based Dataflow Programming Models.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

Characterizing and Improving the Performance of Many-Core Task-Based Parallel Programming Runtimes.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Picos, A Hardware Task-Dependence Manager for Task-Based Dataflow Programming Models.
Proceedings of the 2017 International Conference on High Performance Computing & Simulation, 2017

Exploiting Parallelism on GPUs and FPGAs with OmpSs.
Proceedings of the 1st Workshop on AutotuniNg and aDaptivity AppRoaches for Energy efficient HPC Systems, 2017

2016
MInGLE: An Efficient Framework for Domain Acceleration Using Low-Power Specialized Functional Units.
ACM Trans. Archit. Code Optim., 2016

The AXIOM software layers.
Microprocess. Microsystems, 2016

The Secrets of the Accelerators Unveiled: Tracing Heterogeneous Executions Through OMPT.
Proceedings of the OpenMP: Memory, Devices, and Tasks, 2016

Performance analysis of a hardware accelerator of dependence management for task-based dataflow programming models.
Proceedings of the 2016 IEEE International Symposium on Performance Analysis of Systems and Software, 2016


2015
Picos: A hardware runtime architecture support for OmpSs.
Future Gener. Comput. Syst., 2015

Coarse-Grain Performance Estimator for Heterogeneous Parallel Computing Architectures like Zynq All-Programmable SoC.
CoRR, 2015

Tareador: a tool to unveil parallelization strategies at undergraduate level.
Proceedings of the Workshop on Computer Architecture Education, 2015

The AXIOM project (Agile, eXtensible, fast I/O Module).
Proceedings of the 2015 International Conference on Embedded Computer Systems: Architectures, 2015


Automatic design of domain-specific instructions for low-power processors.
Proceedings of the 26th IEEE International Conference on Application-specific Systems, 2015

2014
Hybrid Dataflow/von-Neumann Architectures.
IEEE Trans. Parallel Distributed Syst., 2014

OmpSs@Zynq all-programmable SoC ecosystem.
Proceedings of the 2014 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2014

2013
Accelerating an application domain with specialized functional units.
ACM Trans. Archit. Code Optim., 2013

Heterogeneous tasking on SMP/FPGA SoCs: The case of OmpSs and the Zynq.
Proceedings of the 21st IEEE/IFIP International Conference on VLSI and System-on-Chip, 2013

Analysis of the Task Superscalar Architecture Hardware Design.
Proceedings of the International Conference on Computational Science, 2013

2012
Cell-Dock: high-performance protein-protein docking.
Bioinform., 2012

2010
Extending OpenMP to Survive the Heterogeneous Multi-Core Era.
Int. J. Parallel Program., 2010

Drug Design on the Cell BE.
Proceedings of the Scientific Computing with Multicore and Accelerators., 2010

2009
OpenMP extensions for FPGA accelerators.
Proceedings of the 2009 International Conference on Embedded Computer Systems: Architectures, 2009

A Proposal to Extend the OpenMP Tasking Model for Heterogeneous Architectures.
Proceedings of the Evolving OpenMP in an Age of Extreme Parallelism, 2009

2008
Drug Design Issues on the Cell BE.
Proceedings of the High Performance Embedded Architectures and Compilers, 2008

2007
Performance Analysis of Cell Broadband Engine for High Memory Bandwidth Applications.
Proceedings of the 2007 IEEE International Symposium on Performance Analysis of Systems and Software, 2007

Drug Design on the Cell BroadBand Engine.
Proceedings of the 16th International Conference on Parallel Architectures and Compilation Techniques (PACT 2007), 2007

2004
Algoritmos de ordenación conscientes de la arquitectura y las características de los datos.
PhD thesis, 2004

Characterization of the data access behavior for TPC-C traces.
Proceedings of the 2004 IEEE International Symposium on Performance Analysis of Systems and Software, 2004

2003
CC-Radix: a Cache Conscious Sorting Based on Radix sort.
Proceedings of the 11th Euromicro Workshop on Parallel, 2003

2002
The Effect of Local Sort on Parallel Sorting Algorithms.
Proceedings of the 10th Euromicro Workshop on Parallel, 2002

Case Study: Memory Conscious Parallel Sorting.
Proceedings of the Algorithms for Memory Hierarchies, 2002

2001
Fast parallel in-memory 64-bit sorting.
Proceedings of the 15th international conference on Supercomputing, 2001

1999
Sorting on the SGI Origin 2000: Comparing MPI and Shared Memory Implementations.
Proceedings of the 19th International Conference of the Chilean Computer Science Society (SCCC '99), 1999

Communication conscious radix sort.
Proceedings of the 13th international conference on Supercomputing, 1999

1997
An Analysis of Superscalar Sorting Algorithms on an R8000 Processor.
Proceedings of 17th International Conference of the Chilean Computer Science Society (SCCC '97), 1997


  Loading...