Ramón Doallo

According to our database, Ramón Doallo authored at least 175 papers between 1990 and 2019.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2019
Parallel prefix operations on GPU: tridiagonal system solvers and scan operators.
The Journal of Supercomputing, 2019

Supporting multi-resolution out-of-core rendering of massive LiDAR point clouds through non-redundant data structures.
International Journal of Geographical Information Science, 2019

A Fast Solver for Large Tridiagonal Systems on Multi-Core Processors (Lass Library).
IEEE Access, 2019

2018
Solving Large Problem Sizes of Index-Digit Algorithms on GPU: FFT and Tridiagonal System Solvers.
IEEE Trans. Computers, 2018

Towards cloud-based parallel metaheuristics.
IJHPCA, 2018

Heterogeneous distributed computing based on high-level abstractions.
Concurrency and Computation: Practice and Experience, 2018

Multimethod optimization in the cloud: A case-study in systems biology modelling.
Concurrency and Computation: Practice and Experience, 2018

Multimethod Optimization for Reverse Engineering of Complex Biological Networks.
Proceedings of the 6th International Workshop on Parallelism in Bioinformatics, 2018

Solving Multiple Tridiagonal Systems on a Multi-GPU Platform.
Proceedings of the 26th Euromicro International Conference on Parallel, 2018

Efficient Solving of Scan Primitive on Multi-GPU Systems.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

Guiding the Optimization of Parallel Codes on Multicores Using an Analytical Cache Model.
Proceedings of the Computational Science - ICCS 2018, 2018

Big data storage technologies: a case study for web-based LiDAR visualization.
Proceedings of the IEEE International Conference on Big Data, 2018

2017
BPLG-BMCS: GPU-sorting algorithm using a tuning skeleton library.
The Journal of Supercomputing, 2017

High productivity multi-device exploitation with the Heterogeneous Programming Library.
J. Parallel Distrib. Comput., 2017

Facilitating the development of stencil applications using the Heterogeneous Programming Library.
Concurrency and Computation: Practice and Experience, 2017

A cloud-based enhanced differential evolution algorithm for parameter estimation problems in computational systems biology.
Cluster Computing, 2017

Parameter estimation in large-scale systems biology models: a parallel and self-adaptive cooperative strategy.
BMC Bioinformatics, 2017

Using the Cloud for parameter estimation problems: comparing Spark vs MPI with a case-study.
Proceedings of the 17th IEEE/ACM International Symposium on Cluster, 2017

2016
A simulated annealing algorithm for zoning in planning using parallel computing.
Computers, Environment and Urban Systems, 2016

Designing Efficient Index-Digit Algorithms for CUDA GPU Architectures.
IEEE Trans. Parallel Distrib. Syst., 2016

Performance Evaluation of Data-Intensive Computing Applications on a Public IaaS Cloud.
Comput. J., 2016

Towards a High Level Approach for the Programming of Heterogeneous Clusters.
Proceedings of the 45th International Conference on Parallel Processing Workshops, 2016

Implementing Parallel Differential Evolution on Spark.
Proceedings of the Applications of Evolutionary Computation - 19th European Conference, 2016

Evaluation of Parallel Differential Evolution Implementations on MapReduce and Spark.
Proceedings of the Euro-Par 2016: Parallel Processing Workshops, 2016

2015
Developing adaptive multi-device applications with the Heterogeneous Programming Library.
The Journal of Supercomputing, 2015

On Processing Extreme Data.
Scalable Computing: Practice and Experience, 2015

BPLG: A Tuned Butterfly Processing Library for GPU Architectures.
International Journal of Parallel Programming, 2015

Low-latency Java communication devices on RDMA-enabled networks.
Concurrency and Computation: Practice and Experience, 2015

Automatic Generation of Optimized OpenCL Codes Using OCLoptimizer.
Comput. J., 2015

Enhanced parallel Differential Evolution algorithm for problems in computational systems biology.
Appl. Soft Comput., 2015

Parallel Metaheuristics in Computational Biology: An Asynchronous Cooperative Enhanced Scatter Search Method.
Proceedings of the International Conference on Computational Science, 2015

New Tridiagonal Systems Solvers on GPU Architectures.
Proceedings of the 22nd IEEE International Conference on High Performance Computing, 2015

2014
Address independent estimation of the boundaries of cache performance.
Microprocessors and Microsystems - Embedded Hardware Design, 2014

High-performance computing selection of models of DNA substitution for multicore clusters.
IJHPCA, 2014

A fine-grained thread-aware management policy for shared caches.
Concurrency and Computation: Practice and Experience, 2014

Multicore cache hierarchies: design and programmability issues.
Concurrency and Computation: Practice and Experience, 2014

FastMPJ: a scalable and efficient Java message-passing library.
Cluster Computing, 2014

Efficient Scan Operator Methods on a GPU.
Proceedings of the 26th IEEE International Symposium on Computer Architecture and High Performance Computing, 2014

A Parallel Differential Evolution Algorithm for Parameter Estimation in Dynamic Models of Biological Systems.
Proceedings of the 8th International Conference on Practical Applications of Computational Biology & Bioinformatics, 2014

Writing Self-adaptive Codes for Heterogeneous Systems.
Proceedings of the Euro-Par 2014 Parallel Processing, 2014

2013
High performance genetic algorithm for land use planning.
Computers, Environment and Urban Systems, 2013

A population-based iterated greedy algorithm for the delimitation and zoning of rural settlements.
Computers, Environment and Urban Systems, 2013

Influence of memory access patterns to small-scale FFT performance.
The Journal of Supercomputing, 2013

Virtually split cache: An efficient mechanism to distribute instructions and data.
TACO, 2013

Java in the High Performance Computing arena: Research, practice and experience.
Sci. Comput. Program., 2013

Evaluation of messaging middleware for high-performance cloud computing.
Personal and Ubiquitous Computing, 2013

Accurate prediction of the behavior of multithreaded applications in shared caches.
Parallel Computing, 2013

Design and Implementation of an Extended Collectives Library for Unified Parallel C.
J. Comput. Sci. Technol., 2013

Compiler-Assisted Checkpointing of Parallel Codes: The Cetus and LLVM Experience.
International Journal of Parallel Programming, 2013

Parallel Monte Carlo radiosity using scene partitioning.
IJHPCA, 2013

Analysis of I/O Performance on an Amazon EC2 Cluster Compute and High I/O Platform.
J. Grid Comput., 2013

Performance analysis of HPC applications in the cloud.
Future Generation Comp. Syst., 2013

Web-GIS tool for the management of rural land markets - Application to the Land Bank of Galicia (NW Spain).
Earth Science Informatics, 2013

A multi-GPU shallow-water simulation with transport of contaminants.
Concurrency and Computation: Practice and Experience, 2013

General-purpose computation on GPUs for high performance cloud computing.
Concurrency and Computation: Practice and Experience, 2013

Graphics processing unit computing and exploitation of hardware accelerators.
Concurrency and Computation: Practice and Experience, 2013

Design of Scalable Java Communication Middleware for Multi-Core Systems.
Comput. J., 2013

SPLG: A Tuned Signal Processing Library for GPU Architectures.
Proceedings of the 25th International Symposium on Computer Architecture and High Performance Computing, 2013

Topic 4: High-Performance Architectures and Compilers - (Introduction).
Proceedings of the Euro-Par 2013 Parallel Processing, 2013

Evaluation of Java for General Purpose GPU Computing.
Proceedings of the 27th International Conference on Advanced Information Networking and Applications Workshops, 2013

2012
F-MPJ: scalable Java message-passing communications on parallel systems.
The Journal of Supercomputing, 2012

Design of scalable Java message-passing communications over InfiniBand.
The Journal of Supercomputing, 2012

Static analysis of the worst-case memory performance for irregular codes with indirections.
TACO, 2012

High-performance Monte Carlo radiosity on GPU based on scene partitioning.
Microprocessors and Microsystems - Embedded Hardware Design, 2012

Special issue editorial: Exploitation of hardware accelerators.
Microprocessors and Microsystems - Embedded Hardware Design, 2012

Special issue editorial: Accelerators for high-performance computing.
J. Parallel Distrib. Comput., 2012

UPCBLAS: a library for parallel matrix computations in Unified Parallel C.
Concurrency and Computation: Practice and Experience, 2012

Using an Analytical Model of Shared Caches for Selecting the Optimal Parallelization Scheme.
Proceedings of the 10th IEEE International Symposium on Parallel and Distributed Processing with Applications, 2012

Adaptive Set-Granular Cooperative Caching.
Proceedings of the 18th IEEE International Symposium on High Performance Computer Architecture, 2012

Synthesis of Multiresolution Scenes with Global Illumination on a GPU.
Proceedings of the GRAPP & IVAPP 2012: Proceedings of the International Conference on Computer Graphics Theory and Applications and International Conference on Information Visualization Theory and Applications, 2012

2011
Design of efficient Java message-passing collectives on multi-core clusters.
The Journal of Supercomputing, 2011

Optimizing Monte Carlo radiosity on graphics hardware.
The Journal of Supercomputing, 2011

Parallel hierarchical radiosity on hybrid platforms.
The Journal of Supercomputing, 2011

Device level communication libraries for high-performance computing in Java.
Concurrency and Computation: Practice and Experience, 2011

ProtTest 3: fast selection of best-fit models of protein evolution.
Bioinformatics, 2011

FFT Implementation on a Streaming Architecture.
Proceedings of the 19th International Euromicro Conference on Parallel, 2011

Simulation of pollutant transport in shallow water on a CUDA architecture.
Proceedings of the 2011 International Conference on High Performance Computing & Simulation, 2011

Design and Implementation of MapReduce Using the PGAS Programming Model with UPC.
Proceedings of the 17th IEEE International Conference on Parallel and Distributed Systems, 2011

Scalable Java Communication Middleware for Hybrid Shared/Distributed Memory Architectures.
Proceedings of the 13th IEEE International Conference on High Performance Computing & Communication, 2011

A Java-based parallel genetic algorithm for the land use planning problem.
Proceedings of the 13th Annual Genetic and Evolutionary Computation Conference, 2011

HPC selection of models of DNA substitution.
Proceedings of the Computational Methods in Systems Biology, 9th International Conference, 2011

2010
Address-Independent Estimation of the Worst-case Memory Performance.
IEEE Trans. Industrial Informatics, 2010

Performance analysis of message-passing libraries on high-speed clusters.
Comput. Syst. Sci. Eng., 2010

CPPC: a compiler-assisted tool for portable checkpointing of message-passing applications.
Concurrency and Computation: Practice and Experience, 2010

Hierarchical Radiosity for Multiresolution Systems Based on Normal Tests.
Comput. J., 2010

Uniform partitioning of Monte Carlo radiosity on GPUs.
Proceedings of the 2010 International Conference on High Performance Computing & Simulation, 2010

Reducing capacity and conflict misses using Set Saturation Levels.
Proceedings of the 2010 International Conference on High Performance Computing, 2010

ProtTest-HPC: Fast Selection of Best-Fit Models of Protein Evolution.
Proceedings of the Euro-Par 2010 Parallel Processing Workshops, 2010

2009
Static Prediction of Worst-Case Data Cache Performance in the Absence of Base Address Information.
Proceedings of the 15th IEEE Real-Time and Embedded Technology and Applications Symposium, 2009

Performance Evaluation of MPI, UPC and OpenMP on Multicore Architectures.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2009

Java for high performance computing: assessment of current research and practice.
Proceedings of the 7th International Conference on Principles and Practice of Programming in Java, 2009

High Performance Global Illumination on Multi-core Architectures.
Proceedings of the 17th Euromicro International Conference on Parallel, 2009

NPB-MPJ: NAS Parallel Benchmarks Implementation for Message-Passing in Java.
Proceedings of the 17th Euromicro International Conference on Parallel, 2009

Adaptive line placement with the set balancing cache.
Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-42 2009), 2009

Ontological Configuration Management for Wireless Mesh Routers.
Proceedings of the IP Operations and Management, 9th IEEE International Workshop, 2009

Performance Evaluation of Unified Parallel C Collective Communications.
Proceedings of the 11th IEEE International Conference on High Performance Computing and Communications, 2009

Efficient Java Communication Libraries over InfiniBand.
Proceedings of the 11th IEEE International Conference on High Performance Computing and Communications, 2009

A Parallel Numerical Library for UPC.
Proceedings of the Euro-Par 2009 Parallel Processing, 2009

2008
XARK: An extensible framework for automatic recognition of computational kernels.
ACM Trans. Program. Lang. Syst., 2008

Java Fast Sockets: Enabling high-speed Java communications on high performance clusters.
Computer Communications, 2008

2007
Precise automatable analytical modeling of the cache behavior of codes with indirections.
TACO, 2007

Special Issue: Current Trends in Compilers for Parallel Computers.
Concurrency and Computation: Practice and Experience, 2007

Automated and accurate cache behavior analysis for codes with irregular access patterns.
Concurrency and Computation: Practice and Experience, 2007

A Hierarchical Radiosity Method with Scene Distribution.
Proceedings of the 15th Euromicro International Conference on Parallel, 2007

Fault-tolerant solutions for a MPI compute intensive application.
Proceedings of the 15th Euromicro International Conference on Parallel, 2007

High Performance Java Sockets for Parallel Computing on Clusters.
Proceedings of the 21st International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Program Behavior Characterization Through Advanced Kernel Recognition.
Proceedings of the Euro-Par 2007, 2007

Towards Low-Latency Model-Oriented Distributed Systems Management.
Proceedings of the Managing Next Generation Networks and Services, 2007

2006
Analytical modeling of codes with arbitrary data-dependent conditional structures.
Journal of Systems Architecture, 2006

High Performance Air Quality Simulation in the European CrossGrid Project.
Computers and Artificial Intelligence, 2006

Non-blocking Java Communications Support on Clusters.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2006

Cache Behavior Modelling for Codes Involving Banded Matrices.
Proceedings of the Languages and Compilers for Parallel Computing, 2006

Efficient Java Communication Protocols on High-speed Cluster Interconnects.
Proceedings of the LCN 2006, 2006

Dynamic Load-Balancing for the STEM-II Air Quality Model.
Proceedings of the Computational Science and Its Applications, 2006

2005
Parallel Global Illumination Method Based on a Non-Uniform Partitioning of the Scene.
Proceedings of the 13th Euromicro Workshop on Parallel, 2005

Designing Efficient Java Communications on Clusters.
Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005

A Framework Focus on Configuration Modeling and Integration with Transparent Persistence.
Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005

Modeling Execution Time of Selected Computation and Communication Kernels on Grids.
Proceedings of the Advances in Grid Computing, 2005

Improving the Performance of Visibility Determination in Global Illumination Methods.
Proceedings of The 2005 International Conference on Imaging Science, 2005

2004
High Performance Air Pollution Simulation Using OpenMP.
The Journal of Supercomputing, 2004

A compiler tool to predict memory hierarchy performance of scientific codes.
Parallel Computing, 2004

A middleware architecture for distributed systems management.
J. Parallel Distrib. Comput., 2004

Progressive radiosity method on clusters using a new clipping algorithm.
IJHPCN, 2004

A Grid Portal to Support High-Performance Scientific Computing on Distributed Resources.
IEICE Transactions, 2004

An Inspector-Executor Algorithm for Irregular Assignment Parallelization.
Proceedings of the Parallel and Distributed Processing and Applications, 2004

Compiler Support for Parallel Code Generation through Kernel Recognition.
Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

Air Pollution Modeling in the CrossGrid Project.
Proceedings of the Computational Science, 2004

Modeling the Cache Behavior of Codes with Arbitrary Data-Dependent Conditional Structures.
Proceedings of the Advances in Computer Systems Architecture, 9th Asia-Pacific Conference, 2004

2003
Probabilistic Miss Equations: Evaluating Memory Hierarchy Performance.
IEEE Trans. Computers, 2003

High performance air pollution modeling for a power plant environment.
Parallel Computing, 2003

Research Article: A GIS-embedded system to support land consolidation plans in Galicia.
International Journal of Geographical Information Science, 2003

Cache Behavior Modeling of Codes with Data-Dependent Conditionals.
Proceedings of the Software and Compilers for Embedded Systems, 7th International Workshop, 2003

Performance Modeling and Evaluation of Java Message-Passing Primitives on a Cluster.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 10th European PVM/MPI Users' Group Meeting, Venice, Italy, September 29, 2003

Increasing the Throughput of Available Resources Using Management Tools Based on Grid Technologies.
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

A GSA-based compiler infrastructure to extract parallelism from complex loops.
Proceedings of the 17th Annual International Conference on Supercomputing, 2003

A Grid-Enabled Air Quality Simulation.
Proceedings of the Grid Computing, 2003

Performance Analysis of Java Message-Passing Libraries on Fast Ethernet, Myrinet and SCI Clusters.
Proceedings of the 2003 IEEE International Conference on Cluster Computing (CLUSTER 2003), 2003

2002
Performance Modeling and Evaluation of MPI-I/O on a Cluster.
J. Inf. Sci. Eng., 2002

A Meshing Scheme for Real Time Surface Subdivision.
Proceedings of the 10-th International Conference in Central Europe on Computer Graphics, 2002

A High-Performance Progressive Radiosity Method Based on Scene Partitioning.
Proceedings of the High Performance Computing for Computational Science, 2002

Performance analysis of MPI-I/O primitives on a PC cluster.
Proceedings of the 2002 ACM Symposium on Applied Computing (SAC), 2002

A Cluster-Based Solution for a High Performance Air Quality Simulation.
Proceedings of the Applied Parallel Computing Advanced Scientific Computing, 2002

Irregular Assignment Computations on cc-NUMA Multiprocessors.
Proceedings of the High Performance Computing, 4th International Symposium, 2002

High Performance Air Pollution Simulation Using OpenMP.
Proceedings of the 31st International Conference on Parallel Processing Workshops (ICPP 2002 Workshops), 2002

Towards Detection of Coarse-Grain Loop-Level Parallelism in Irregular Computations.
Proceedings of the Euro-Par 2002, 2002

Efficient parallel implementations for surface subdivision.
Proceedings of the Fourth Eurographics Workshop on Parallel Graphics and Visualization, 2002

2001
Efficient parallel numerical solver for the elastohydrodynamic Reynolds-Hertz problem.
Parallel Computing, 2001

Hierarchical Radiosity on Multicomputers: a Load-Balanced Approach.
Proceedings of the Tenth SIAM Conference on Parallel Processing for Scientific Computing, 2001

A Compiler Framework to Detect Parallelism in Irregular Codes.
Proceedings of the Languages and Compilers for Parallel Computing, 2001

The STEM-II Air Quality Model on a Distributed Memory System.
Proceedings of the 30th International Workshops on Parallel Processing (ICPP 2001 Workshops), 2001

Characterization of Message-Passing Overhead on the AP3000 Multicomputer.
Proceedings of the 2001 International Conference on Parallel Processing, 2001

Parallelization of the STEM-II Air Quality Model.
Proceedings of the High-Performance Computing and Networking, 9th International Conference, 2001

COPA: a GIS-based Tool for Land Consolidation Projects.
Proceedings of the ACM-GIS 2001, 2001

1999
Memory Hierarchy Performance Prediction for Blocked Sparse Algorithms.
Parallel Processing Letters, 1999

Modeling MPI Collective Communications on the AP3000 Multicomputer.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 1999

Direct mapped cache performance modeling for sparse matrix operations.
Proceedings of the Seventh Euromicro Workshop on Parallel and Distributed Processing. PDP'99, 1999

Bayesian image restoration: Parallel implementation on a SGI origin multiprocessor.
Proceedings of the Parallel Computing: Fundamentals & Applications, 1999

A Parallel Approach for Solving a Lubrication Problem in Industrial Devices.
Proceedings of the High-Performance Computing and Networking, 7th International Conference, 1999

Performance Evaluation and Modeling of the Fujitsu AP3000 Message-Passing Libraries.
Proceedings of the Euro-Par '99 Parallel Processing, 5th International Euro-Par Conference, Toulouse, France, August 31, 1999

Set Associative Cache Behavior Optimization.
Proceedings of the Euro-Par '99 Parallel Processing, 5th International Euro-Par Conference, Toulouse, France, August 31, 1999

Automatic Analytical Modeling for the Estimation of Cache Misses.
Proceedings of the 1999 International Conference on Parallel Architectures and Compilation Techniques, 1999

1998
High Performance Computing of an Industrial Problem in Tribology.
Proceedings of the Vector and Parallel Processing, 1998

Modeling Set Associative Caches Behavior for Irregular Computations.
Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems, 1998

A PVM-Based Library for Sparse Matrix Factorizations.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 1998

HPF-2 Support for Dynamic Sparse Computations.
Proceedings of the Languages and Compilers for Parallel Computing, 1998

Cache Misses Prediction for High Performance Sparse Algorithms.
Proceedings of the Euro-Par '98 Parallel Processing, 1998

Cache Probabilistic Modeling for Basic Sparse Algebra Kernels Involving Matrices with a Non Uniform Distribution.
Proceedings of the 24th EUROMICRO '98 Conference, 1998

1997
High-performance VLSI architecture for the Viterbi algorithm.
IEEE Trans. Communications, 1997

1996
Parallel Programming with Polaris.
IEEE Computer, 1996

Sparse Householder QR Factorization on a Mesh.
Proceedings of the 4th Euromicro Workshop on Parallel and Distributed Processing (PDP '96), 1996

Parallel Sparse Modified Gram-Schmidt QR Decomposition.
Proceedings of the High-Performance Computing and Networking, 1996

1994
Parallel Architecture for Fast Transforms with Trigonometric Kernel.
IEEE Trans. Parallel Distrib. Syst., 1994

1990
Gaussian elimination with pivoting on hypercubes.
Parallel Computing, 1990

A VLSI Systolic Architecture for Solving DBT-Transformed Fuzzy Clustering Problems of Arbitrary Size.
Parallel Computing, 1990

ACLE: A Software Package for SIMD Computer Simulation.
Comput. J., 1990

