Alistair P. Rendell

Orcid: 0000-0002-9445-0146

According to our database1, Alistair P. Rendell authored at least 59 papers between 1993 and 2022.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2022
Scaling Correlated Fragment Molecular Orbital Calculations on Summit.
Proceedings of the SC22: International Conference for High Performance Computing, 2022

2021
Enabling large-scale correlated electronic structure calculations: scaling the RI-MP2 method on summit.
Proceedings of the International Conference for High Performance Computing, 2021

2020
Scaling the hartree-fock matrix build on summit.
Proceedings of the International Conference for High Performance Computing, 2020

2018
Development and Application of a Hybrid Programming Environment on an ARM/DSP System for High Performance Computing.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

2017
Programming the Adapteva Epiphany 64-core network-on-chip coprocessor.
Int. J. High Perform. Comput. Appl., 2017

Performance and Energy Analysis of Scientific Workloads Executing on LPSoCs.
Proceedings of the Parallel Processing and Applied Mathematics, 2017

Embedded Accelerators for Scientific High-Performance Computing: An Energy Study of OpenCL Gaussian Elimination Workloads.
Proceedings of the 46th International Conference on Parallel Processing Workshops, 2017

2016
Full-wave nonlinear ultrasound simulation on distributed clusters with applications in high-intensity focused ultrasound.
Int. J. High Perform. Comput. Appl., 2016

CompAdaGrad: A Compressed, Complementary, Computationally-Efficient Adaptive Gradient Method.
CoRR, 2016

Split Wisely: When Work Partitioning is Energy-Optimal on Heterogeneous Hardware.
Proceedings of the 18th IEEE International Conference on High Performance Computing and Communications; 14th IEEE International Conference on Smart City; 2nd IEEE International Conference on Data Science and Systems, 2016

2014
Resolutions of the Coulomb operator: VIII. Parallel implementation using the modern programming language X10.
J. Comput. Chem., 2014

PGAS-FMM: Implementing a distributed fast multipole method using the X10 programming language.
Concurr. Comput. Pract. Exp., 2014

Implementation and Optimization of the OpenMP Accelerator Model for the TI Keystone II Architecture.
Proceedings of the Using and Improving OpenMP for Devices, Tasks, and More, 2014

2013
Deterministic global optimization in ab-initio quantum chemistry.
J. Glob. Optim., 2013

First principles study of gallium cleaning for hydrogen-contaminated α-Al<sub>2</sub>O<sub>3</sub>(0001) surfaces.
J. Comput. Chem., 2013

Managing Complexity in the Parallel Sparse Grid Combination Technique.
Proceedings of the Parallel Computing: Accelerating Computational Science and Engineering (CSE), 2013

OpenMP on the Low-Power TI Keystone II ARM/DSP System-on-Chip.
Proceedings of the OpenMP in the Era of Low Power Devices and Accelerators, 2013

Use of SIMD Vector Operations to Accelerate Application Code Performance on Low-Powered ARM and Intel Platforms.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

Fault-Tolerant Grid-Based Solvers: Combining Concepts from Sparse Grids and MapReduce.
Proceedings of the International Conference on Computational Science, 2013

2012
Generating optimal CUDA sparse matrix-vector product implementations for evolving GPU hardware.
Concurr. Comput. Pract. Exp., 2012

Implementation of 3D FFTs Across Multiple GPUs in Shared Memory Environments.
Proceedings of the 13th International Conference on Parallel and Distributed Computing, 2012

Efficient update of ghost regions using active messages.
Proceedings of the 19th International Conference on High Performance Computing, 2012

2011
Modelling the Runtime of the Gaussian Computational Chemistry Application and Assessing the Impacts of Microarchitectural Variations.
Proceedings of the International Conference on Computational Science, 2011

High-Performance Pseudo-Random Number Generation on Graphics Processing Units.
Proceedings of the Parallel Processing and Applied Mathematics, 2011

Profiling Directed NUMA Optimization on Linux Systems: A Case Study of the Gaussian Computational Chemistry Code.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

X10 as a Parallel Language for Scientific Computation: Practice and Experience.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

2010
Runtime sparse matrix format selection.
Proceedings of the International Conference on Computational Science, 2010

From Sparse Matrix to Optimal GPU CUDA Sparse Matrix Vector Product Implementation.
Proceedings of the 10th IEEE/ACM International Conference on Cluster, 2010

Region-Based Prefetch Techniques for Software Distributed Shared Memory Systems.
Proceedings of the 10th IEEE/ACM International Conference on Cluster, 2010

2009
Liquid water: obtaining the right answer for the right reasons.
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2009

Non-threaded and Threaded Approaches to MultiRail Communication with uDAPL.
Proceedings of the NPC 2009, 2009

Use of Cluster OpenMP with the Gaussian Quantum Chemistry Code: A Preliminary Performance Analysis.
Proceedings of the Evolving OpenMP in an Age of Extreme Parallelism, 2009

Effective Use of Dynamic Page Migration on NUMA Platforms: The Gaussian Chemistry Code on the SunFire X4600M2 System.
Proceedings of the 10th International Symposium on Pervasive Systems, 2009

A Simple Performance Model for Multithreaded Applications Executing on Non-uniform Memory Access Computers.
Proceedings of the 11th IEEE International Conference on High Performance Computing and Communications, 2009

Integrating software distributed shared memory and message passing programming.
Proceedings of the 2009 IEEE International Conference on Cluster Computing, August 31, 2009

2008
Micro-benchmarks for Cluster OpenMP Implementations: Memory Consistency Costs.
Proceedings of the OpenMP in a New Era of Parallelism, 4th International Workshop, 2008

Memory and Thread Placement Effects as a Function of Cache Usage: A Study of the Gaussian Chemistry Code on the SunFire X4600 M2.
Proceedings of the 9th International Symposium on Parallel Architectures, 2008

Performance Evaluation of the NVIDIA GeForce 8800 GTX GPU for Machine Learning.
Proceedings of the Computational Science, 2008

Including Rigorous Numerical Bounds in Quantum Chemistry Calculations: Gaussian Integral Evaluation.
Proceedings of the 11th IEEE International Conference on Computational Science and Engineering, 2008

Reinforcement learning for automated performance tuning: Initial evaluation for sparse matrix format selection.
Proceedings of the 2008 IEEE International Conference on Cluster Computing, 29 September, 2008

Performance models for Cluster-enabled OpenMP implementations.
Proceedings of the 13th Asia-Pacific Computer Systems Architecture Conference, 2008

2007
On the Use of Incomplete LU Decomposition as a Preconditioning Technique for Density Fitting in Electronic Structure Computations.
Proceedings of the Computational Science and Its Applications, 2007

Interval Arithmetic and Computational Science: Rounding and Truncation Errors in N-Body Methods.
Proceedings of the Selected Papers of the Fifth International Conference on, 2007

The design of MPI based distributed shared memory systems to support OpenMP on clusters.
Proceedings of the 2007 IEEE International Conference on Cluster Computing, 2007

2006
Interval Arithmetic and Computational Science: Performance Considerations.
Proceedings of the Computational Science, 2006

A Project Based Approach to Teaching Parallel Systems.
Proceedings of the Computational Science, 2006

Modelling the Performance of the Gaussian Chemistry Code on x86 Architectures.
Proceedings of the Modeling, 2006

Exploring Thread and Memory Placement on NUMA Architectures: Solaris and Linux, UltraSPARC/FirePlane and Opteron/HyperTransport.
Proceedings of the High Performance Computing, 2006

2005
The SCore Cluster Enabled OpenMP Environment: Performance Prospects for Computational Science.
Proceedings of the Computational Science, 2005

Introducing Design Patterns, Graphical User Interfaces and Threads Within the Context of a High Performance Computing Application.
Proceedings of the Computational Science, 2005

2004
Cache Oblivious Matrix Transposition: Simulation and Experiment.
Proceedings of the Computational Science, 2004

2003
Enabling the Efficient Use of SMP Clusters: The GAMESS/DDI Model.
Proceedings of the ACM/IEEE SC2003 Conference on High Performance Networking and Computing, 2003

OpenMP and NUMA Architectures I: Investigating Memory Placement on the SCI Origin 3000.
Proceedings of the Computational Science - ICCS 2003, 2003

2002
Comparison of linear-scaling semiempirical methods and combined quantum mechanical/molecular mechanical methods for enzymic reactions. II. An energy decomposition analysis.
J. Comput. Chem., 2002

2000
Computational chemistry on Fujitsu vector-parallel processors: Development and performance of applications software.
Parallel Comput., 2000

Computational chemistry on Fujitsu vector-parallel processors: Hardware and programming environment.
Parallel Comput., 2000

Comparison of enzyme polarization of ligands and charge-transfer effects for dihydrofolate reductase using point-charge embedded ab initio quantum mechanical and linear-scaling semiempirical quantum mechanical methods.
J. Comput. Chem., 2000

1999
Faster gradients for semiempirical methods.
J. Comput. Chem., 1999

1993
Distributed data parallel coupled-cluster algorithm: Application to the 2-hydroxypyridine/2-pyridone tautomerism.
J. Comput. Chem., 1993


  Loading...