We stand with Ukraine

We stand with Ukraine

Dimitrios S. Nikolopoulos

Orcid: 0000-0003-0217-8307

According to our database¹, Dimitrios S. Nikolopoulos authored at least 256 papers between 1998 and 2026.

Collaborative distances:

Dijkstra number² of three.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

Online presence:

on orcid.org

On csauthors.net:

Bibliography

2026

FACT: Compositional Kernel Synthesis with a Three-Stage Agentic Workflow.

[DOI]

,

Dimitrios S. Nikolopoulos

CoRR, April, 2026

Polymorph: Energy-Efficient Multi-Label Classification for Video Streams on Embedded Devices.

[DOI]

,

,

,

,

,

Dimitrios S. Nikolopoulos

,

Hans Vandierendonck

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026

ConfigSpec: Profiling-Based Configuration Selection for Distributed Edge-Cloud Speculative LLM Serving.

[DOI]

,

,

,

,

Hans Vandierendonck

,

Dimitrios S. Nikolopoulos

Proceedings of the 4th International Workshop on Testing Distributed Internet of Things Systems, 2026

WISP: Waste- and Interference-Suppressed Distributed Speculative LLM Serving at the Edge via Dynamic Drafting and SLO-Aware Batching.

[DOI]

,

,

,

Dimitrios Spatharakis

,

,

Hans Vandierendonck

,

,

,

,

Dimitrios S. Nikolopoulos

Proceedings of the Abstracts of the 2026 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, 2026

dLLM-Serve: Bridging the Memory Gap in Diffusion Language Model Serving.

[DOI]

,

,

,

Dimitrios S. Nikolopoulos

Proceedings of the 40th ACM International Conference on Supercomputing, 2026

2025

Modality Inflation: Energy Characterization and Optimization Opportunities for MLLM Inference.

[DOI]

Mona Moghadampanah

,

Adib Rezaei Shahmirzadi

,

,

Dimitrios S. Nikolopoulos

CoRR, December, 2025

Taming the Memory Footprint Crisis: System Design for Production Diffusion LLM Serving.

[DOI]

,

,

,

Dimitrios S. Nikolopoulos

CoRR, December, 2025

DiffPro: Joint Timestep and Layer-Wise Precision Optimization for Efficient Diffusion Inference.

[DOI]

,

,

Kanchon Gharami

,

Mona Moghadampanah

,

Dimitrios S. Nikolopoulos

CoRR, November, 2025

QPART: Adaptive Model Quantization and Dynamic Workload Balancing for Accuracy-aware Edge Inference.

[DOI]

,

,

,

Hans Vandierendonck

,

,

Dimitrios S. Nikolopoulos

CoRR, June, 2025

Parallel CPU-GPU Execution for LLM Inference on Constrained GPUs.

[DOI]

,

,

,

Dimitrios S. Nikolopoulos

CoRR, June, 2025

MARCO: A Multi-Agent System for Optimizing HPC Code Generation Using Large Language Models.

[DOI]

,

Veljko Cvetkovic

,

,

,

,

,

,

,

,

Dimitrios S. Nikolopoulos

CoRR, May, 2025

MARVEL: An End-to-End Framework for Generating Model-Class Aware Custom RISC-V Extensions for Lightweight AI.

[DOI]

,

,

Pedro Kreutz Werle

,

Shreejith Shanker

,

Dimitrios S. Nikolopoulos

,

,

Hans Vandierendonck

,

IEEE Open J. Circuits Syst., 2025

SLED: A Speculative LLM Decoding Framework for Efficient Edge Serving.

[DOI]

,

Dimitrios Spatharakis

,

,

,

Hans Vandierendonck

,

,

,

Dimitrios S. Nikolopoulos

Proceedings of the Tenth ACM/IEEE Symposium on Edge Computing, 2025

HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language Models.

[DOI]

Kazi Hasan Ibn Arif

,

,

Dimitrios S. Nikolopoulos

,

Hans Vandierendonck

,

,

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

On Robust Optimal Joint Deployment and Assignment of RAN Intelligent Controllers in O-RANs.

[DOI]

Mohammad J. Abdel-Rahman

,

Emadeldin A. Mazied

,

,

,

Allen B. MacKenzie

,

Scott F. Midkiff

,

Kleber Vieira Cardoso

,

Dimitrios S. Nikolopoulos

IEEE Open J. Commun. Soc., 2024

Tuning Fast Memory Size based on Modeling of Page Migration for Tiered Memory.

[DOI]

,

,

,

,

,

Dimitrios S. Nikolopoulos

,

,

,

,

CoRR, 2024

HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language Models in Resource-Constrained Environments.

[DOI]

Kazi Hasan Ibn Arif

,

,

Dimitrios S. Nikolopoulos

,

Hans Vandierendonck

,

,

CoRR, 2024

Parallel Islands: A Parallel Computing Educational Video Game.

[DOI]

Melissa Cameron

,

,

Dimitrios S. Nikolopoulos

Proceedings of the 55th ACM Technical Symposium on Computer Science Education, 2024

ParvaGPU: Efficient Spatial GPU Sharing for Large-Scale DNN Inference in Cloud Environments.

[DOI]

,

,

,

,

,

,

Dimitrios S. Nikolopoulos

,

Proceedings of the International Conference for High Performance Computing, 2024

FrameFeedback: A Closed-Loop Control System for Dynamic Offloading Real-Time Edge Inference.

[DOI]

Matthew Jackson

,

,

Dimitrios S. Nikolopoulos

Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

Application-Attuned Memory Management for Containerized HPC Workflows.

[DOI]

,

,

M. Mustafa Rafique

,

Dimitrios S. Nikolopoulos

,

Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

2023

Decentralised Biomedical Signal Classification using Early Exits.

[DOI]

,

Hans Vandierendonck

,

Dimitrios S. Nikolopoulos

,

,

,

Proceedings of the 21st IEEE Interregional NEWCAS Conference, 2023

2022

Efficient, Dynamic Multi-Task Execution on FPGA-Based Computing Systems.

[DOI]

Umar Ibrahim Minhas

,

,

Dimitrios S. Nikolopoulos

,

Georgios Karakonstantis

IEEE Trans. Parallel Distributed Syst., 2022

Power Log'n'Roll: Power-Efficient Localized Rollback for MPI Applications Using Message Logging Protocols.

[DOI]

,

Daniele De Sensi

,

Dimitrios S. Nikolopoulos

,

Kirk W. Cameron

,

Ivor T. A. Spence

IEEE Trans. Parallel Distributed Syst., 2022

Mixed-Precision Kernel Recursive Least Squares.

[DOI]

,

Dimitrios S. Nikolopoulos

,

Hans Vandierendonck

IEEE Trans. Neural Networks Learn. Syst., 2022

gShare: A centralized GPU memory management framework to enable GPU memory sharing for containers.

[DOI]

,

,

,

Dimitrios S. Nikolopoulos

Future Gener. Comput. Syst., 2022

On Realizing Efficient Deep Learning Using Serverless Computing.

[DOI]

,

,

M. Mustafa Rafique

,

Dimitrios S. Nikolopoulos

Proceedings of the 22nd IEEE International Symposium on Cluster, 2022

2021

Revealing DRAM Operating GuardBands Through Workload-Aware Error Predictive Modeling.

[DOI]

,

Konstantinos Tovletoglou

,

Hans Vandierendonck

,

Dimitrios S. Nikolopoulos

,

Georgios Karakonstantis

IEEE Trans. Computers, 2021

Linear Regression Based DDoS Attack Detection.

[DOI]

Sakil Barbhuiya

,

Peter Kilpatrick

,

Dimitrios S. Nikolopoulos

Proceedings of the ICMLC 2021: 13th International Conference on Machine Learning and Computing, 2021

2020

ENORM: A Framework For Edge NOde Resource Management.

[DOI]

,

Blesson Varghese

,

Michail Matthaiou

,

Dimitrios S. Nikolopoulos

IEEE Trans. Serv. Comput., 2020

AIR: Iterative refinement acceleration using arbitrary dynamic precision.

[DOI]

,

Gregory D. Peterson

,

Dimitrios S. Nikolopoulos

,

Hans Vandierendonck

Parallel Comput., 2020

DYVERSE: DYnamic VERtical Scaling in multi-tenant Edge environments.

[DOI]

,

Michail Matthaiou

,

Dimitrios S. Nikolopoulos

,

Blesson Varghese

Future Gener. Comput. Syst., 2020

Fast load balance parallel graph analytics with an automatic graph data structure selection algorithm.

[DOI]

,

Hans Vandierendonck

,

Dimitrios S. Nikolopoulos

Future Gener. Comput. Syst., 2020

RIANN: Real-time Incremental Learning with Approximate Nearest Neighbor on Mobile Devices.

[DOI]

,

,

Dimitrios S. Nikolopoulos

,

Proceedings of the 2020 USENIX Conference on Operational Machine Learning, 2020

DStress: Automatic Synthesis of DRAM Reliability Stress Viruses using Genetic Algorithms.

[DOI]

,

Dimitrios S. Nikolopoulos

,

Georgios Karakonstantis

Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, 2020

DroidLight: Lightweight Anomaly-based Intrusion Detection System for Smartphone Devices.

[DOI]

Sakil Barbhuiya

,

Peter Kilpatrick

,

Dimitrios S. Nikolopoulos

Proceedings of the ICDCN 2020: 21st International Conference on Distributed Computing and Networking, 2020

DEFCON: Generating and Detecting Failure-prone Instruction Sequences via Stochastic Search.

[DOI]

Ioannis Tsiokanos

,

,

Giorgis Georgakoudis

,

Dimitrios S. Nikolopoulos

,

Georgios Karakonstantis

Proceedings of the 2020 Design, Automation & Test in Europe Conference & Exhibition, 2020

Fast Analysis and Prediction in Large Scale Virtual Machines Resource Utilisation.

[DOI]

Abdullahi Abubakar

,

Sakil Barbhuiya

,

Peter Kilpatrick

,

,

Dimitrios S. Nikolopoulos

Proceedings of the 10th International Conference on Cloud Computing and Services Science, 2020

Cross Architectural Power Modelling.

[DOI]

,

Peter Kilpatrick

,

Dimitrios S. Nikolopoulos

,

Blesson Varghese

Proceedings of the 20th IEEE/ACM International Symposium on Cluster, 2020

HaRMony: Heterogeneous-Reliability Memory and QoS-Aware Energy Management on Virtualized Servers.

[DOI]

Konstantinos Tovletoglou

,

,

Dimitrios S. Nikolopoulos

,

Georgios Karakonstantis

Proceedings of the ASPLOS '20: Architectural Support for Programming Languages and Operating Systems, 2020

2019

Hyperqueues: Design and Implementation of Deterministic Concurrent Queues.

[DOI]

Hans Vandierendonck

,

Dimitrios S. Nikolopoulos

ACM Trans. Parallel Comput., 2019

Fast and Energy-Efficient OLAP Data Management on Hybrid Main Memory Systems.

[DOI]

,

Dimitrios S. Nikolopoulos

,

Hans Vandierendonck

IEEE Trans. Computers, 2019

Shimmer: Implementing a Heterogeneous-Reliability DRAM Framework on a Commodity Server.

[DOI]

Konstantinos Tovletoglou

,

,

Dimitrios S. Nikolopoulos

,

Georgios Karakonstantis

IEEE Comput. Archit. Lett., 2019

Implementing efficient message logging protocols as MPI application extensions.

[DOI]

,

Dimitrios S. Nikolopoulos

Proceedings of the 26th European MPI Users' Group Meeting, 2019

VEBO: a vertex- and edge-balanced ordering heuristic to load balance parallel graph processing.

[DOI]

,

Hans Vandierendonck

,

Dimitrios S. Nikolopoulos

Proceedings of the 24th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2019

SAFIRE: Scalable and Accurate Fault Injection for Parallel Multithreaded Applications.

[DOI]

Giorgis Georgakoudis

,

,

Hans Vandierendonck

,

Dimitrios S. Nikolopoulos

,

Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019

Workload-Aware DRAM Error Prediction using Machine Learning.

[DOI]

,

Konstantinos Tovletoglou

,

Hans Vandierendonck

,

Dimitrios S. Nikolopoulos

,

Georgios Karakonstantis

Proceedings of the IEEE International Symposium on Workload Characterization, 2019

TAPAS: Train-Less Accuracy Predictor for Architecture Search.

[DOI]

,

Florian Scheidegger

,

Giovanni Mariani

,

Dimitrios S. Nikolopoulos

,

Constantine Bekas

,

Adelmo Cristiano Innocenza Malossi

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Intra-Node Memory Safe GPU Co-Scheduling.

[DOI]

,

,

Dimitrios S. Nikolopoulos

,

Blesson Varghese

IEEE Trans. Parallel Distributed Syst., 2018

NanoStreams: A Microserver Architecture for Real-Time Analytics on Fast Data Streams.

[DOI]

Umar Ibrahim Minhas

,

Matthew Russell

,

Stelios Kaloutsakis

,

,

,

Giorgis Georgakoudis

,

,

Dimitrios S. Nikolopoulos

,

IEEE Trans. Multi Scale Comput. Syst., 2018

A taxonomy of task-based parallel programming technologies for high-performance computing.

[DOI]

,

,

,

Roman Iakymchuk

,

,

,

Philipp Gschwandtner

,

Pierre Lemarinier

,

Stefano Markidis

,

,

Thomas Fahringer

,

Kostas Katrinis

,

,

Dimitrios S. Nikolopoulos

J. Supercomput., 2018

DARE.

[DOI]

Charalampos Chalios

,

Giorgis Georgakoudis

,

Konstantinos Tovletoglou

,

Georgios Karakonstantis

,

Hans Vandierendonck

,

Dimitrios S. Nikolopoulos

Int. J. High Perform. Comput. Appl., 2018

Energy-Efficient Iterative Refinement Using Dynamic Precision.

[DOI]

,

Hans Vandierendonck

,

,

Gregory D. Peterson

,

Dimitrios S. Nikolopoulos

IEEE J. Emerg. Sel. Topics Circuits Syst., 2018

RADS: Real-time Anomaly Detection System for Cloud Data Centres.

[DOI]

Sakil Barbhuiya

,

Zafeirios C. Papazachos

,

Peter Kilpatrick

,

Dimitrios S. Nikolopoulos

CoRR, 2018

Energy-efficient localised rollback after failures via data flow analysis.

[DOI]

,

Kirk W. Cameron

,

Dimitrios S. Nikolopoulos

CoRR, 2018

Expediting assessments of database performance for streams of respiratory parameters.

[DOI]

Charles J. Gillan

,

Aleksandar Novakovic

,

Adele H. Marshall

,

Murali Shyamsundar

,

Dimitrios S. Nikolopoulos

Comput. Biol. Medicine, 2018

Characterization of HPC workloads on an ARMv8 based server under relaxed DRAM refresh and thermal stress.

[DOI]

,

Konstantinos Tovletoglou

,

Dimitrios S. Nikolopoulos

,

Georgios Karakonstantis

Proceedings of the 18th International Conference on Embedded Computer Systems: Architectures, 2018

The VINEYARD integrated framework for hardware accelerators in the cloud.

[DOI]

Christoforos Kachris

,

Dimitrios Soudris

,

Stelios Mavridis

,

Manolis Pavlidakis

,

Christi Symeonidou

,

Christos Kozanitis

,

,

,

Sharatchandra Varma Bogaraju

,

Hans Vandierendonck

,

Dimitrios S. Nikolopoulos

Proceedings of the 18th International Conference on Embedded Computer Systems: Architectures, 2018

Energy-efficient localised rollback via data flow analysis and frequency scaling.

[DOI]

,

Kirk W. Cameron

,

Dimitrios S. Nikolopoulos

Proceedings of the 25th European MPI Users' Group Meeting, 2018

Variation-Aware Pipelined Cores through Path Shaping and Dynamic Cycle Adjustment: Case Study on a Floating-Point Unit.

[DOI]

Ioannis Tsiokanos

,

,

Dimitrios S. Nikolopoulos

,

Georgios Karakonstantis

Proceedings of the International Symposium on Low Power Electronics and Design, 2018

Minimization of Timing Failures in Pipelined Designs via Path Shaping and Operand Truncation.

[DOI]

Ioannis Tsiokanos

,

,

Dimitrios S. Nikolopoulos

,

Georgios Karakonstantis

Proceedings of the 24th IEEE International Symposium on On-Line Testing And Robust System Design, 2018

DRAM Characterization under Relaxed Refresh Period Considering System Level Effects within a Commodity Server.

[DOI]

,

Konstantinos Tovletoglou

,

Dimitrios S. Nikolopoulos

,

Georgios Karakonstantis

Proceedings of the 24th IEEE International Symposium on On-Line Testing And Robust System Design, 2018

Userspace Hypervisor Data Characterization in Virtualized Environment.

[DOI]

,

Hans Vandierendonck

,

Georgios Karakonstantis

,

Dimitrios S. Nikolopoulos

Proceedings of the 24th IEEE International Conference on Parallel and Distributed Systems, 2018

Supporting Cloud IaaS Users in Detecting Performance-Based Violation for Streaming Applications.

[DOI]

,

,

Peter Kilpatrick

,

Ivor T. A. Spence

,

Dimitrios S. Nikolopoulos

Proceedings of the 2018 IEEE International Conference on Autonomic Computing, 2018

Code and Data Transformations to Address Garbage Collector Performance in Big Data Processing.

[DOI]

,

Hans Vandierendonck

,

Dimitrios S. Nikolopoulos

Proceedings of the 25th IEEE International Conference on High Performance Computing, 2018

The transprecision computing paradigm: Concept, design, and applications.

[DOI]

A. Cristiano I. Malossi

,

Michael Schaffner

,

,

Luca Gammaitoni

,

Giuseppe Tagliavini

,

Andrew P. J. Emerson

,

,

Dimitrios S. Nikolopoulos

,

,

Proceedings of the 2018 Design, Automation & Test in Europe Conference & Exhibition, 2018

An energy-efficient and error-resilient server ecosystem exceeding conservative scaling limits.

[DOI]

Proceedings of the 2018 Design, Automation & Test in Europe Conference & Exhibition, 2018

2017

FairGV: Fair and Fast GPU Virtualization.

[DOI]

,

Ivor T. A. Spence

,

Dimitrios S. Nikolopoulos

IEEE Trans. Parallel Distributed Syst., 2017

ALEA: A Fine-Grained Energy Profiling Tool.

[DOI]

,

Pavlos Petoumenos

,

,

Konstantinos Parasyris

,

Dimitrios S. Nikolopoulos

,

Bronis R. de Supinski

,

ACM Trans. Archit. Code Optim., 2017

SCALO: Scalability-Aware Parallelism Orchestration for Multi-Threaded Workloads.

[DOI]

Giorgis Georgakoudis

,

Hans Vandierendonck

,

,

Bronis R. de Supinski

,

Thomas Fahringer

,

Dimitrios S. Nikolopoulos

ACM Trans. Archit. Code Optim., 2017

Managed acceleration for In-Memory database analytic workloads.

[DOI]

,

,

Peter Kilpatrick

,

Dimitrios S. Nikolopoulos

Int. J. Parallel Emergent Distributed Syst., 2017

On the Virtualization of CUDA Based GPU Remoting on ARM and X86 Machines in the GVirtuS Framework.

[DOI]

Raffaele Montella

,

,

Giuliano Laccetti

,

,

,

Carmine Ferraro

,

Valentina Pelliccia

,

,

Ivor T. A. Spence

,

Dimitrios S. Nikolopoulos

Int. J. Parallel Program., 2017

GPU Virtualization and Scheduling Methods: A Comprehensive Survey.

[DOI]

,

Ivor T. A. Spence

,

Dimitrios S. Nikolopoulos

ACM Comput. Surv., 2017

Feasibility of Fog Computing.

[DOI]

Blesson Varghese

,

,

Dimitrios S. Nikolopoulos

,

CoRR, 2017

Dependency-Aware Rollback and Checkpoint-Restart for Distributed Task-Based Runtimes.

[DOI]

,

,

Konstantinos Tovletoglou

,

,

Dimitrios S. Nikolopoulos

,

Georgios Karakonstantis

,

CoRR, 2017

Error-Resilient Server Ecosystems for Edge and Cloud Datacenters.

[DOI]

Georgios Karakonstantis

,

Dimitrios S. Nikolopoulos

,

Dimitris Gizopoulos

,

,

Yiannakis Sazeides

,

Christos D. Antonopoulos

,

Srikumar Venugopal

,

Computer, 2017

REFINE: realistic fault injection via compiler-based instrumentation for accuracy, portability and speed.

[DOI]

Giorgis Georgakoudis

,

,

Dimitrios S. Nikolopoulos

,

Proceedings of the International Conference for High Performance Computing, 2017

Access-aware DRAM failure-rate estimation under relaxed refresh operations.

[DOI]

Konstantinos Tovletoglou

,

Dimitrios S. Nikolopoulos

,

Georgios Karakonstantis

Proceedings of the 2017 International Conference on Embedded Computer Systems: Architectures, 2017

A Taxonomy of Task-Based Technologies for High-Performance Computing.

[DOI]

,

,

,

Roman Iakymchuk

,

,

Philipp Gschwandtner

,

Pierre Lemarinier

,

Stefano Markidis

,

,

,

Kostas Katrinis

,

Dimitrios S. Nikolopoulos

,

Thomas Fahringer

Proceedings of the Parallel Processing and Applied Mathematics, 2017

Incremental Training of Deep Convolutional Neural Networks.

[DOI]

,

A. Cristiano I. Malossi

,

,

Dimitrios S. Nikolopoulos

Proceedings of the International Workshop on Automatic Selection, 2017

Edge-as-a-Service: Towards Distributed Cloud Architectures.

[DOI]

Blesson Varghese

,

,

,

Dimitrios S. Nikolopoulos

Proceedings of the Parallel Computing is Everywhere, 2017

Power Modelling for Heterogeneous Cloud-Edge Data Centers.

[DOI]

,

Blesson Varghese

,

Peter Kilpatrick

,

Dimitrios S. Nikolopoulos

Proceedings of the Parallel Computing is Everywhere, 2017

MiniSymposium on Edge Computing.

[DOI]

Christos D. Antonopoulos

,

Dimitrios S. Nikolopoulos

Proceedings of the Parallel Computing is Everywhere, 2017

Relaxing DRAM refresh rate through access pattern scheduling: A case study on stencil-based algorithms.

[DOI]

Konstantinos Tovletoglou

,

Dimitrios S. Nikolopoulos

,

Georgios Karakonstantis

Proceedings of the 23rd IEEE International Symposium on On-Line Testing and Robust System Design, 2017

GraphGrind: addressing load imbalance of graph partitioning.

[DOI]

,

Hans Vandierendonck

,

Dimitrios S. Nikolopoulos

Proceedings of the International Conference on Supercomputing, 2017

Accelerating Graph Analytics by Utilising the Memory Locality of Graph Partitioning.

[DOI]

,

Hans Vandierendonck

,

Dimitrios S. Nikolopoulos

Proceedings of the 46th International Conference on Parallel Processing, 2017

Using Docker Swarm with a User-Centric Decision-Making Framework for Cloud Application Migration.

[DOI]

,

Peter Kilpatrick

,

Ivor T. A. Spence

,

Dimitrios S. Nikolopoulos

Proceedings of the Cloud Computing and Service Science - 7th International Conference, 2017

MyMinder: A User-centric Decision Making Framework for Intercloud Migration.

[DOI]

,

Peter Kilpatrick

,

Ivor T. A. Spence

,

Dimitrios S. Nikolopoulos

Proceedings of the CLOSER 2017, 2017

2016

Special issue on Disruptive technologies for energy efficient computing.

[DOI]

,

,

Dimitrios S. Nikolopoulos

Sustain. Comput. Informatics Syst., 2016

Editorial of the Special issue: SI: E2SC.

[DOI]

,

,

Dimitrios S. Nikolopoulos

Parallel Comput., 2016

Exploiting Significance of Computations for Energy-Constrained Approximate Computing.

[DOI]

Vassilis Vassiliadis

,

Charalampos Chalios

,

Konstantinos Parasyris

,

Christos D. Antonopoulos

,

,

Nikolaos Bellas

,

Hans Vandierendonck

,

Dimitrios S. Nikolopoulos

Int. J. Parallel Program., 2016

Evaluating fault tolerance on asymmetric multicore systems-on-chip using iso-metrics.

[DOI]

Charalampos Chalios

,

Dimitrios S. Nikolopoulos

,

Sandra Catalán

,

Enrique S. Quintana-Ortí

IET Comput. Digit. Tech., 2016

Energy Optimization of Memory Intensive Parallel workloads.

[DOI]

,

Hans Vandierendonck

,

Georgios Karakonstantis

,

Dimitrios S. Nikolopoulos

CoRR, 2016

Myrmics: Scalable, Dependency-aware Task Scheduling on Heterogeneous Manycores.

[DOI]

,

Polyvios Pratikakis

,

Iakovos Mavroidis

,

Dimitrios S. Nikolopoulos

CoRR, 2016

BDDT-SCC: A Task-parallel Runtime for Non Cache-Coherent Multicores.

[DOI]

Alexandros Labrineas

,

Polyvios Pratikakis

,

Dimitrios S. Nikolopoulos

,

CoRR, 2016

TwinCG: Dual Thread Redundancy with Forward Recovery for Conjugate Gradient Methods.

[DOI]

,

Dimitrios S. Nikolopoulos

CoRR, 2016

Methods and metrics for fair server assessment under real-time financial workloads.

[DOI]

Giorgis Georgakoudis

,

Charles J. Gillan

,

,

Ivor T. A. Spence

,

,

Dimitrios S. Nikolopoulos

Concurr. Comput. Pract. Exp., 2016

Brief Announcement: Energy Optimization of Memory Intensive Parallel Workloads.

[DOI]

,

Hans Vandierendonck

,

Georgios Karakonstantis

,

Dimitrios S. Nikolopoulos

Proceedings of the 28th ACM Symposium on Parallelism in Algorithms and Architectures, 2016

Challenges and Opportunities in Edge Computing.

[DOI]

Blesson Varghese

,

,

Sakil Barbhuiya

,

Peter Kilpatrick

,

Dimitrios S. Nikolopoulos

Proceedings of the 2016 IEEE International Conference on Smart Cloud, 2016

Runtime support for adaptive power capping on heterogeneous SoCs.

[DOI]

,

Dimitrios S. Nikolopoulos

,

Proceedings of the International Conference on Embedded Computer Systems: Architectures, 2016

NanoStreams: Codesigned microservers for edge analytics in real time.

[DOI]

Proceedings of the International Conference on Embedded Computer Systems: Architectures, 2016

VarSys Introduction.

[DOI]

Kirk W. Cameron

,

,

Dimitrios S. Nikolopoulos

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

A Scalable Runtime for the ECOSCALE Heterogeneous Exascale Hardware Platform.

[DOI]

,

Konstantin Bakanov

,

Ivor T. A. Spence

,

Dimitrios S. Nikolopoulos

Proceedings of the 6th International Workshop on Runtime and Operating Systems for Supercomputers, 2016

Operator and Workflow Optimization for High-Performance Analytics.

[DOI]

Hans Vandierendonck

,

Karen L. Murphy

,

,

,

Dimitrios S. Nikolopoulos

Proceedings of the Workshops of the EDBT/ICDT 2016 Joint Conference, 2016

ECOSCALE: Reconfigurable computing and runtime system for future exascale systems.

[DOI]

Iakovos Mavroidis

,

Ioannis Papaefstathiou

,

Luciano Lavagno

,

Dimitrios S. Nikolopoulos

,

,

,

Ioannis Sourdis

,

Vassilis Papaefstathiou

,

Marcello Coppola

,

Manuel Palomino

Proceedings of the 2016 Design, Automation & Test in Europe Conference & Exhibition, 2016

TwinPCG: Dual Thread Redundancy with forward Recovery for Preconditioned Conjugate Gradient Methods.

[DOI]

,

Dimitrios S. Nikolopoulos

Proceedings of the 2016 IEEE International Conference on Cluster Computing, 2016

HPTA: High-performance text analytics.

[DOI]

Hans Vandierendonck

,

Karen L. Murphy

,

,

Dimitrios S. Nikolopoulos

Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

Big data availability: Selective partial checkpointing for in-memory database queries.

[DOI]

Daniel Playfair

,

,

,

Dimitrios S. Nikolopoulos

Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

A scalable and composable map-reduce system.

[DOI]

,

Hans Vandierendonck

,

Dimitrios S. Nikolopoulos

,

Bronis R. de Supinski

Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

Low-Cost Hardware Infrastructure for Runtime Thread Level Energy Accounting.

[DOI]

,

,

,

Alexandru Amaricai

,

,

,

,

Giorgis Georgakoudis

,

Dimitrios S. Nikolopoulos

,

Cosmin Cernazanu-Glavan

,

,

Proceedings of the Architecture of Computing Systems - ARCS 2016, 2016

The VINEYARD Approach: Versatile, Integrated, Accelerator-Based, Heterogeneous Data Centres.

[DOI]

Christoforos Kachris

,

Dimitrios Soudris

,

Georgi Gaydadjiev

,

,

Dimitrios S. Nikolopoulos

,

,

,

Christos Strydis

,

Christos Tsalidis

,

,

Ricardo Jiménez-Peris

,

Alexandre Almeida

Proceedings of the Applied Reconfigurable Computing - 12th International Symposium, 2016

A Capital Market Metaphor for Content Delivery Network Resources.

[DOI]

,

Dimitrios S. Nikolopoulos

,

Stathes Hadjiefthymiades

Proceedings of the 30th IEEE International Conference on Advanced Information Networking and Applications, 2016

2015

Realizing Accelerated Cost-Effective Distributed RAID.

[DOI]

Aleksandr Khasymski

,

M. Mustafa Rafique

,

,

Sudharshan S. Vazhkudai

,

Dimitrios S. Nikolopoulos

Proceedings of the Handbook on Data Centers, 2015

TProf: An energy profiler for task-parallel programs.

[DOI]

Ioannis Manousakis

,

Foivos S. Zakkak

,

Polyvios Pratikakis

,

Dimitrios S. Nikolopoulos

Sustain. Comput. Informatics Syst., 2015

Iso-Quality of Service: Fairly Ranking Servers for Real-Time Data Analytics.

[DOI]

Giorgis Georgakoudis

,

,

,

Ivor T. A. Spence

,

,

Dimitrios S. Nikolopoulos

Parallel Process. Lett., 2015

Scalable black-box prediction models for multi-dimensional adaptation on NUMA multi-cores.

[DOI]

Aleksandr Khasymski

,

Dimitrios S. Nikolopoulos

Int. J. Parallel Emergent Distributed Syst., 2015

On the potential of significance-driven execution for energy-aware HPC.

[DOI]

Philipp Gschwandtner

,

Charalampos Chalios

,

Dimitrios S. Nikolopoulos

,

Hans Vandierendonck

,

Thomas Fahringer

Comput. Sci. Res. Dev., 2015

Guest Editorial.

[DOI]

José L. Núñez-Yáñez

,

,

Dimitrios S. Nikolopoulos

IET Comput. Digit. Tech., 2015

Evaluating Asymmetric Multicore Systems-on-Chip using Iso-Metrics.

[DOI]

Charalampos Chalios

,

Dimitrios S. Nikolopoulos

,

Enrique S. Quintana-Ortí

CoRR, 2015

On the Energy-Efficiency of Byte-Addressable Non-Volatile Memory.

[DOI]

Hans Vandierendonck

,

,

Dimitrios S. Nikolopoulos

IEEE Comput. Archit. Lett., 2015

Towards automated data-driven model creation for cloud computing simulation.

[DOI]

Sergej Svorobej

,

,

,

,

Christian Stier

,

Henning Groenda

,

Zafeirios C. Papazachos

,

Dimitrios S. Nikolopoulos

Proceedings of the 8th International Conference on Simulation Tools and Techniques, 2015

A programming model and runtime system for significance-aware energy-efficient computing.

[DOI]

Vassilis Vassiliadis

,

Konstantinos Parasyris

,

Charalambos Chalios

,

Christos D. Antonopoulos

,

,

Nikolaos Bellas

,

Hans Vandierendonck

,

Dimitrios S. Nikolopoulos

Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015

Mini-Symposium on Energy and Resilience in Parallel Programming.

[DOI]

Dimitrios S. Nikolopoulos

,

Christos D. Antonopoulos

Proceedings of the Parallel Computing: On the Road to Exascale, 2015

Performance and Fault Tolerance of Preconditioned Iterative Solvers on Low-Power ARM Architectures.

[DOI]

José Ignacio Aliaga

,

Sandra Catalán

,

Charalampos Chalios

,

Dimitrios S. Nikolopoulos

,

Enrique S. Quintana-Ortí

Proceedings of the Parallel Computing: On the Road to Exascale, 2015

HpMC: An Energy-aware Management System of Multi-level Memory Architectures.

[DOI]

,

,

,

Kirk W. Cameron

,

Bronis R. de Supinski

,

,

Dimitrios S. Nikolopoulos

Proceedings of the 2015 International Symposium on Memory Systems, 2015

Application-Level Energy Awareness for OpenMP.

[DOI]

Ferdinando Alessi

,

,

Giorgis Georgakoudis

,

Thomas Fahringer

,

Dimitrios S. Nikolopoulos

Proceedings of the OpenMP: Heterogenous Execution and Data Movements, 2015

Power Capping: What Works, What Does Not.

[DOI]

Pavlos Petoumenos

,

,

,

,

Dimitrios S. Nikolopoulos

Proceedings of the 21st IEEE International Conference on Parallel and Distributed Systems, 2015

Energy-Efficient In-Memory Data Stores on Hybrid Memory Hierarchies.

[DOI]

,

Hans Vandierendonck

,

Dimitrios S. Nikolopoulos

Proceedings of the 11th International Workshop on Data Management on New Hardware, 2015

LS-ADT: Lightweight and Scalable Anomaly Detection for Cloud Datacentres.

[DOI]

Sakil Barbhuiya

,

Zafeirios C. Papazachos

,

Peter Kilpatrick

,

Dimitrios S. Nikolopoulos

Proceedings of the Cloud Computing and Services Science - 5th International Conference, 2015

A Lightweight Tool for Anomaly Detection in Cloud Data Centres.

[DOI]

Sakil Barbhuiya

,

Zafeirios C. Papazachos

,

Peter Kilpatrick

,

Dimitrios S. Nikolopoulos

Proceedings of the CLOSER 2015, 2015

A significance-driven programming framework for energy-constrained approximate computing.

[DOI]

Vassilis Vassiliadis

,

Charalampos Chalios

,

Konstantinos Parasyris

,

Christos D. Antonopoulos

,

,

Nikolaos Bellas

,

Hans Vandierendonck

,

Dimitrios S. Nikolopoulos

Proceedings of the 12th ACM International Conference on Computing Frontiers, 2015

Software-managed energy-efficient hybrid DRAM/NVM main memory.

[DOI]

,

Hans Vandierendonck

,

Dimitrios S. Nikolopoulos

Proceedings of the 12th ACM International Conference on Computing Frontiers, 2015

ALEA: Fine-Grain Energy Profiling with Basic Block Sampling.

[DOI]

,

Dimitrios S. Nikolopoulos

,

Bronis R. de Supinski

Proceedings of the 2015 International Conference on Parallel Architectures and Compilation, 2015

Energy-Efficient Hybrid DRAM/NVM Main Memory.

[DOI]

,

Hans Vandierendonck

,

Dimitrios S. Nikolopoulos

Proceedings of the 2015 International Conference on Parallel Architectures and Compilation, 2015

2014

Hybrid address spaces: A methodology for implementing scalable high-level programming models on non-coherent many-core architectures.

[DOI]

Anastasios Papagiannis

,

Dimitrios S. Nikolopoulos

J. Syst. Softw., 2014

FPGA prototyping of emerging manycore architectures for parallel programming research using Formic boards.

[DOI]

,

George Kalokerinos

,

Michalis Lygerakis

,

Vassilis Papaefstathiou

,

Iakovos Mavroidis

,

Manolis Katevenis

,

Dionisios N. Pnevmatikatos

,

Dimitrios S. Nikolopoulos

J. Syst. Archit., 2014

Distributed region-based memory allocation and synchronization.

[DOI]

Christi Symeonidou

,

Polyvios Pratikakis

,

Dimitrios S. Nikolopoulos

,

Int. J. High Perform. Comput. Appl., 2014

Energy Efficiency through Significance-Based Computing.

[DOI]

Dimitrios S. Nikolopoulos

,

Hans Vandierendonck

,

Nikolaos Bellas

,

Christos D. Antonopoulos

,

,

Georgios Karakonstantis

,

,

Computer, 2014

On the viability of microservers for financial analytics.

[DOI]

Charles J. Gillan

,

Dimitrios S. Nikolopoulos

,

Giorgis Georgakoudis

,

,

George Tzenakis

,

Ivor T. A. Spence

Proceedings of the 7th Workshop on High Performance Computational Finance, 2014

Fast Dynamic Binary Rewriting for flexible thread migration on shared-ISA heterogeneous MPSoCs.

[DOI]

Giorgis Georgakoudis

,

Dimitrios S. Nikolopoulos

,

Hans Vandierendonck

,

Proceedings of the XIVth International Conference on Embedded Computer Systems: Architectures, 2014

Overcoming the Scalability Challenges of Epidemic Simulations on Blue Waters.

[DOI]

,

Abhinav Bhatele

,

Keith R. Bisset

,

,

,

Laxmikant V. Kalé

,

Madhav V. Marathe

,

Dimitrios S. Nikolopoulos

,

,

Lukasz Wesolowski

Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014

Power-capped DVFS and thread allocation with ANN models on modern NUMA systems.

[DOI]

Satoshi Imamura

,

,

,

Dimitrios S. Nikolopoulos

Proceedings of the 32nd IEEE International Conference on Computer Design, 2014

Power modelling and capping for heterogeneous ARM/FPGA SoCs.

[DOI]

,

José L. Núñez-Yáñez

,

,

Dimitrios S. Nikolopoulos

Proceedings of the 2014 International Conference on Field-Programmable Technology, 2014

The CACTOS Vision of Context-Aware Cloud Topology Optimization and Simulation.

[DOI]

Proceedings of the IEEE 6th International Conference on Cloud Computing Technology and Science, 2014

2013

Strategies for Energy-Efficient Resource Management of Hybrid Programming Models.

[DOI]

,

Bronis R. de Supinski

,

,

Dimitrios S. Nikolopoulos

,

Kirk W. Cameron

IEEE Trans. Parallel Distributed Syst., 2013

Analysis of dependence tracking algorithms for task dataflow execution.

[DOI]

Hans Vandierendonck

,

George Tzenakis

,

Dimitrios S. Nikolopoulos

ACM Trans. Archit. Code Optim., 2013

Deterministic scale-free pipeline parallelism with hyperqueues.

[DOI]

Hans Vandierendonck

,

Kallia Chronaki

,

Dimitrios S. Nikolopoulos

Proceedings of the International Conference for High Performance Computing, 2013

DRASync: distributed region-based memory allocation and synchronization.

[DOI]

Christi Symeonidou

,

Polyvios Pratikakis

,

,

Dimitrios S. Nikolopoulos

Proceedings of the 20th European MPI Users's Group Meeting, 2013

Prefetching and cache management using task lifetimes.

[DOI]

Vassilis Papaefstathiou

,

Manolis Katevenis

,

Dimitrios S. Nikolopoulos

,

Dionisios N. Pnevmatikatos

Proceedings of the International Conference on Supercomputing, 2013

Topic 1: Support Tools and Environments - (Introduction).

[DOI]

Bronis R. de Supinski

,

Bettina Krammer

,

Karl Fürlinger

,

,

Dimitrios S. Nikolopoulos

Proceedings of the Euro-Par 2013 Parallel Processing, 2013

Fast dynamic binary rewriting to support thread migration in shared-ISA asymmetric multicores.

[DOI]

Giorgis Georgakoudis

,

Dimitrios S. Nikolopoulos

,

Proceedings of the First International Workshop on Code Optimisation for Multi and Many Cores, 2013

Inference and Declaration of Independence in Task-Parallel Programs.

[DOI]

Foivos S. Zakkak

,

Dimitrios Chasapis

,

Polyvios Pratikakis

,

,

Dimitrios S. Nikolopoulos

Proceedings of the Advanced Parallel Processing Technologies, 2013

BDDT: Block-Level Dynamic Dependence Analysis for Task-Based Parallelism.

[DOI]

George Tzenakis

,

Angelos Papatriantafyllou

,

Hans Vandierendonck

,

Polyvios Pratikakis

,

Dimitrios S. Nikolopoulos

Proceedings of the Advanced Parallel Processing Technologies, 2013

2012

Critical path-based thread placement for NUMA systems.

[DOI]

,

,

Dimitrios S. Nikolopoulos

,

,

Kirk W. Cameron

,

Bronis R. de Supinski

SIGMETRICS Perform. Evaluation Rev., 2012

EPC: a power instrumentation controller for embedded applications.

[DOI]

Ioannis Manousakis

,

Dimitrios S. Nikolopoulos

SIGBED Rev., 2012

Cache-Integrated Network Interfaces: Flexible On-Chip Communication and Synchronization for Large-Scale CMPs.

[DOI]

Stamatis G. Kavadias

,

Manolis Katevenis

,

Michail Zampetakis

,

Dimitrios S. Nikolopoulos

Int. J. Parallel Program., 2012

BTL: A Framework for Measuring and Modeling Energy in Memory Hierarchies.

[DOI]

Ioannis Manousakis

,

Dimitrios S. Nikolopoulos

Proceedings of the IEEE 24th International Symposium on Computer Architecture and High Performance Computing, 2012

BDDT: : block-level dynamic dependence analysis for deterministic task-based parallelism.

[DOI]

George Tzenakis

,

Angelos Papatriantafyllou

,

,

Polyvios Pratikakis

,

Hans Vandierendonck

,

Dimitrios S. Nikolopoulos

Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2012

On the Use of GPUs in Realizing Cost-Effective Distributed RAID.

[DOI]

Aleksandr Khasymski

,

M. Mustafa Rafique

,

,

Sudharshan S. Vazhkudai

,

Dimitrios S. Nikolopoulos

Proceedings of the 20th IEEE International Symposium on Modeling, 2012

The myrmics memory allocator: hierarchical, message-passing allocation for global address spaces.

[DOI]

,

Polyvios Pratikakis

,

Dimitrios S. Nikolopoulos

,

,

,

Bronis R. de Supinski

Proceedings of the International Symposium on Memory Management, 2012

Model-based, memory-centric performance and power optimization on NUMA multiprocessors.

[DOI]

,

,

Dimitrios S. Nikolopoulos

,

Kirk W. Cameron

,

Bronis R. de Supinski

,

Proceedings of the 2012 IEEE International Symposium on Workload Characterization, 2012

Dynamic binary rewriting and migration for shared-ISA asymmetric, multicore processors: summary.

[DOI]

Giorgis Georgakoudis

,

,

Dimitrios S. Nikolopoulos

Proceedings of the 21st International Symposium on High-Performance Parallel and Distributed Computing, 2012

Formic: Cost-efficient and Scalable Prototyping of Manycore Architectures.

[DOI]

,

George Kalokerinos

,

Michalis Lygerakis

,

Vassilis Papaefstathiou

,

Dimitrios Tsaliagkos

,

Manolis Katevenis

,

Dionisios N. Pnevmatikatos

,

Dimitrios S. Nikolopoulos

Proceedings of the 2012 IEEE 20th Annual International Symposium on Field-Programmable Custom Computing Machines, 2012

Topic 16: GPU and Accelerators Computing.

[DOI]

,

Dimitrios S. Nikolopoulos

,

,

Satoshi Matsuoka

Proceedings of the Euro-Par 2012 Parallel Processing - 18th International Conference, 2012

Inference and declaration of independence: impact on deterministic task parallelism.

[DOI]

Foivos S. Zakkak

,

Dimitrios Chasapis

,

Polyvios Pratikakis

,

,

Dimitrios S. Nikolopoulos

Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2012

2011

A capabilities-aware framework for using computational accelerators in data-intensive computing.

[DOI]

M. Mustafa Rafique

,

,

Dimitrios S. Nikolopoulos

J. Parallel Distributed Comput., 2011

Task-based parallel H.264 video encoding for explicit communication architectures.

[DOI]

Michail Alvanos

,

George Tzenakis

,

Dimitrios S. Nikolopoulos

,

Proceedings of the 2011 International Conference on Embedded Computer Systems: Architectures, 2011

A programming model for deterministic task parallelism.

[DOI]

Polyvios Pratikakis

,

Hans Vandierendonck

,

,

Dimitrios S. Nikolopoulos

Proceedings of the 2011 ACM SIGPLAN workshop on Memory Systems Performance and Correctness: held in conjunction with PLDI '11, 2011

Scalable Runtime Support for Data Intensive Applications on the Single Chip Cloud Computer.

[DOI]

Anastasios Papagiannis

,

Dimitrios S. Nikolopoulos

Proceedings of the 3rd Many-core Applications Research Community (MARC) Symposium. Proceedings of the 3rd MARC Symposium, 2011

Parallel Programming of General-Purpose Programs Using Task-Based Programming Models.

[DOI]

Hans Vandierendonck

,

Polyvios Pratikakis

,

Dimitrios S. Nikolopoulos

Proceedings of the 3rd USENIX Workshop on Hot Topics in Parallelism, 2011

Fine-grain OpenMP runtime support with explicit communication hardware primitives.

[DOI]

Pranav Tendulkar

,

Vassilis Papaefstathiou

,

George Nikiforos

,

Stamatis G. Kavadias

,

Dimitrios S. Nikolopoulos

,

Manolis Katevenis

Proceedings of the Design, Automation and Test in Europe, 2011

Scalable memory registration for high performance networks using helper threads.

[DOI]

,

Kirk W. Cameron

,

Dimitrios S. Nikolopoulos

,

Bronis R. de Supinski

,

Proceedings of the 8th Conference on Computing Frontiers, 2011

A Unified Scheduler for Recursive and Task Dataflow Parallelism.

[DOI]

Hans Vandierendonck

,

George Tzenakis

,

Dimitrios S. Nikolopoulos

Proceedings of the 2011 International Conference on Parallel Architectures and Compilation Techniques, 2011

2010

Explicit Communication and Synchronization in SARC.

[DOI]

Manolis Katevenis

,

Vassilis Papaefstathiou

,

Stamatis G. Kavadias

,

Dionisios N. Pnevmatikatos

,

,

Dimitrios S. Nikolopoulos

IEEE Micro, 2010

Strider: Runtime Support for Optimizing Strided Data Accesses on Multi-Cores with Explicitly Managed Memories.

[DOI]

,

Dimitrios S. Nikolopoulos

Proceedings of the Conference on High Performance Computing Networking, 2010

Hybrid MPI/OpenMP power-aware computing.

[DOI]

,

Bronis R. de Supinski

,

,

Kirk W. Cameron

,

Dimitrios S. Nikolopoulos

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Power-aware MPI task aggregation prediction for high-end computing systems.

[DOI]

,

Dimitrios S. Nikolopoulos

,

Kirk W. Cameron

,

Bronis R. de Supinski

,

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Rearchitecting MapReduce for Heterogeneous Multicore Processors with Explicitly Managed Memories.

[DOI]

Anastasios Papagiannis

,

Dimitrios S. Nikolopoulos

Proceedings of the 39th International Conference on Parallel Processing, 2010

<i>Tagged Procedure Calls</i> (<i>TPC</i>): Efficient Runtime Support for Task-Based Parallelism on the Cell Processor.

[DOI]

George Tzenakis

,

Konstantinos Kapelonis

,

Michail Alvanos

,

Konstantinos Koukos

,

Dimitrios S. Nikolopoulos

,

Proceedings of the High Performance Embedded Architectures and Compilers, 2010

Comparing Scalability Prediction Strategies on an SMP of CMPs.

[DOI]

,

Matthew Curtis-Maury

,

,

Filip Blagojevic

,

Dimitrios S. Nikolopoulos

,

Bronis R. de Supinski

,

Proceedings of the Euro-Par 2010 - Parallel Processing, 16th International Euro-Par Conference, Ischia, Italy, August 31, 2010

Evaluation of streaming aggregation on parallel hardware architectures.

[DOI]

Scott Schneider

,

Henrique Andrade

,

,

,

Dimitrios S. Nikolopoulos

Proceedings of the Fourth ACM International Conference on Distributed Event-Based Systems, 2010

On-chip communication and synchronization mechanisms with cache-integrated network interfaces.

[DOI]

Stamatis G. Kavadias

,

Manolis Katevenis

,

Michail Zampetakis

,

Dimitrios S. Nikolopoulos

Proceedings of the 7th Conference on Computing Frontiers, 2010

Designing Accelerator-Based Distributed Systems for High Performance.

[DOI]

M. Mustafa Rafique

,

,

Dimitrios S. Nikolopoulos

Proceedings of the 10th IEEE/ACM International Conference on Cluster, 2010

2009

Supporting MapReduce on large-scale asymmetric multi-core clusters.

[DOI]

M. Mustafa Rafique

,

,

,

Dimitrios S. Nikolopoulos

ACM SIGOPS Oper. Syst. Rev., 2009

Algorithm, software, and hardware optimizations for Delaunay mesh generation on simultaneous multithreaded architectures.

[DOI]

Christos D. Antonopoulos

,

Filip Blagojevic

,

Andrey N. Chernikov

,

Nikos Chrisochoides

,

Dimitrios S. Nikolopoulos

J. Parallel Distributed Comput., 2009

A multigrain Delaunay mesh generation method for multicore SMT-based architectures.

[DOI]

Christos D. Antonopoulos

,

Filip Blagojevic

,

Andrey N. Chernikov

,

Nikos Chrisochoides

,

Dimitrios S. Nikolopoulos

J. Parallel Distributed Comput., 2009

Green Building Blocks - Software Stacks for Energy-Efficient Clusters and Data Centres.

[DOI]

Dimitrios S. Nikolopoulos

ERCIM News, 2009

Programming Multiprocessors with Explicitly Managed Memory Hierarchies.

[DOI]

Scott Schneider

,

,

Dimitrios S. Nikolopoulos

Computer, 2009

A comparison of programming models for multiprocessors with explicitly managed memory hierarchies.

[DOI]

Scott Schneider

,

,

,

John C. Linford

,

,

Dimitrios S. Nikolopoulos

Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2009

CellMR: A framework for supporting mapreduce on asymmetric cell-based clusters.

[DOI]

M. Mustafa Rafique

,

,

,

Dimitrios S. Nikolopoulos

Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

Scheduling dynamic parallelism on accelerators.

[DOI]

Filip Blagojevic

,

,

Katherine A. Yelick

,

Matthew Curtis-Maury

,

Dimitrios S. Nikolopoulos

,

Proceedings of the 6th Conference on Computing Frontiers, 2009

2008

Prediction-Based Power-Performance Adaptation of Multithreaded Scientific Codes.

[DOI]

Matthew Curtis-Maury

,

Filip Blagojevic

,

Christos D. Antonopoulos

,

Dimitrios S. Nikolopoulos

IEEE Trans. Parallel Distributed Syst., 2008

Set-Top Supercomputing: Scalable Software for Scientific Simulations on GameConsoles.

[DOI]

Dimitrios S. Nikolopoulos

ERCIM News, 2008

VT-ASOS: Holistic system software customization for many cores.

[DOI]

Dimitrios S. Nikolopoulos

,

,

Jyotirmaya Tripathi

,

Matthew Curtis-Maury

Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Modeling Multigrain Parallelism on Heterogeneous Multi-core Processors: A Case Study of the Cell BE.

[DOI]

Filip Blagojevic

,

,

Kirk W. Cameron

,

Dimitrios S. Nikolopoulos

Proceedings of the High Performance Embedded Architectures and Compilers, 2008

DMA-based prefetching for i/o-intensive workloads on the cell architecture.

[DOI]

M. Mustafa Rafique

,

,

Dimitrios S. Nikolopoulos

Proceedings of the 5th Conference on Computing Frontiers, 2008

Cell-SWat: modeling and scheduling wavefront computations on the cell broadband engine.

[DOI]

,

,

Filip Blagojevic

,

Dimitrios S. Nikolopoulos

Proceedings of the 5th Conference on Computing Frontiers, 2008

Scheduling Asymmetric Parallelism on a PlayStation3 Cluster.

[DOI]

Filip Blagojevic

,

Matthew Curtis-Maury

,

,

Scott Schneider

,

Dimitrios S. Nikolopoulos

Proceedings of the 8th IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2008), 2008

Prediction models for multi-dimensional power-performance optimization on many cores.

[DOI]

Matthew Curtis-Maury

,

,

Filip Blagojevic

,

Dimitrios S. Nikolopoulos

,

Bronis R. de Supinski

,

Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques, 2008

2007

Exploring New Search Algorithms and Hardware for Phylogenetics: RAxML Meets the IBM Cell.

[DOI]

Alexandros Stamatakis

,

Filip Blagojevic

,

Dimitrios S. Nikolopoulos

,

Christos D. Antonopoulos

J. VLSI Signal Process., 2007

Runtime scheduling of dynamic parallelism on accelerator-based multi-core systems.

[DOI]

Filip Blagojevic

,

Dimitrios S. Nikolopoulos

,

Alexandros Stamatakis

,

Christos D. Antonopoulos

,

Matthew Curtis-Maury

Parallel Comput., 2007

Runtime and Programming Support for Memory Adaptation in Scientific Applications via Local Disk and Remote Memory.

[DOI]

Richard Tran Mills

,

,

Andreas Stathopoulos

,

Dimitrios S. Nikolopoulos

J. Grid Comput., 2007

Dynamic multigrain parallelization on the cell broadband engine.

[DOI]

Filip Blagojevic

,

Dimitrios S. Nikolopoulos

,

Alexandros Stamatakis

,

Christos D. Antonopoulos

Proceedings of the 12th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2007

RAxML-Cell: Parallel Phylogenetic Tree Inference on the Cell Broadband Engine.

[DOI]

Filip Blagojevic

,

Alexandros Stamatakis

,

Christos D. Antonopoulos

,

Dimitrios S. Nikolopoulos

Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Identifying energy-efficient concurrency levels using machine learning.

[DOI]

Matthew Curtis-Maury

,

,

,

Filip Blagojevic

,

Dimitrios S. Nikolopoulos

,

Bronis R. de Supinski

,

Proceedings of the 2007 IEEE International Conference on Cluster Computing, 2007

A comparison of online and offline strategies for program adaptation.

[DOI]

Matthew Curtis-Maury

,

Christos D. Antonopoulos

,

Dimitrios S. Nikolopoulos

Proceedings of the 45th Annual Southeast Regional Conference, 2007

2006

PACMAN: A PerformAnce Counters MANager for Intel Hyperthreaded Processors.

[DOI]

Matthew Curtis-Maury

,

Dimitrios S. Nikolopoulos

,

Christos D. Antonopoulos

Proceedings of the Third International Conference on the Quantitative Evaluation of Systems (QEST 2006), 2006

Scalable locality-conscious multithreaded memory allocation.

[DOI]

Scott Schneider

,

Christos D. Antonopoulos

,

Dimitrios S. Nikolopoulos

Proceedings of the 5th International Symposium on Memory Management, 2006

MESA: reducing cache conflicts by integrating static and run-time methods.

[DOI]

,

Dimitrios S. Nikolopoulos

,

,

Proceedings of the 2006 IEEE International Symposium on Performance Analysis of Systems and Software, 2006

Facing the challenges of multicore processor technologies using autonomic system software.

[DOI]

Dimitrios S. Nikolopoulos

Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

Online strategies for high-performance power-aware thread execution on emerging multiprocessors.

[DOI]

Matthew Curtis-Maury

,

,

Christos D. Antonopoulos

,

Dimitrios S. Nikolopoulos

Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

Online power-performance adaptation of multithreaded programs using hardware event-based prediction.

[DOI]

Matthew Curtis-Maury

,

,

Christos D. Antonopoulos

,

Dimitrios S. Nikolopoulos

Proceedings of the 20th Annual International Conference on Supercomputing, 2006

Runtime Support for Memory Adaptation in Scientific Applications via Local Disk and Remote Memory.

[DOI]

,

Richard Tran Mills

,

Andreas Stathopoulos

,

Dimitrios S. Nikolopoulos

Proceedings of the 15th IEEE International Symposium on High Performance Distributed Computing, 2006

2005

An Evaluation of OpenMP on Current and Emerging Multithreaded/Multicore Processors.

[DOI]

Matthew Curtis-Maury

,

,

Christos D. Antonopoulos

,

Dimitrios S. Nikolopoulos

Proceedings of the OpenMP Shared Memory Parallel Programming - International Workshops, 2005

Scheduling Algorithms for Effective Thread Pairing on Hybrid Multiprocessors.

[DOI]

Robert L. McGregor

,

Christos D. Antonopoulos

,

Dimitrios S. Nikolopoulos

Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005

Multigrain parallel Delaunay Mesh generation: challenges and opportunities for multithreaded architectures.

[DOI]

Christos D. Antonopoulos

,

,

Andrey N. Chernikov

,

Filip Blagojevic

,

Dimitrios S. Nikolopoulos

,

Nikos Chrisochoides

Proceedings of the 19th Annual International Conference on Supercomputing, 2005

Factory: An Object-Oriented Parallel Programming Substrate for Deep Multiprocessors.

[DOI]

Scott Schneider

,

Christos D. Antonopoulos

,

Dimitrios S. Nikolopoulos

Proceedings of the High Performance Computing and Communications, 2005

smt- <i>SPRINTS</i>: <i>S</i>oftware <i>Pr</i>ecomputation with <i>Int</i>elligent Streaming for Resource-Constrained SMTs.

[DOI]

,

Christos D. Antonopoulos

,

Dimitrios S. Nikolopoulos

Proceedings of the Euro-Par 2005, Parallel Processing, 11th International Euro-Par Conference, Lisbon, Portugal, August 30, 2005

2004

Dynamic tiling for effective use of shared caches on multithreaded processors.

[DOI]

Dimitrios S. Nikolopoulos

Int. J. High Perform. Comput. Netw., 2004

Runtime support for integrating precomputation and thread-level parallelism on simultaneous multithreaded processors.

[DOI]

,

Filip Blagojevic

,

Dimitrios S. Nikolopoulos

Proceedings of the 7th Workshop on languages, 2004

Adapting to Memory Pressure from within Scientific Applications on Multiprogrammed COWs.

[DOI]

Richard Tran Mills

,

Andreas Stathopoulos

,

Dimitrios S. Nikolopoulos

Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

Realistic Workload Scheduling Policies for Taming the Memory Bandwidth Bottleneck of SMPs.

[DOI]

Christos D. Antonopoulos

,

Dimitrios S. Nikolopoulos

,

Theodore S. Papatheodorou

Proceedings of the High Performance Computing, 2004

2003

Scaling non-regular shared-memory codes by reusing custom loop schedules.

[DOI]

Dimitrios S. Nikolopoulos

,

,

Eduard Ayguadé

,

Sci. Program., 2003

Quantifying contention and balancing memory load on hardware DSM multiprocessors.

[DOI]

Dimitrios S. Nikolopoulos

J. Parallel Distributed Comput., 2003

Adaptive scheduling under memory constraints on non-dedicated computationalfarms.

[DOI]

Dimitrios S. Nikolopoulos

,

Constantine D. Polychronopoulos

Future Gener. Comput. Syst., 2003

Code and Data Transformations for Improving Shared Cache Performance on SMT Processors.

[DOI]

Dimitrios S. Nikolopoulos

Proceedings of the High Performance Computing, 5th International Symposium, 2003

Malleable Memory Mapping: User-Level Control of Memory Bounds for Effective Program Adaptation.

[DOI]

Dimitrios S. Nikolopoulos

Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

Scheduling Algorithms with Bus Bandwidth Considerations for SMPs.

[DOI]

Christos D. Antonopoulos

,

Dimitrios S. Nikolopoulos

,

Theodore S. Papatheodorou

Proceedings of the 32nd International Conference on Parallel Processing (ICPP 2003), 2003

2002

Scheduler-Activated Dynamic Page Migration for Multiprogrammed DSM Multiprocessors.

[DOI]

Dimitrios S. Nikolopoulos

,

Constantine D. Polychronopoulos

,

Theodore S. Papatheodorou

,

,

Eduard Ayguadé

J. Parallel Distributed Comput., 2002

Runtime vs. Manual Data Distribution for Architecture-Agnostic Shared-Memory Programming Models.

[DOI]

Dimitrios S. Nikolopoulos

,

Eduard Ayguadé

,

Constantine D. Polychronopoulos

Int. J. Parallel Program., 2002

Adaptive Scheduling under Memory Pressure on Multiprogrammed SMPs.

[DOI]

Dimitrios S. Nikolopoulos

,

Constantine D. Polychronopoulos

Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002

Quantifying and Resolving Remote Memory Access Contention on Hardware DSM Multiprocessors.

[DOI]

Dimitrios S. Nikolopoulos

Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002

Effective Cross-Platform, Multilevel Parallelism via Dynamic Adaptive Execution.

[DOI]

,

Mark N. Yankelevsky

,

Dimitrios S. Nikolopoulos

,

Constantine D. Polychronopoulos

Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002

Adaptive Scheduling under Memory Pressure on Multiprogrammed Cluster.

[DOI]

Dimitrios S. Nikolopoulos

,

Constantine D. Polychronopoulos

Proceedings of the 2nd IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2002), 2002

2001

Exploiting memory affinity in OpenMP through schedule reuse.

[DOI]

Dimitrios S. Nikolopoulos

,

,

Eduard Ayguadé

,

SIGARCH Comput. Archit. News, 2001

The Architectural and Operating System Implications on the Performance of Synchronization on ccNUMA Multiprocessors.

[DOI]

Dimitrios S. Nikolopoulos

,

Theodore S. Papatheodorou

Int. J. Parallel Program., 2001

A Study of Implicit Data Distribution Methods for OpenMP Using the SPEC Benchmarks.

[DOI]

Dimitrios S. Nikolopoulos

,

Eduard Ayguadé

Proceedings of the OpenMP Shared Memory Parallel Programming, 2001

Scaling irregular parallel codes with minimal programming effort.

[DOI]

Dimitrios S. Nikolopoulos

,

Constantine D. Polychronopoulos

,

Eduard Ayguadé

Proceedings of the 2001 ACM/IEEE conference on Supercomputing, 2001

The trade-off between implicit and explicit data distribution in shared-memory programming paradigms.

[DOI]

Dimitrios S. Nikolopoulos

,

Eduard Ayguadé

,

Theodore S. Papatheodorou

,

Constantine D. Polychronopoulos

,

Proceedings of the 15th international conference on Supercomputing, 2001

Informing Algorithms for Efficient Scheduling of Synchronizing Threads on Multiprogrammed SMPs.

[DOI]

Christos D. Antonopoulos

,

Dimitrios S. Nikolopoulos

,

Theodore S. Papatheodorou

Proceedings of the 2001 International Conference on Parallel Processing, 2001

Improving Java Server Performance with Interruptlets.

[DOI]

,

,

,

Dimitrios S. Nikolopoulos

,

Constantine D. Polychronopoulos

Proceedings of the Computational Science - ICCS 2001, 2001

A Transparent Operating System Infrastructure for Embedding Adaptability to Thread-Based Programming Models.

[DOI]

Ioannis E. Venetis

,

Dimitrios S. Nikolopoulos

,

Theodore S. Papatheodorou

Proceedings of the Euro-Par 2001: Parallel Processing, 2001

2000

Is Data Distribution Necessary in OpenMP?

[DOI]

Dimitrios S. Nikolopoulos

,

Theodore S. Papatheodorou

,

Constantine D. Polychronopoulos

,

,

Eduard Ayguadé

Proceedings of the Proceedings Supercomputing 2000, 2000

Efficient Dynamic Parallelism with OpenMP on Linux SMPs.

Christos D. Antonopoulos

,

Ioannis E. Venetis

,

Dimitrios S. Nikolopoulos

,

Theodore S. Papatheodorou

Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2000

UPMLIB: A Runtime System for Tuning the Memory Performance of OpenMP Programs on Scalable Shared-Memory Multiprocessors.

[DOI]

Dimitrios S. Nikolopoulos

,

Theodore S. Papatheodorou

,

Constantine D. Polychronopoulos

,

,

Eduard Ayguadé

Proceedings of the Languages, 2000

A Tool to Schedule Parallel Applications on Multiprocessors: The NANOS CPU MANAGER.

[DOI]

Xavier Martorell

,

Julita Corbalán

,

Dimitrios S. Nikolopoulos

,

,

Eleftherios D. Polychronopoulos

,

Theodore S. Papatheodorou

,

Proceedings of the Job Scheduling Strategies for Parallel Processing, IPDPS 2000 Workshop, 2000

Leveraging Transparent Data Distribution in OpenMP via User-Level Dynamic Page Migration.

[DOI]

Dimitrios S. Nikolopoulos

,

Theodore S. Papatheodorou

,

Constantine D. Polychronopoulos

,

,

Eduard Ayguadé

Proceedings of the High Performance Computing, Third International Symposium, 2000

Fast Synchronization on Scalable Cache-Coherent Multiprocessors using Hybrid Primitives.

[DOI]

Dimitrios S. Nikolopoulos

,

Theodore S. Papatheodorou

Proceedings of the 14th International Parallel & Distributed Processing Symposium (IPDPS'00), 2000

A case for use-level dynamic page migration.

[DOI]

Dimitrios S. Nikolopoulos

,

Theodore S. Papatheodorou

,

Constantine D. Polychronopoulos

,

,

Eduard Ayguadé

Proceedings of the 14th international conference on Supercomputing, 2000

User-Level Dynamic Page Migration for Multiprogrammed Shared-Memory Multiprocessors.

[DOI]

Dimitrios S. Nikolopoulos

,

Theodore S. Papatheodorou

,

Constantine D. Polychronopoulos

,

,

Eduard Ayguadé

Proceedings of the 2000 International Conference on Parallel Processing, 2000

1999

Fine-Grain and Multiprogramming-Conscious Nanothreading with the Solaris Operating System.

Dimitrios S. Nikolopoulos

,

Eleftherios D. Polychronopoulos

,

Theodore S. Papatheodorou

Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 1999

Achieving multiprogramming scalability of parallel programs on Intel SMP platforms: Nanothreading in the Linux kernel.

Dimitrios S. Nikolopoulos

,

Christos D. Antonopoulos

,

Ioannis E. Venetis

,

Panagiotis E. Hadjidoukas

,

Eleftherios D. Polychronopoulos

,

Theodore S. Papatheodorou

Proceedings of the Parallel Computing: Fundamentals & Applications, 1999

A quantitative architectural evaluation of synchronization algorithms and disciplines on ccNUMA systems: the case of the SGI Origin2000.

[DOI]

Dimitrios S. Nikolopoulos

,

Theodore S. Papatheodorou

Proceedings of the 13th international conference on Supercomputing, 1999

1998

Efficient Runtime Thread Management for the Nano-Threads Programming Model.

[DOI]

Dimitrios S. Nikolopoulos

,

Eleftherios D. Polychronopoulos

,

Theodore S. Papatheodorou

Proceedings of the Parallel and Distributed Processing, 10 IPPS/SPDP'98 Workshops Held in Conjunction with the 12th International Parallel Processing Symposium and 9th Symposium on Parallel and Distributed Processing, Orlando, Florida, USA, March 30, 1998

Kernel-level Scheduling for the Nano-threads Programming Model.

[DOI]

Eleftherios D. Polychronopoulos

,

Xavier Martorell

,

Dimitrios S. Nikolopoulos

,

,

Theodore S. Papatheodorou

,

Proceedings of the 12th international conference on Supercomputing, 1998

Enhancing the Performance of Auroscheduling in Distributed Shared Memory Multiprocessors.

[DOI]

Dimitrios S. Nikolopoulos

,

Eleftherios D. Polychronopoulos

,

Theodore S. Papatheodorou

Proceedings of the Euro-Par '98 Parallel Processing, 1998

Loading...