Nectarios Koziris

Orcid: 0000-0002-4890-8427

According to our database1, Nectarios Koziris authored at least 253 papers between 1996 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
IW-NET BDA: A Big Data Infrastructure for Predictive and Geotemporal Analytics of Inland Waterways.
IEEE Access, 2024

Open-Source SpMV Multiplication Hardware Accelerator for FPGA-Based HPC Systems.
Proceedings of the Applied Reconfigurable Computing. Architectures, Tools, and Applications, 2024

2023
PARALiA: A Performance Aware Runtime for Auto-tuning Linear Algebra on Heterogeneous Systems.
ACM Trans. Archit. Code Optim., December, 2023

High-performance and balanced parallel graph coloring on multicore platforms.
J. Supercomput., April, 2023

DaeMon: Architectural Support for Efficient Data Movement in Fully Disaggregated Systems.
Proc. ACM Meas. Anal. Comput. Syst., March, 2023

Architectural Support for Efficient Data Movement in Disaggregated Systems.
CoRR, 2023

DaeMon: Architectural Support for Efficient Data Movement in Disaggregated Systems.
CoRR, 2023

Architectural Support for Efficient Data Movement in Fully Disaggregated Systems.
Proceedings of the Abstract Proceedings of the 2023 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, 2023

FaaSCell: A Case for Intra-node Resource Management: Work-In-Progress.
Proceedings of the 1st Workshop on SErverless Systems, Applications and MEthodologies, 2023

Feature-based SpMV Performance Analysis on Contemporary Devices.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

Invited paper: An Artificial Matrix Generator for Multi-platform SpMV Performance Analysis.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

Graph-Centric Crypto Price Prediction.
Proceedings of the IEEE International Conference on Blockchain and Cryptocurrency, 2023

Planning Workflow Executions over the Edge-to-Cloud Continuum.
Proceedings of the Algorithmic Aspects of Cloud Computing - 8th International Symposium, 2023

Deep Reinforcement Learning in Cloud Elasticity Through Offline Learning and Return Based Scaling.
Proceedings of the 16th IEEE International Conference on Cloud Computing, 2023

2022
Enabling Transparent Acceleration of Big Data Frameworks using Heterogeneous Hardware.
Proc. VLDB Endow., 2022

SparseP: Towards Efficient Sparse Matrix Vector Multiplication on Real Processing-In-Memory Architectures.
Proc. ACM Meas. Anal. Comput. Syst., 2022

Towards Efficient Sparse Matrix Vector Multiplication on Real Processing-In-Memory Systems.
CoRR, 2022

QueryER: A Framework for Fast Analysis-Aware Deduplication over Dirty Data.
CoRR, 2022

SparseP: Towards Efficient Sparse Matrix Vector Multiplication on Real Processing-In-Memory Systems.
CoRR, 2022

FaaS in the age of (sub-)<i>μs</i> I/O: a performance analysis of snapshotting.
Proceedings of the SYSTOR '22: The 15th ACM International Systems and Storage Conference, Haifa, Israel, June 13, 2022

Towards Efficient Sparse Matrix Vector Multiplication on Real Processing-In-Memory Architectures.
Proceedings of the SIGMETRICS/PERFORMANCE '22: ACM SIGMETRICS/IFIP PERFORMANCE Joint International Conference on Measurement and Modeling of Computer Systems, Mumbai, India, June 6, 2022

SparseP: Efficient Sparse Matrix Vector Multiplication on Real Processing-In-Memory Architectures.
Proceedings of the IEEE Computer Society Annual Symposium on VLSI, 2022

Deverlay: Container Snapshots For Virtual Machines.
Proceedings of the 22nd IEEE International Symposium on Cluster, 2022

VenOS: A Virtualization Framework for Multiple Tenant Accommodation on Reconfigurable Platforms.
Proceedings of the Applied Reconfigurable Computing. Architectures, Tools, and Applications, 2022

2021
Workload-aware wavelet synopses for sliding window aggregates.
Distributed Parallel Databases, 2021

RCU-HTM: A generic synchronization technique for highly efficient concurrent search trees.
Concurr. Comput. Pract. Exp., 2021

CoCoPeLia: Communication-Computation Overlap Prediction for Efficient Linear Algebra on GPUs.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2021

Exploiting Page Table Locality for Agile TLB Prefetching.
Proceedings of the 48th ACM/IEEE Annual International Symposium on Computer Architecture, 2021

Clouseau: Blockchain-based Data Integrity for HDFS Clusters.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

Ether Price Prediction Using Advanced Deep Learning Models.
Proceedings of the IEEE International Conference on Blockchain and Cryptocurrency, 2021

SynCron: Efficient Synchronization Support for Near-Data-Processing Architectures.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2021

Is Systematic Data Sharding able to Stabilize Asynchronous Parameter Server Training?
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

Cherry: A Distributed Task-Aware Shuffle Service for Serverless Analytics.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

A Performance Evaluation of Distributed Deep Learning Frameworks on CPU Clusters Using Image Classification Workloads.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

A Mechanism Design and Learning Approach for Revenue Maximization on Cloud Dynamic Spot Markets.
Proceedings of the 14th IEEE International Conference on Cloud Computing, 2021

2020
Enabling Virtual Memory Research on RISC-V with a Configurable TLB Hierarchy for the Rocket Chip Generator.
CoRR, 2020

Efficient Concurrent Range Queries in B+-trees using RCU-HTM.
Proceedings of the SPAA '20: 32nd ACM Symposium on Parallelism in Algorithms and Architectures, 2020

Enhancing and Exploiting Contiguity for Fast Memory Virtualization.
Proceedings of the 47th ACM/IEEE Annual International Symposium on Computer Architecture, 2020

A Configurable TLB Hierarchy for the RISC-V Architecture.
Proceedings of the 30th International Conference on Field-Programmable Logic and Applications, 2020

SELIS BDA: Big Data Analytics for the Logistics Domain.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

Fair Procedures for Fair Stable Marriage Outcomes.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Scaling the Construction of Wavelet Synopses for Maximum Error Metrics.
IEEE Trans. Knowl. Data Eng., 2019

Efficient accelerator sharing in virtualized environments: A Xeon Phi use-case.
J. Syst. Softw., 2019

Maintaining Wavelet Synopses for Sliding-Window Aggregates.
Proceedings of the 31st International Conference on Scientific and Statistical Database Management, 2019

Apollo: A Dataset Profiling and Operator Modeling System.
Proceedings of the 2019 International Conference on Management of Data, 2019

Conflict-free symmetric sparse matrix-vector multiplication on multicore architectures.
Proceedings of the International Conference for High Performance Computing, 2019

BASMAT: bottleneck-aware sparse matrix-vector multiplication auto-tuning on GPGPUs.
Proceedings of the 24th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2019

On the Performance and Energy Efficiency of Sparse Matrix-Vector Multiplication on FPGAs.
Proceedings of the Parallel Computing: Technology Trends, 2019

Equitable Stable Matchings in Quadratic Time.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

ACTiManager: An end-to-end interference-aware cloud resource manager.
Proceedings of the 20th International Middleware Conference Demos and Posters, 2019

Extending storage support for unikernel containers.
Proceedings of the 5th International Workshop on Serverless Computing, 2019

Predicting Graph Operator Output over Multiple Graphs.
Proceedings of the Web Engineering - 19th International Conference, 2019

DICER: Diligent Cache Partitioning for Efficient Workload Consolidation.
Proceedings of the 48th International Conference on Parallel Processing, 2019

An adaptive concurrent priority queue for NUMA architectures.
Proceedings of the 16th ACM International Conference on Computing Frontiers, 2019

Towards Faster Distributed Deep Learning Using Data Hashing Techniques.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

BigOptiBase: Big Data Analytics for Base Station Energy Consumption Optimization.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

General-Purpose vs. Specialized Data Analytics Systems: A Game of ML & SQL Thrones.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

Towards a Multi-engine Query Optimizer for Complex SQL Queries on Big Data.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

2018
SparseX: A Library for High-Performance Sparse Matrix-Vector Multiplication on Multicore Platforms.
ACM Trans. Math. Softw., 2018

Cloud application deployment with transient failure recovery.
J. Cloud Comput., 2018

A Similarity-based Approach to Modeling Graph Operators.
CoRR, 2018

utmem: Towards Memory Elasticity in Cloud Workloads.
Proceedings of the High Performance Computing, 2018

Combining HTM with RCU to Speed Up Graph Coloring on Multicore Platforms.
Proceedings of the High Performance Computing - 33rd International Conference, 2018

Efficient resource management for data centers: the ACTiCLOUD approach.
Proceedings of the 18th International Conference on Embedded Computer Systems: Architectures, 2018

Docker-Sec: A Fully Automated Container Security Enhancement Mechanism.
Proceedings of the 38th IEEE International Conference on Distributed Computing Systems, 2018

ORiON: Online ResOurce Negotiator for Multiple Big Data Analytics Frameworks.
Proceedings of the 2018 IEEE International Conference on Autonomic Computing, 2018

Towards an Adaptive, Fully Automated Performance Modeling Methodology for Cloud Applications.
Proceedings of the 2018 IEEE International Conference on Cloud Engineering, 2018

The Vision of a HeterogeneRous Scheduler.
Proceedings of the 2018 IEEE International Conference on Cloud Computing Technology and Science, 2018

RACCEX: Towards Remote Accelerated Computing Environments.
Proceedings of the 2018 IEEE International Conference on Cloud Computing Technology and Science, 2018

DERP: A Deep Reinforcement Learning Cloud System for Elastic Resource Provisioning.
Proceedings of the 2018 IEEE International Conference on Cloud Computing Technology and Science, 2018

Performance Prediction of NUMA Placement: A Machine-Learning Approach.
Proceedings of the 2018 IEEE International Conference on Cloud Computing Technology and Science, 2018

A Content-Based Approach for Modeling Analytics Operators.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

2017
Elastic Resource Management with Adaptive State Space Partitioning of Markov Decision Processes.
CoRR, 2017

A Decision Tree Based Approach Towards Adaptive Profiling of Cloud Applications.
CoRR, 2017

Predictive communication modeling for HPC applications.
Clust. Comput., 2017

YASMIN: Efficient Intra-node Communication Using Generic Sockets.
Proceedings of the High Performance Computing, 2017

Exploiting Social Networking and Mobile Data for Crisis Detection and Management.
Proceedings of the Information Systems for Crisis Response and Management in Mediterranean Countries, 2017

vPHI: Enabling Xeon Phi Capabilities in Virtual Machines.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Performance Analysis and Optimization of Sparse Matrix-Vector Multiplication on Intel Xeon Phi.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Performance Analysis and Optimization of Sparse Matrix-Vector Multiplication on Modern Multi- and Many-Core Processors.
Proceedings of the 46th International Conference on Parallel Processing, 2017

Adaptive State Space Partitioning of Markov Decision Processes for Elastic Resource Management.
Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017


Isolation in Docker through Layer Encryption.
Proceedings of the 37th IEEE International Conference on Distributed Computing Systems, 2017

Improving QoS and Utilisation in modern multi-core servers with Dynamic Cache Partitioning.
Proceedings of the Joined Workshops COSH 2017 and VisorHPC 2017, 2017

Rethinking reinforcement learning for cloud elasticity.
Proceedings of the 2017 Symposium on Cloud Computing, SoCC 2017, Santa Clara, CA, USA, 2017

AURA: Recovering from Transient Failures in Cloud Deployments.
Proceedings of the 17th IEEE/ACM International Symposium on Cluster, 2017

BBQ: Elastic MapReduce over Cloud Platforms.
Proceedings of the 17th IEEE/ACM International Symposium on Cluster, 2017

RASP: Real-time network analytics with distributed NoSQL stream processing.
Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017

Elastic management of cloud applications using adaptive reinforcement learning.
Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017

A decision tree based approach towards adaptive modeling of big data applications.
Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017

Spaten: A spatio-temporal and textual big data generator.
Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017

Automatic Scaling of Resources in a Storm Topology.
Proceedings of the Algorithmic Aspects of Cloud Computing - Third International Workshop, 2017

RCU-HTM: Combining RCU with HTM to Implement Highly Efficient Concurrent Binary Search Trees.
Proceedings of the 26th International Conference on Parallel Architectures and Compilation Techniques, 2017

2016
Improving virtual host efficiency through resource and interference aware scheduling.
CoRR, 2016

Cloud Resource Allocation from the User Perspective: A Bare-Bones Reinforcement Learning Approach.
Proceedings of the Web Information Systems Engineering - WISE 2016, 2016

Distributed Wavelet Thresholding for Maximum Error Metrics.
Proceedings of the 2016 International Conference on Management of Data, 2016

Reliable and efficient performance monitoring in linux.
Proceedings of the International Conference for High Performance Computing, 2016

Massively Concurrent Red-Black Trees with Hardware Transactional Memory.
Proceedings of the 24th Euromicro International Conference on Parallel, 2016

VGVM: Efficient GPU capabilities in virtual machines.
Proceedings of the International Conference on High Performance Computing & Simulation, 2016

Contention-Aware Scheduling Policies for Fairness and Throughput.
Proceedings of the Co-Scheduling of HPC Applications [extended versions of all papers from COSH@HiPEAC 2016, 2016

A resource-centric Application Classification Approach.
Proceedings of the 1st COSH Workshop on Co-Scheduling of HPC Applications, 2016

Optimizing, Planning and Executing Analytics Workflows over Multiple Engines.
Proceedings of the Workshops of the EDBT/ICDT 2016 Joint Conference, 2016

Multi-engine Analytics with IReS.
Proceedings of the Real-Time Business Intelligence and Analytics, 2016

MuSQLE: Distributed SQL query execution over multiple engine environments.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

Mix 'n' match multi-engine analytics.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

Fair, Fast and Frugal Large-Scale Matchmaking for VM Placement.
Proceedings of the Algorithmic Aspects of Cloud Computing - Second International Workshop, 2016

Recovering from Cloud Application Deployment Failures Through Re-execution.
Proceedings of the Algorithmic Aspects of Cloud Computing - Second International Workshop, 2016

2015
A lightweight optimization selection method for Sparse Matrix-Vector Multiplication.
CoRR, 2015

Datix: A System for Scalable Network Analytics.
Comput. Commun. Rev., 2015

Graph-Aware, Workload-Adaptive SPARQL Query Caching.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

MoDisSENSE: A Distributed Spatio-Temporal and Textual Processing Platform for Social Networking Services.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

IReS: Intelligent, Multi-Engine Resource Scheduler for Big Data Analytics Workflows.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

Fifty years of evolution in virtualization technologies: from the first IBM machines to modern hyperconverged infrastructures.
Proceedings of the 19th Panhellenic Conference on Informatics, 2015

A Generic Architecture for Scalable and Highly Available Content Serving Applications in the Cloud.
Proceedings of the Fourth IEEE Symposium on Network Cloud Computing and Applications, 2015

DPDNS Introduction and Committees.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

An Equitable Solution to the Stable Marriage Problem.
Proceedings of the 27th IEEE International Conference on Tools with Artificial Intelligence, 2015

I/O Performance Modeling for Big Data Applications over Cloud Infrastructures.
Proceedings of the 2015 IEEE International Conference on Cloud Engineering, 2015

PANIC: Modeling Application Performance over Virtualized Resources.
Proceedings of the 2015 IEEE International Conference on Cloud Engineering, 2015

V4VSockets: low-overhead intra-node communication in Xen.
Proceedings of the 5th International Workshop on Cloud Data and Platforms, 2015

A Machine-Learning Approach for Communication Prediction of Large-Scale Applications.
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

Heterogeneous k-anonymization with high utility.
Proceedings of the 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29, 2015

2014
~okeanos: Large-Scale Cloud Service Using Ceph.
login Usenix Mag., 2014

Xen2MX: High-performance communication in virtualized environments.
J. Syst. Softw., 2014

Scalable Indexing and Adaptive Querying of RDF Data in the cloud.
Proceedings of the Sixth Workshop on Semantic Web Information Management, 2014

H<sub>2</sub>RDF+: an efficient data management system for big RDF graphs.
Proceedings of the International Conference on Management of Data, 2014

MoDisSENSE: A distributed platform for social networking services over mobile devices.
Proceedings of the 2014 IEEE International Conference on Big Data (IEEE BigData 2014), 2014

Automated workload-aware elasticity of NoSQL clusters in the cloud.
Proceedings of the 2014 IEEE International Conference on Big Data (IEEE BigData 2014), 2014

CELAR: Automated application elasticity platform.
Proceedings of the 2014 IEEE International Conference on Big Data (IEEE BigData 2014), 2014

LCA: a memory link and cache-aware co-scheduling approach for CMPs.
Proceedings of the International Conference on Parallel Architectures and Compilation, 2014

2013
Synnefo: A Complete Cloud Stack over Ganeti.
login Usenix Mag., 2013

An Extended Compression Format for the Optimization of Sparse Matrix-Vector Multiplication.
IEEE Trans. Parallel Distributed Syst., 2013

~okeanos: Building a Cloud, Cluster by Cluster.
IEEE Internet Comput., 2013

DBalancer: distributed load balancing for NoSQL data-stores.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

COCCUS: self-configured cost-based query services in the cloud.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

Improving the Performance of the Symmetric Sparse Matrix-Vector Multiplication in Multicore.
Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

Topic 3: Scheduling and Load Balancing - (Introduction).
Proceedings of the Euro-Par 2013 Parallel Processing, 2013

Automated, Elastic Resource Provisioning for NoSQL Clusters Using TIRAMOLA.
Proceedings of the 13th IEEE/ACM International Symposium on Cluster, 2013

H2RDF+: High-performance distributed joins over large-scale RDF graphs.
Proceedings of the 2013 IEEE International Conference on Big Data (IEEE BigData 2013), 2013

2012
A DHT-Based System for the Management of Loosely Structured, Multidimensional Data.
Trans. Large Scale Data Knowl. Centered Syst., 2012

TarBase 6.0: capturing the exponential growth of miRNA targets with experimental support.
Nucleic Acids Res., 2012

H2RDF: adaptive query processing on RDF data in the cloud.
Proceedings of the 21st World Wide Web Conference, 2012

TIRAMOLA: elastic nosql provisioning through a cloud management platform.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

Automatic scaling of selective SPARQL joins using the TIRAMOLA system.
Proceedings of the 4th International Workshop on Semantic Web Information Management, 2012

Using State-of-the-Art Sparse Matrix Optimizations for Accelerating the Performance of Multiphysics Simulations.
Proceedings of the Applied Parallel and Scientific Computing, 2012

An Approach to Parallelize Kruskal's Algorithm Using Helper Threads.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

Characterizing thread placement in the IBM POWER7 processor.
Proceedings of the 2012 IEEE International Symposium on Workload Characterization, 2012

Efficient Updates for Web-Scale Indexes over the Cloud.
Proceedings of the Workshops Proceedings of the IEEE 28th International Conference on Data Engineering, 2012

Public vs private cloud usage costs: the StratusLab case.
Proceedings of the 2nd International Workshop on Cloud Computing Platforms, 2012

Topic 4: High-Performance Architecture and Compilers.
Proceedings of the Euro-Par 2012 Parallel Processing - 18th International Conference, 2012

Xen2MX: Towards High-Performance Communication in the Cloud.
Proceedings of the Euro-Par 2012: Parallel Processing Workshops, 2012

2011
Fast and Cost-Effective Online Load-Balancing in Distributed Range-Queriable Systems.
IEEE Trans. Parallel Distributed Syst., 2011

DIANA-microT Web server upgrade supports Fly and Worm miRNA target prediction and bibliographic miRNA to disease association.
Nucleic Acids Res., 2011

Brown Dwarf: A fully-distributed, fault-tolerant data warehousing system.
J. Parallel Distributed Comput., 2011

Online querying of d-dimensional hierarchies.
J. Parallel Distributed Comput., 2011

CSX: an extended compression format for spmv on shared memory systems.
Proceedings of the 16th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2011

A Smart HPC Interconnect for Clusters of Virtual Machines.
Proceedings of the Euro-Par 2011: Parallel Processing Workshops - CCPI, CGWS, HeteroPar, HiBB, HPCVirt, HPPC, HPSS, MDGS, ProPer, Resilience, UCHPC, VHPC, Bordeaux, France, August 29, 2011

Coexisting Scheduling Policies Boosting I/O Virtual Machines.
Proceedings of the Euro-Par 2011: Parallel Processing Workshops - CCPI, CGWS, HeteroPar, HiBB, HPCVirt, HPPC, HPSS, MDGS, ProPer, Resilience, UCHPC, VHPC, Bordeaux, France, August 29, 2011

LinkedPeers: A Distributed System for Interlinking Multidimensional Data.
Proceedings of the Database and Expert Systems Applications, 2011

On the elasticity of NoSQL databases over cloud management platforms.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

2010
Exploiting compression opportunities to improve SpMxV performance on shared memory systems.
ACM Trans. Archit. Code Optim., 2010

Replica-aware, multi-dimensional range queries in Distributed Hash Tables.
Comput. Commun., 2010

GMBlock: Optimizing data movement in a block-level storage sharing system over Myrinet.
Clust. Comput., 2010

Distributing and searching concept hierarchies: an adaptive DHT-based system.
Clust. Comput., 2010

Distributed indexing of web scale datasets for the cloud.
Proceedings of the 2010 Workshop on Massive Data Analytics on the Cloud, 2010

Efficient updates for a shared nothing analytics platform.
Proceedings of the 2010 Workshop on Massive Data Analytics on the Cloud, 2010

Solving the advection PDE on the cell broadband engine.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Distributing the power of OLAP.
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, 2010

Exploring I/O Virtualization Data Paths for MPI Applications in a Cluster of VMs: A Networking Perspective.
Proceedings of the Euro-Par 2010 Parallel Processing Workshops, 2010

Brown dwarf: a P2P data-warehousing system.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

2009
Communication-Aware Supernode Shape.
IEEE Trans. Parallel Distributed Syst., 2009

Performance evaluation of the sparse matrix-vector multiplication on modern architectures.
J. Supercomput., 2009

DIANA-microT web server: elucidating microRNA functions through target prediction.
Nucleic Acids Res., 2009

Efficient hybrid parallelisation of tiled algorithms on SMP clusters.
Int. J. Comput. Sci. Eng., 2009

A grid middleware for data management exploiting peer-to-peer techniques.
Future Gener. Comput. Syst., 2009

Accurate microRNA target prediction correlates with protein repression levels.
BMC Bioinform., 2009

Measuring the Cost of Online Load-Balancing in Distributed Range-Queriable Systems.
Proceedings of the Proceedings P2P 2009, 2009

Optimizing Data Management in Grid Environments.
Proceedings of the On the Move to Meaningful Internet Systems: OTM 2009, 2009

Exploring the effect of block shapes on the performance of sparse kernels.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

Early experiences on accelerating Dijkstra's algorithm using transactional memory.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

Employing Transactional Memory and Helper Threads to Speedup Dijkstra's Algorithm.
Proceedings of the ICPP 2009, 2009

Perfomance Models for Blocked Sparse Matrix-Vector Multiplication Kernels.
Proceedings of the ICPP 2009, 2009

GridNews: A distributed automatic Greek broadcast transcription system.
Proceedings of the IEEE International Conference on Acoustics, 2009

An adaptive online system for efficient processing of hierarchical data.
Proceedings of the 18th ACM International Symposium on High Performance Distributed Computing, 2009

MyriXen: Message Passing in Xen Virtual Machines over Myrinet and Ethernet.
Proceedings of the Euro-Par 2009, 2009

A Comparative Study of Blocking Storage Methods for Sparse Matrices on Multicore Architectures.
Proceedings of the 12th IEEE International Conference on Computational Science and Engineering, 2009

Overlapping computation and communication in SMT clusters with commodity interconnects.
Proceedings of the 2009 IEEE International Conference on Cluster Computing, August 31, 2009

2008
Exploring the performance limits of simultaneous multithreading for memory intensive applications.
J. Supercomput., 2008

HiPPIS: an online P2P system for efficient lookups on d-dimensional hierarchies.
Proceedings of the 10th ACM International Workshop on Web Information and Data Management (WIDM 2008), 2008

Understanding the Performance of Sparse Matrix-Vector Multiplication.
Proceedings of the 16th Euromicro International Conference on Parallel, 2008

Support for Concept Hierarchies in DHTs.
Proceedings of the Proceedings P2P'08, 2008

PASS It ON (PASSION): An Adaptive Online Load-Balancing Algorithm for Distributed Range-Query Specialized Systems.
Proceedings of the On the Move to Meaningful Internet Systems: OTM 2008 Workshops, 2008

Online Querying of Concept Hierarchies in P2P Systems.
Proceedings of the On the Move to Meaningful Internet Systems: OTM 2008, 2008

Evaluation of dynamic scheduling methods in simulations of storm-time ion acceleration.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Synchronized send operations for efficient streaming block I/O over Myrinet.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Facilitating efficient synchronization of asymmetric threads on hyper-threaded processors.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Improving the Performance of Multithreaded Sparse Matrix-Vector Multiplication Using Index and Value Compression.
Proceedings of the 2008 International Conference on Parallel Processing, 2008

Optimizing sparse matrix-vector multiplication using index and value compression.
Proceedings of the 5th Conference on Computing Frontiers, 2008

2007
Efficient Block Device Sharing over Myrinet with Memory Bypass.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Coarse-grain Parallel Execution for 2-dimensional PDE Problems.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Global-scale peer-to-peer file services with DFS.
Proceedings of the 8th IEEE/ACM International Conference on Grid Computing (GRID 2007), 2007

2006
The Effect of Process Topology and Load Balancing on Parallel Programming Models for SMP Clusters and Iterative Algorithms.
J. Supercomput., 2006

Message-passing code generation for non-rectangular tiling transformations.
Parallel Comput., 2006

Selecting the tile shape to reduce the total communication volume.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

Exploring the Performance Limits of Simultaneous Multithreading for Scientific Codes.
Proceedings of the 2006 International Conference on Parallel Processing (ICPP 2006), 2006

Memory and Network Bandwidth Aware Scheduling of Multiprogrammed Workloads on Clusters of SMPs.
Proceedings of the 12th International Conference on Parallel and Distributed Systems, 2006

Exploring the Capacity of a Modern SMT Architecture to Deliver High Scientific Application Performance.
Proceedings of the High Performance Computing and Communications, 2006

2005
Hyperplane Grouping and Pipelined Schedules: How to Execute Tiled Loops Fast on Clusters of SMPs.
J. Supercomput., 2005

Fast indexing for blocked array layouts to reduce cache misses.
Int. J. High Perform. Comput. Netw., 2005

Editorial message: special track on distributed systems and grid computing.
Proceedings of the 2005 ACM Symposium on Applied Computing (SAC), 2005

Memory Bandwidth Aware Scheduling for SMP Cluster Nodes.
Proceedings of the 13th Euromicro Workshop on Parallel, 2005

Storing and Locating Mutable Data in Structured Peer-to-Peer Overlay Networks.
Proceedings of the Advances in Informatics, 2005

Tuning Blocked Array Layouts to Exploit Memory Hierarchy in SMT Architectures.
Proceedings of the Advances in Informatics, 2005

Load Balancing Hybrid Programming Models for SMP Clusters and Fully Permutable Loops.
Proceedings of the 34th International Conference on Parallel Processing Workshops (ICPP 2005 Workshops), 2005

A Peer-to-Peer Replica Management Service for High-Throughput Grids.
Proceedings of the 34th International Conference on Parallel Processing (ICPP 2005), 2005

A tile size selection analysis for blocked array layouts.
Proceedings of the 9th Annual Workshop on Interaction between Compilers and Computer Architectures, 2005

2004
Automatic parallel code generation for tiled nested loops.
Proceedings of the 2004 ACM Symposium on Applied Computing (SAC), 2004

Editorial message: special track on parallel and distributed systems.
Proceedings of the 2004 ACM Symposium on Applied Computing (SAC), 2004

Scheduling of Tiled Nested Loops onto a Cluster with a Fixed Number of SMP Nodes.
Proceedings of the 12th Euromicro Workshop on Parallel, 2004

Improving Cache Locality with Blocked Array Layouts.
Proceedings of the 12th Euromicro Workshop on Parallel, 2004

Performance Comparison of Pure MPI vs Hybrid MPI-OpenMP Parallelization Models on SMP Clusters.
Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

Fast Indexing for Blocked Array Layouts to Improve Multi-Level Cache Locality.
Proceedings of the 8th Annual Workshop on Interaction between Compilers and Computer Architecture (INTERACT-8 2004), 2004

2003
An Efficient Code Generation Technique for Tiled Iteration Spaces.
IEEE Trans. Parallel Distributed Syst., 2003

A pipelined schedule to minimize completion time for loop tiling with computation and communication overlapping.
J. Parallel Distributed Comput., 2003

Parallel and Distributed Systems and Networking Track Editorial.
Proceedings of the 2003 ACM Symposium on Applied Computing (SAC), 2003

Advanced Hybrid MPI/OpenMP Parallelization Paradigms for Nested Loop Algorithms onto Clusters of SMPs.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface,10th European PVM/MPI Users' Group Meeting, Venice, Italy, September 29, 2003

Delivering High Performance to Parallel Applications Using Advanced Scheduling.
Proceedings of the Parallel Computing: Software Technology, 2003

2002
Code Generation Methods for Tiling Transformations .
J. Inf. Sci. Eng., 2002

Pipelined scheduling of tiled nested loops onto clusters of SMPs using memory mapped network interfaces.
Proceedings of the 2002 ACM/IEEE conference on Supercomputing, 2002

Automatic code generation for executing tiled nested loops onto parallel architectures.
Proceedings of the 2002 ACM Symposium on Applied Computing (SAC), 2002

Data Parallel Code Generation for Arbitrarily Tiled Loop Nests.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2002

Geometric Scheduling of 2-D UET-UCT Uniform Dependence Loops.
Proceedings of the 10th Euromicro Workshop on Parallel, 2002

Enhancing the Performance of Tiled Loop Execution onto Clusters Using Memory Mapped Network Interfaces and Pipelined Schedules.
Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002

A Pipelined Execution of Tiled Nested Loops on SMPs with Computation and Communication Overlapping.
Proceedings of the 31st International Conference on Parallel Processing Workshops (ICPP 2002 Workshops), 2002

Compiling Tiled Iteration Spaces for Clusters.
Proceedings of the 2002 IEEE International Conference on Cluster Computing (CLUSTER 2002), 2002

Efficient Utilization of Memory Mapped NICs onto Clusters using Pipelined Schedules.
Proceedings of the 2nd IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2002), 2002

2001
TOPPER: A Tool for Optimizing the Performance of Parallel Applications.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2001

TOPPER: An Integrated Environment for Task Allocation and Execution of MPI Applications onto Parallel Architectures.
Proceedings of the Advances in Informatics, 8th Panhellenic Conference on Informatics, 2001

Minimizing Completion Time for Loop Tiling with Computation and Communication Overlapping.
Proceedings of the 15th International Parallel & Distributed Processing Symposium (IPDPS-01), 2001

Geometric Scheduling of 2-D Uniform Dependence Loops.
Proceedings of the Eigth International Conference on Parallel and Distributed Systems, 2001

An Open Distributed Shared Memory System.
Proceedings of the High-Performance Computing and Networking, 9th International Conference, 2001

2000
Chain Grouping: A Method for Partitioning Loops onto Mesh-Connected Processor Arrays.
IEEE Trans. Parallel Distributed Syst., 2000

An efficient algorithm for the physical mapping of clustered task graphs onto multiprocessor architectures.
Proceedings of the Eight Euromicro Workshop on Parallel and Distributed Processing, 2000

Optimal scheduling for UET-UCT grids into fixed number of processors.
Proceedings of the Eight Euromicro Workshop on Parallel and Distributed Processing, 2000

Evaluation of Loop Grouping Methods Based on Orthogonal Projection Spaces.
Proceedings of the 2000 International Conference on Parallel Processing, 2000

1999
Optimal Scheduling for UET/UET-UCT Generalized n-Dimensional Grid Task Graphs.
J. Parallel Distributed Comput., 1999

1998
A Parallel Parsing VLSI Architecture for Arbitrary Context Free Grammars.
Proceedings of the International Conference on Parallel and Distributed Systems, 1998

Automatic generation of a VLSI parallel architecture for QRS detection.
Proceedings of the 9th European Signal Processing Conference, 1998

A Digital Library Model for the Grey Literature of Academic Institutes.
Proceedings of the Research and Advanced Technology for Digital Libraries, 1998

1997
Lower Time and Processor Bounds for Efficient Mapping of Uniform Dependence Algorithms into Systolic Arrays.
Parallel Algorithms Appl., 1997

Optimal Scheduling for UET-UCT Generalized n-Dimensional Grid Task Graphs.
Proceedings of the 11th International Parallel Processing Symposium (IPPS '97), 1997

Mapping nested loops onto distributed memory multiprocessors.
Proceedings of the 1997 International Conference on Parallel and Distributed Systems (ICPADS '97), 1997

Automatic Hardware Synthesis of Nested Loops Using UET Grids and VHDL.
Proceedings of the High-Performance Computing and Networking, 1997

1996
Optimal Time and Efficient Space Free Scheduling For Nested Loops.
Comput. J., 1996


  Loading...