Aleksandar Ilic

Orcid: 0000-0002-8594-3539

According to our database1, Aleksandar Ilic authored at least 123 papers between 2008 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Strategies for increasing maximum throughput and reducing latency in tree-based WSNs.
PhD thesis, 2023

Maximal diameter of integral circulant graphs.
CoRR, 2023

Special issue: 20th international workshop on algorithms, models and tools for parallel computing on heterogeneous platforms (HeteroPar'22).
Concurr. Comput. Pract. Exp., 2023

Performance Modelling-Driven Optimization of RISC-V Hardware for Efficient SpMV.
Proceedings of the High Performance Computing, 2023

Bringing Order to Sparsity: A Sparse Matrix Reordering Study on Multicore CPUs.
Proceedings of the International Conference for High Performance Computing, 2023

A Performance Modelling-Driven Approach to Hardware Resource Scaling.
Proceedings of the Euro-Par 2023: Parallel Processing Workshops - Euro-Par 2023 International Workshops, Limassol, Cyprus, August 28, 2023

Sparse-Aware CARM: Rooflining Locality of Sparse Computations.
Proceedings of the Euro-Par 2023: Parallel Processing Workshops - Euro-Par 2023 International Workshops, Limassol, Cyprus, August 28, 2023

Interpreting High Order Epistasis Using Sparse Transformers.
Proceedings of the IEEE/ACM Conference on Connected Health: Applications, 2023

2022
Performance optimization of the MGB hydrological model for multi-core and GPU architectures.
Environ. Model. Softw., 2022

Stochastic simulated annealing for directed feedback vertex set.
Appl. Soft Comput., 2022

Unlocking Personalized Healthcare on Modern CPUs/GPUs: Three-way Gene Interaction Study.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

Tensor-Accelerated Fourth-Order Epistasis Detection on GPUs.
Proceedings of the 51st International Conference on Parallel Processing, 2022

2021
Retargeting Tensor Accelerators for Epistasis Detection.
IEEE Trans. Parallel Distributed Syst., 2021

Mansard Roofline Model: Reinforcing the Accuracy of the Roofs.
ACM Trans. Model. Perform. Evaluation Comput. Syst., 2021

Comparing Wiener complexity with eccentric complexity.
Discret. Appl. Math., 2021

On conjectures of network distance measures by using graph spectra.
Discret. Appl. Math., 2021

On the computational complexity of the Steiner k-eccentricity.
CoRR, 2021

Fourth-Order Exhaustive Epistasis Detection for the xPU Era.
Proceedings of the ICPP 2021: 50th International Conference on Parallel Processing, Lemont, IL, USA, August 9, 2021

HEDAcc: FPGA-based Accelerator for High-order Epistasis Detection.
Proceedings of the 29th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2021

2020
Parallel evolutionary computation for multiobjective gene interaction analysis.
J. Comput. Sci., 2020

Application-driven Cache-Aware Roofline Model.
Future Gener. Comput. Syst., 2020

Relations and bounds for the zeros of graph polynomials using vertex orbits.
Appl. Math. Comput., 2020

Accelerating 3-Way Epistasis Detection with CPU+GPU Processing.
Proceedings of the Job Scheduling Strategies for Parallel Processing, 2020

Exploring the Binary Precision Capabilities of Tensor Cores for Epistasis Detection.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

Performance Optimization and Scalability Analysis of the MGB Hydrological Model.
Proceedings of the 27th IEEE International Conference on High Performance Computing, 2020

Heterogeneous CPU+iGPU Processing for Efficient Epistasis Detection.
Proceedings of the Euro-Par 2020: Parallel Processing, 2020

2019
Modeling and Decoupling the GPU Power Consumption for Cross-Domain DVFS.
IEEE Trans. Parallel Distributed Syst., 2019

Modeling Non-Uniform Memory Access on Large Compute Nodes with the Cache-Aware Roofline Model.
IEEE Trans. Parallel Distributed Syst., 2019

DVFS-aware application classification to improve GPGPUs energy efficiency.
Parallel Comput., 2019

Fast block distributed CUDA implementation of the Hungarian algorithm.
J. Parallel Distributed Comput., 2019

Hungarian algorithm for subcarrier assignment problem using GPU and CUDA.
Int. J. Commun. Syst., 2019

Path matrix and path energy of graphs.
Appl. Math. Comput., 2019

GPU Static Modeling Using PTX and Deep Structured Learning.
IEEE Access, 2019

Maximal Diameter on a Class of Circulant Graphs.
Proceedings of the Algebraic Informatics - 8th International Conference, 2019

HeTM: Transactional Memory for Heterogeneous Systems.
Proceedings of the 28th International Conference on Parallel Architectures and Compilation Techniques, 2019

2018
Highly parallel HEVC decoding for heterogeneous systems with CPU and GPU.
Signal Process. Image Commun., 2018

Bipartite graphs with bounded vertex degree and largest eigenvalue of the form r.
Appl. Math. Comput., 2018

Cache-Aware Roofline Model and Medical Image Processing Optimizations in GPUs.
Proceedings of the High Performance Computing, 2018

GPGPU Power Modeling for Multi-domain Voltage-Frequency Scaling.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2018

Accelerating CNN computation: quantisation tuning and network resizing.
Proceedings of the 2nd Workshop on AutotuniNg and aDaptivity AppRoaches for Energy efficient HPC Systems, 2018

2017
GHEVC: An Efficient HEVC Decoder for Graphics Processing Units.
IEEE Trans. Multim., 2017

Beyond the Roofline: Cache-Aware Power and Energy-Efficiency Modeling for Multi-Cores.
IEEE Trans. Computers, 2017

GPU Parallelization of HEVC In-Loop Filters.
Int. J. Parallel Program., 2017

Cache-aware Roofline Model in Intel® Advisor.
ERCIM News, 2017

Accelerating the phylogenetic parsimony function on heterogeneous systems.
Concurr. Comput. Pract. Exp., 2017

Energy-aware mechanism for stencil-based MPDATA algorithm with constraints.
Concurr. Comput. Pract. Exp., 2017

The parameters of Fibonacci and Lucas cubes.
Ars Math. Contemp., 2017

Counterexamples to conjectures on graph distance measures based on topological indexes.
Appl. Math. Comput., 2017

Modeling Large Compute Nodes with Heterogeneous Memories with Cache-Aware Roofline Model.
Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation, 2017

Exploring GPU performance, power and energy-efficiency bounds with Cache-aware Roofline Modeling.
Proceedings of the 2017 IEEE International Symposium on Performance Analysis of Systems and Software, 2017

Analyzing Performance of Multi-cores and Applications with Cache-aware Roofline Model.
Proceedings of the 2017 International Conference on High Performance Computing & Simulation, 2017

Performance Analysis with Cache-Aware Roofline Model in Intel Advisor.
Proceedings of the 2017 International Conference on High Performance Computing & Simulation, 2017

On Boosting Energy-Efficiency of Heterogeneous Embedded Systems via Game Theory.
Proceedings of the 8th Workshop and 6th Workshop on Parallel Programming and Run-Time Management Techniques for Many-core Architectures and Design Tools and Architectures for Multicore Embedded Computing Platforms, 2017

2016
Adaptive Scheduling Framework for Real-Time Video Encoding on Heterogeneous Systems.
IEEE Trans. Circuits Syst. Video Technol., 2016

A Framework for Application-Guided Task Management on Heterogeneous Embedded Systems.
ACM Trans. Archit. Code Optim., 2016

GPU-assisted HEVC intra decoder.
J. Real Time Image Process., 2016

On the extremal values of general degree-based graph entropies.
Inf. Sci., 2016

The number of spanning trees of a graph with given matching number.
Int. J. Comput. Math., 2016

A proof of the conjecture regarding the sum of domination number and average eccentricity.
Discret. Appl. Math., 2016

The eccentric distance sum, the Harary index and the degree powers of graphs with given diameter.
Ars Comb., 2016

Note on the harmonic index of a graph.
Ars Comb., 2016

Efficient HEVC decoder for heterogeneous CPU with GPU systems.
Proceedings of the 18th IEEE International Workshop on Multimedia Signal Processing, 2016

SET response of a SEL protection switch for 130 and 250 nm CMOS technologies.
Proceedings of the 22nd IEEE International Symposium on On-Line Testing and Robust System Design, 2016

Performance and Power-Aware Classification for Frequency Scaling of GPGPU Applications.
Proceedings of the Euro-Par 2016: Parallel Processing Workshops, 2016

2015
Finding Critical Regions and Region-Disjoint Paths in a Network.
IEEE/ACM Trans. Netw., 2015

On the variable common due date, minimal tardy jobs bicriteria two-machine flow shop problem with ordered machines.
Theor. Comput. Sci., 2015

On the distance based graph entropies.
Appl. Math. Comput., 2015

Attaining performance fairness in big.LITTLE systems.
Proceedings of the 12th International Workshop on Intelligent Solutions in Embedded Systems, 2015

HEVC in-loop filters GPU parallelization in embedded systems.
Proceedings of the 2015 International Conference on Embedded Computer Systems: Architectures, 2015

Multi-kernel Auto-Tuning on GPUs: Performance and Energy-Aware Optimization.
Proceedings of the 23rd Euromicro International Conference on Parallel, 2015

Towards GPU HEVC intra decoding: Seizing fine-grain parallelism.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

GPU acceleration of the HEVC decoder inter prediction module.
Proceedings of the 2015 IEEE Global Conference on Signal and Information Processing, 2015

2014
Dynamic Load Balancing for Real-Time Video Encoding on Heterogeneous CPU+GPU Systems.
IEEE Trans. Multim., 2014

Cache-aware Roofline model: Upgrading the loft.
IEEE Comput. Archit. Lett., 2014

The minimal positive index of inertia of signed unicyclic graphs.
Ars Comb., 2014

Performance-Aware Task Management and Frequency Scaling in Embedded Systems.
Proceedings of the 26th IEEE International Symposium on Computer Architecture and High Performance Computing, 2014

FEVES: Framework for Efficient Parallel Video Encoding on Heterogeneous Systems.
Proceedings of the 43rd International Conference on Parallel Processing, 2014

Collaborative inter-prediction on CPU+GPU systems.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

SchedMon: A Performance and Energy Monitoring Tool for Modern Multi-cores.
Proceedings of the Euro-Par 2014: Parallel Processing Workshops, 2014

2013
The weighted vertex PI index.
Math. Comput. Model., 2013

Efficient algorithm for the vertex connectivity of trapezoid graphs.
Inf. Process. Lett., 2013

Generalizations of Wiener Polarity Index and Terminal Wiener Index.
Graphs Comb., 2013

Degree Distance of Unicyclic Graphs with Given Matching Number.
Graphs Comb., 2013

Transparent Application Acceleration by Intelligent Scheduling of Shared Library Calls on Heterogeneous Systems.
Proceedings of the Parallel Processing and Applied Mathematics, 2013

Monitoring Performance and Power for Application Characterization with the Cache-Aware Roofline Model.
Proceedings of the Parallel Processing and Applied Mathematics, 2013

Critical regions and region-disjoint paths in a network.
Proceedings of the IFIP Networking Conference, 2013, Brooklyn, 2013

2012
The index of a binary word.
Theor. Comput. Sci., 2012

A general variable neighborhood search for the one-commodity pickup-and-delivery travelling salesman problem.
Eur. J. Oper. Res., 2012

Generalized Fibonacci cubes.
Discret. Math., 2012

Ballot matrix as Catalan matrix power and related identities.
Discret. Appl. Math., 2012

On reformulated Zagreb indices.
Discret. Appl. Math., 2012

Parity Index of Binary Words and Powers of Prime Words.
Electron. J. Comb., 2012

On the extremal properties of the average eccentricity.
Comput. Math. Appl., 2012

On the second maximal and minimal Wiener index of unicyclic graphs with given girth.
Ars Comb., 2012

The Laplacian spectral radius of graphs with given connectivity.
Ars Comb., 2012

On Realistic Divisible Load Scheduling in Highly Heterogeneous Distributed Systems.
Proceedings of the 20th Euromicro International Conference on Parallel, 2012

Simultaneous Multi-Level Divisible Load Balancing for Heterogeneous Desktop Systems.
Proceedings of the 10th IEEE International Symposium on Parallel and Distributed Processing with Applications, 2012

Hierarchical Partitioning Algorithm for Scientific Computing on Highly Heterogeneous CPU + GPU Clusters.
Proceedings of the Euro-Par 2012 Parallel Processing - 18th International Conference, 2012

2011
CHPS: An Environment for Collaborative Execution on Heterogeneous Desktop Systems.
Int. J. Netw. Comput., 2011

Degree distance of unicyclic and bicyclic graphs.
Discret. Appl. Math., 2011

On vertex covers and matching number of trapezoid graphs
CoRR, 2011

Constructions of hamiltonian graphs with bounded degree and diameter O (log n)
CoRR, 2011

Network analysis using a novel highly discriminating topological index.
Complex., 2011

On the Automorphism Group of Integral Circulant Graphs.
Electron. J. Comb., 2011

New results on the energy of integral circulant graphs.
Appl. Math. Comput., 2011

Scheduling Divisible Loads on Heterogeneous Desktop Systems with Limited Memory.
Proceedings of the Euro-Par 2011: Parallel Processing Workshops - CCPI, CGWS, HeteroPar, HiBB, HPCVirt, HPPC, HPSS, MDGS, ProPer, Resilience, UCHPC, VHPC, Bordeaux, France, August 29, 2011

2010
Note on PI and Szeged indices.
Math. Comput. Model., 2010

A general variable neighborhood search for solving the uncapacitated single allocation p-hub median problem.
Eur. J. Oper. Res., 2010

On distance-balanced graphs.
Eur. J. Comb., 2010

Distance spectral radius of trees with given matching number.
Discret. Appl. Math., 2010

On the chromatic number of integral circulant graphs.
Comput. Math. Appl., 2010

Trees with minimal Laplacian coefficients.
Comput. Math. Appl., 2010

The hyper-Wiener index of trees with given parameters.
Ars Comb., 2010

On the extremal graphs with respect to the vertex PI index.
Appl. Math. Lett., 2010

Zagreb, Harary and hyper-Wiener indices of graphs with a given matching number.
Appl. Math. Lett., 2010

Graphs for small multiprocessor interconnection networks.
Appl. Math. Comput., 2010

Collaborative execution environment for heterogeneous parallel systems.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

High-Performance Computing on Heterogeneous Systems: Database Queries on CPU and GPU.
Proceedings of the High Performance Computing: From Grids and Clouds to Exascale, 2010

2009
Constructions of hamiltonian graphs with bounded degree and diameter O(logn).
Appl. Math. Lett., 2009

On the clique number of integral circulant graphs.
Appl. Math. Lett., 2009

Catalan matrix and related combinatorial identities.
Appl. Math. Comput., 2009

2008
Generalizing (1 - 1)ⁿ = 0: 11230.
Am. Math. Mon., 2008

Distributed Web-based Platform for Computer Architecture Simulation.
Proceedings of the 7th International Symposium on Parallel and Distributed Computing (ISPDC 2008), 2008


  Loading...