Philippe Olivier Alexandre Navaux

Emmanuell Diaz Carreño

Luciano Paschoal Gaspary

Fernando Fernandes dos Santos

Proceedings of the Euro-Par 2017: Parallel Processing - 23rd International Conference on Parallel and Distributed Computing, Santiago de Compostela, Spain, August 28, 2017

Evaluation and Mitigation of Soft-Errors in Neural Network-Based Object Detection in Three GPU Architectures.

[BibT_eX]

[DOI]

Lucas Draghetti

Lucas Weigel

Proceedings of the 47th Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops, 2017

Kernel vulnerability factor and efficient hardening for histogram of oriented gradients.

[BibT_eX]

[DOI]

Lucas Weigel

Fernando Fernandes

Proceedings of the IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems, 2017

CAROL-FI: an Efficient Fault-Injection Tool for Vulnerability Evaluation of Modern HPC Parallel Accelerators.

[BibT_eX]

[DOI]

Daniel Oliveira

Vinicius Fratin

Israel Koren

Proceedings of the Computing Frontiers Conference, 2017

Data mining the memory access stream to detect anomalous application behavior.

[BibT_eX]

[DOI]

Francis B. Moreira

Israel Koren

Proceedings of the Computing Frontiers Conference, 2017

Optimizing memory affinity with a hybrid compiler/OS approach.

[BibT_eX]

[DOI]

Proceedings of the Computing Frontiers Conference, 2017

Performance Prediction of Acoustic Wave Numerical Kernel on Intel Xeon Phi Processor.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing - 4th Latin American Conference, 2017

IoT Workload Distribution Impact Between Edge and Cloud Computing in a Smart Grid Application.

[BibT_eX]

[DOI]

Otávio Carvalho

Manuel Garcia

Emmanuell Diaz Carreño

Proceedings of the High Performance Computing - 4th Latin American Conference, 2017

2016

Kernel-Based Thread and Data Mapping for Improved Memory Affinity.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2016

Evaluation of Histogram of Oriented Gradients Soft Errors Criticality for Automotive Applications.

[BibT_eX]

[DOI]

Fernando Fernandes

Lucas Weigel

Cláudio R. Jung

ACM Trans. Archit. Code Optim., 2016

Hardware-Assisted Thread and Data Mapping in Hierarchical Multicore Architectures.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2016

A dynamic block-level execution profiler.

[BibT_eX]

[DOI]

Francis B. Moreira

Israel Koren

Parallel Comput., 2016

LAPT: A locality-aware page table for thread and data mapping.

[BibT_eX]

[DOI]

Parallel Comput., 2016

Seismic wave propagation simulations on low-power and performance-centric manycores.

[BibT_eX]

[DOI]

Parallel Comput., 2016

Automatic I/O scheduling algorithm selection for parallel file systems.

[BibT_eX]

[DOI]

Guilherme Grunewald Magalhaes

Concurr. Comput. Pract. Exp., 2016

How Programming Languages and Paradigms Affect Performance and Energy in Multithreaded Applications.

[BibT_eX]

[DOI]

Anderson Luiz Sartor

Arthur Francisco Lorenzon

Antonio Carlos Schneider Beck

Proceedings of the VI Brazilian Symposium on Computing Systems Engineering, 2016

Exploring Cache Size and Core Count Tradeoffs in Systems with Reduced Memory Access Latency.

[BibT_eX]

[DOI]

Proceedings of the 24th Euromicro International Conference on Parallel, 2016

Analyzing and Improving Memory Access Patterns of Large Irregular Applications on NUMA Machines.

[BibT_eX]

[DOI]

Artur Mariano

Christian H. Bischof

Proceedings of the 24th Euromicro International Conference on Parallel, 2016

Communication in Shared Memory: Concepts, Definitions, and Efficient Detection.

[BibT_eX]

[DOI]

Proceedings of the 24th Euromicro International Conference on Parallel, 2016

Towards Weather Forecasting in the Cloud.

[BibT_eX]

[DOI]

Proceedings of the 24th Euromicro International Conference on Parallel, 2016

System energy analysis for shared memory multiprocessing applications.

[BibT_eX]

[DOI]

Dieison Soares Silveira

Sergio Bampi

Gabriel B. Moro

Proceedings of the 2016 IEEE International Conference on Electronics, Circuits and Systems, 2016

A Sharing-Aware Memory Management Unit for Online Mapping in Multi-core Architectures.

[BibT_eX]

[DOI]

John Anderson García Henao

Proceedings of the Euro-Par 2016: Parallel Processing, 2016

enerGyPU and enerGyPhi Monitor for Power Consumption and Performance Evaluation on Nvidia Tesla GPU and Intel Xeon Phi.

[BibT_eX]

[DOI]

Esteban Hernandez B.

Carlos E. Montenegro

Carlos Jaime Barrios Hernández

Proceedings of the IEEE/ACM 16th International Symposium on Cluster, 2016

Fostering Collaboration in Energy Research and Technological Developments Applying New Exascale HPC Techniques.

[BibT_eX]

[DOI]

José María Cela

Alvaro L. G. A. Coutinho

Rafael Mayo-García

Proceedings of the IEEE/ACM 16th International Symposium on Cluster, 2016

Automatic Communication Optimization of Parallel Applications in Public Clouds.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM 16th International Symposium on Cluster, 2016

Performance Evaluation of Multiple Cloud Data Centers Allocations for HPC.

[BibT_eX]

[DOI]

Jimmy K. M. Valverde-Sánchez

Emmanuell Diaz Carreño

Matheus da Silva Serpa

Guillaume Houzeaux

Luciano Paschoal Gaspary

Proceedings of the High Performance Computing - Third Latin American Conference, 2016

Exploration of Load Balancing Thresholds to Save Energy on Iterative Applications.

[BibT_eX]

[DOI]

Márcio Castro

Daniel Fernández-Galisteo

Proceedings of the High Performance Computing - Third Latin American Conference, 2016

Enhancing Energy Production with Exascale HPC Methods.

[BibT_eX]

[DOI]

Alvaro L. G. A. Coutinho

Manuel Aurelio Rodriguez Pascual

Daniel de Oliveira

Vítor Silva

Renan Souza

Patrick Valduriez

Proceedings of the High Performance Computing - Third Latin American Conference, 2016

2015

Characterizing communication and page usage of parallel applications for thread and data mapping.

[BibT_eX]

[DOI]

Fabrice Dupros

Perform. Evaluation, 2015

Communication-aware process and thread mapping using online communication detection.

[BibT_eX]

[DOI]

Parallel Comput., 2015

On the energy efficiency and performance of irregular application executions on multicore, NUMA and manycore platforms.

[BibT_eX]

[DOI]

J. Parallel Distributed Comput., 2015

Performance/energy trade-off in scientific computing: the case of ARM big.LITTLE and Intel Sandy Bridge.

[BibT_eX]

[DOI]

Márcio Castro

IET Comput. Digit. Tech., 2015

Communication-aware thread mapping using the translation lookaside buffer.

[BibT_eX]

[DOI]

Concurr. Comput. Pract. Exp., 2015

TABARNAC: visualizing and resolving memory access issues on NUMA architectures.

[BibT_eX]

[DOI]

David Beniamine

Proceedings of the 2nd Workshop on Visual Performance Analysis, 2015

Towards Seismic Wave Modeling on Heterogeneous Many-Core Architectures Using Task-Based Runtime System.

[BibT_eX]

[DOI]

Proceedings of the 27th International Symposium on Computer Architecture and High Performance Computing, 2015

Characterizing Anomalies of a Multicore ARMv7 Cluster with Parallel N-Body Simulations.

[BibT_eX]

[DOI]

Jean Luca Bez

Proceedings of the 2015 International Symposium on Computer Architecture and High Performance Computing Workshops, 2015

Performance impact of operating systems' caching parameters on parallel file systems.

[BibT_eX]

[DOI]

Eduardo Camilo Inacio

Mario A. R. Dantas

Douglas Dyllon Jeronimo de Macedo

Proceedings of the 30th Annual ACM Symposium on Applied Computing, 2015

Partial coscheduling of virtual machines based on memory access patterns.

[BibT_eX]

[DOI]

Jan Hendrik Schönherr

Proceedings of the 30th Annual ACM Symposium on Applied Computing, 2015

Towards fast profiling of storage devices regarding access sequentiality.

[BibT_eX]

[DOI]

Rodrigo Kassick

Proceedings of the 30th Annual ACM Symposium on Applied Computing, 2015

Locality vs. Balance: Exploring Data Mapping Policies on NUMA Systems.

[BibT_eX]

[DOI]

Proceedings of the 23rd Euromicro International Conference on Parallel, 2015

An Efficient Algorithm for Communication-Based Task Mapping.

[BibT_eX]

[DOI]

Proceedings of the 23rd Euromicro International Conference on Parallel, 2015

Challenges and Solutions in Executing Numerical Weather Prediction in a Cloud Infrastructure.

[BibT_eX]

[DOI]

Daniel Alfonso Gonçalves de Oliveira

Proceedings of the International Conference on Computational Science, 2015

The Path to Exascale: Code Optimizations and Hardening Solutions Reliability.

[BibT_eX]

[DOI]

Caio B. Lunardi

Proceedings of the 5th Workshop on Fault Tolerance for HPC at eXtreme Scale, 2015

SiNUCA: A Validated Micro-Architecture Simulator.

[BibT_eX]

[DOI]

Carlos Villavieja

Francis Birck Moreira

Proceedings of the 17th IEEE International Conference on High Performance Computing and Communications, 2015

Understanding GPU errors on large-scale HPC systems and the implications for system design and operation.

[BibT_eX]

[DOI]

Sudharshan S. Vazhkudai

Daniel Oliveira

Dave Londo

Nathan DeBardeleben

Arthur S. Bland

Proceedings of the 21st IEEE International Symposium on High Performance Computer Architecture, 2015

Locality and Balance for Communication-Aware Thread Mapping in Multicore Systems.

[BibT_eX]

[DOI]

Mohammad S. Alhakeem

Proceedings of the Euro-Par 2015: Parallel Processing, 2015

Porting a Numerical Atmospheric Model to a Cloud Service.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing - Second Latin American Conference, 2015

2014

Best of SBAC-PAD 2012.

[BibT_eX]

[DOI]

Parallel Comput., 2014

Dynamic thread mapping of shared memory applications by exploiting cache coherence protocols.

[BibT_eX]

[DOI]

J. Parallel Distributed Comput., 2014

A topology-aware load balancing algorithm for clustered hierarchical multi-core machines.

[BibT_eX]

[DOI]

Pierre Coucheney

François Broquedis

Bruno Gaujal

Future Gener. Comput. Syst., 2014

Optimizing Memory Locality Using a Locality-Aware Page Table.

[BibT_eX]

[DOI]

Proceedings of the 26th IEEE International Symposium on Computer Architecture and High Performance Computing, 2014

Energy Efficient Seismic Wave Propagation Simulation on a Low-Power Manycore Processor.

[BibT_eX]

[DOI]

Proceedings of the 26th IEEE International Symposium on Computer Architecture and High Performance Computing, 2014

Improving the Performance of Seismic Wave Simulations with Dynamic Load Balancing.

[BibT_eX]

[DOI]

Rafael Keller Tesser

Fabrice Dupros

Proceedings of the 22nd Euromicro International Conference on Parallel, 2014

Saving energy by exploiting residual imbalances on iterative applications.

[BibT_eX]

[DOI]

Márcio Castro

Proceedings of the 21st International Conference on High Performance Computing, 2014

Impact of GPUs Parallelism Management on Safety-Critical and HPC Applications Reliability.

[BibT_eX]

[DOI]

Proceedings of the 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2014

Radiation Sensitivity of High Performance Computing Applications on Kepler-Based GPGPUs.

[BibT_eX]

[DOI]

Daniel A. G. de Oliveira

Caio B. Lunardi

Proceedings of the 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2014

GPGPUs ECC efficiency and efficacy.

[BibT_eX]

[DOI]

Daniel A. G. de Oliveira

Proceedings of the 2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems, 2014

kMAF: automatic kernel-level management of thread and data affinity.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Parallel Architectures and Compilation, 2014

2013

Preserving the original MPI semantics in a virtualized processor environment.

[BibT_eX]

[DOI]

Sci. Comput. Program., 2013

Evaluating application performance and energy consumption on hybrid CPU+GPU architecture.

[BibT_eX]

[DOI]

Pedro Velho

Clust. Comput., 2013

Energy Efficient Last Level Caches via Last Read/Write Prediction.

[BibT_eX]

[DOI]

Carlos Villavieja

Proceedings of the 25th International Symposium on Computer Architecture and High Performance Computing, 2013

Communication-Based Mapping Using Shared Pages.

[BibT_eX]

[DOI]

Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

AGIOS: Application-Guided I/O Scheduling for Parallel File Systems.

[BibT_eX]

[DOI]

Proceedings of the 19th IEEE International Conference on Parallel and Distributed Systems, 2013

Neutron sensitivity and software hardening strategies for matrix multiplication and FFT on graphics processing units.

[BibT_eX]

[DOI]

Francesco Silvestri

Proceedings of the 3rd Workshop on Fault-tolerance for HPC at extreme scale, 2013

2012

A hierarchical aggregation model to achieve visualization scalability in the analysis of parallel applications.

[BibT_eX]

[DOI]

Parallel Comput., 2012

Memory-aware Thread and Data Mapping for Hierarchical Multi-core Platforms.

[BibT_eX]

[DOI]

Marco Antonio Zanata Alves

Int. J. Netw. Comput., 2012

Atmospheric models hybrid OpenMP/MPI implementation multicore cluster evaluation.

[BibT_eX]

[DOI]

Carla Osthoff

Pablo Javier Grunmann

Robert L. Walko

Int. J. Inf. Technol. Commun. Convergence, 2012

Energy Savings via Dead Sub-Block Prediction.

[BibT_eX]

[DOI]

Yale N. Patt

Proceedings of the IEEE 24th International Symposium on Computer Architecture and High Performance Computing, 2012

DIMVHCM: An On-line Distributed Monitoring Data Collection Model.

[BibT_eX]

[DOI]

Rafael Keller Tesser

Proceedings of the 20th Euromicro International Conference on Parallel, 2012

Using the Translation Lookaside Buffer to Map Threads in Parallel Applications Based on Shared Memory.

[BibT_eX]

[DOI]

Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

Evaluating Performance and Energy on ARM-based Clusters for High Performance Computing.

[BibT_eX]

[DOI]

Daniel A. G. de Oliveira

Pedro Velho

Proceedings of the 41st International Conference on Parallel Processing Workshops, 2012

A Hierarchical Approach for Load Balancing on Parallel Multi-core Systems.

[BibT_eX]

[DOI]

Daniel Cordeiro

Chao Mei

Abhinav Bhatele

François Broquedis

Laxmikant V. Kalé

Proceedings of the 41st International Conference on Parallel Processing, 2012

Asymptotically Optimal Load Balancing for Hierarchical Multi-Core Systems.

[BibT_eX]

[DOI]

Proceedings of the 18th IEEE International Conference on Parallel and Distributed Systems, 2012

High Performance Computing in the cloud: Deployment, performance and cost efficiency.

[BibT_eX]

[DOI]

Proceedings of the 4th IEEE International Conference on Cloud Computing Technology and Science Proceedings, 2012

Evaluating High Performance Computing on the Windows Azure Platform.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Fifth International Conference on Cloud Computing, 2012

2011

High Latency and Contention on Shared L2-Cache for Many-Core Architectures.

[BibT_eX]

[DOI]

Antonio Carlos Schneider Beck

Parallel Process. Lett., 2011

Boosting Parallel Applications Performance on Applying DIM Technique in a Multiprocessing Environment.

[BibT_eX]

[DOI]

Mateus B. Rutzig

Int. J. Reconfigurable Comput., 2011

Challenges and solutions to improve the scalability of an operational regional meteorological forecasting model.

[BibT_eX]

[DOI]

Alvaro Luiz Fazenda

Daniel M. Katsurayama

Luiz Flavio Rodrigues

Luis F. G. Motta

Int. J. High Perform. Syst. Archit., 2011

The impact of applications' I/O strategies on the performance of the Lustre parallel file system.

[BibT_eX]

[DOI]

Int. J. High Perform. Syst. Archit., 2011

Dynamic I/O Reconfiguration for a NFS-Based Parallel File System.

[BibT_eX]

[DOI]

Rodrigo Kassick

Proceedings of the 19th International Euromicro Conference on Parallel, 2011

Improving Performance on Atmospheric Models through a Hybrid OpenMP/MPI Implementation.

[BibT_eX]

[DOI]

Carla Osthoff

Pablo Javier Grunmann

Rodrigo Kassick

Robert L. Walko

Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2011

Using Memory Access Traces to Map Threads and Data on Hierarchical Multi-core Platforms.

[BibT_eX]

[DOI]

Marco Antonio Zanata Alves

Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Combining Multiple Metrics to Control BSP Process Rescheduling in Response to Resource and Application Dynamics.

[BibT_eX]

[DOI]

Lucas Graebin

Proceedings of the 17th IEEE International Conference on Parallel and Distributed Systems, 2011

2010

Observing the Impact of Multiple Metrics and Runtime Adaptations on BSP Process Rescheduling.

[BibT_eX]

[DOI]

Parallel Process. Lett., 2010

Triva: Interactive 3D visualization for performance analysis of parallel applications.

[BibT_eX]

[DOI]

Gerson Geraldo Homrich Cavalheiro

Future Gener. Comput. Syst., 2010

Preface to CIESC 2009 Special Issue.

[BibT_eX]

[DOI]

Adenauer Corrêa Yamin

CLEI Electron. J., 2010

Applying Process Migration on a BSP-Based LU Decomposition Application.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing for Computational Science - VECPAR 2010, 2010

A Comparative Analysis of Load Balancing Algorithms Applied to a Weather Forecast Model.

[BibT_eX]

[DOI]

Proceedings of the 22st International Symposium on Computer Architecture and High Performance Computing, 2010

I/O Performance Evaluation on Multicore Clusters with Atmospheric Model Environment.

[BibT_eX]

[DOI]

Carla Osthoff

Pablo Javier Grunmann

Pedro Pais Lopes

Proceedings of the 22nd International Symposium on Computer Architecture and High Performance Computing Workshops, 2010

Impact of I/O Coordination on a NFS-Based Parallel File System with Dynamic Reconfiguration.

[BibT_eX]

[DOI]

Proceedings of the 22st International Symposium on Computer Architecture and High Performance Computing, 2010

A new technique for data privatization in user-level threads and its use in parallel applications.

[BibT_eX]

[DOI]

Proceedings of the 2010 ACM Symposium on Applied Computing (SAC), 2010

Challenges and Issues of Supporting Task Parallelism in MPI.

[BibT_eX]

[DOI]

Márcia C. Cera

João V. F. Lima

Proceedings of the Recent Advances in the Message Passing Interface, 2010

Parallel Shared-Memory Workloads Performance on Asymmetric Multi-core Architectures.

[BibT_eX]

[DOI]

Proceedings of the 18th Euromicro Conference on Parallel, 2010

Impact of Parallel Workloads on NoC Architecture Design.

[BibT_eX]

[DOI]

Marco Antonio Zanata Alves

Proceedings of the 18th Euromicro Conference on Parallel, 2010

Supporting performance and adaptivity on BSP process rescheduling.

[BibT_eX]

[DOI]

Proceedings of the 15th IEEE Symposium on Computers and Communications, 2010

TLP and ILP exploitation through a reconfigurable multiprocessor system.

[BibT_eX]

[DOI]

Mateus B. Rutzig

Antonio Carlos Schneider Beck

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Supporting Malleability in Parallel Architectures with Dynamic CPUSETsMapping and Dynamic MPI.

[BibT_eX]

[DOI]

Proceedings of the Distributed Computing and Networking, 11th International Conference, 2010

Evaluating Thread Placement Based on Memory Access Patterns for Multi-core Processors.

[BibT_eX]

[DOI]

Jörg Schneider

Proceedings of the 12th IEEE International Conference on High Performance Computing and Communications, 2010

Optimizing an MPI weather forecasting model via processor virtualization.

[BibT_eX]

[DOI]

Laxmikant V. Kalé

Proceedings of the 2010 International Conference on High Performance Computing, 2010

2009

Parallel Lattice Boltzmann Method with Blocked Partitioning.

[BibT_eX]

[DOI]

Int. J. Parallel Program., 2009

Visual Mapping of Program Components to Resources Representation: A 3D Analysis of Grid Parallel Applications.

[BibT_eX]

[DOI]

Proceedings of the 21st International Symposium on Computer Architecture and High Performance Computing, 2009

Performance Evaluation of NoC Architectures for Parallel Workloads.

[BibT_eX]

[DOI]

Proceedings of the Third International Symposium on Networks-on-Chips, 2009

Design of a Grid workflow for a climate application.

[BibT_eX]

[DOI]

Proceedings of the 14th IEEE Symposium on Computers and Communications (ISCC 2009), 2009

Multi-core aware process mapping and its impact on communication overhead of parallel applications.

[BibT_eX]

[DOI]

Proceedings of the 14th IEEE Symposium on Computers and Communications (ISCC 2009), 2009

Design of Interleaved Multithreading for Network Processors on Chip.

[BibT_eX]

[DOI]

Proceedings of the International Symposium on Circuits and Systems (ISCAS 2009), 2009

Applying Processes Rescheduling over Irregular BSP Application.

[BibT_eX]

[DOI]

Proceedings of the Computational Science, 2009

MigBSP: A Novel Migration Model for Bulk-Synchronous Parallel Processes Rescheduling.

[BibT_eX]

[DOI]

Proceedings of the 11th IEEE International Conference on High Performance Computing and Communications, 2009

On the design of reconfigurable crossbar switch for adaptable on-chip topologies in programmable NoC routers.

[BibT_eX]

[DOI]

Proceedings of the 19th ACM Great Lakes Symposium on VLSI 2009, 2009

Towards Visualization Scalability through Time Intervals and Hierarchical Organization of Monitoring Data.

[BibT_eX]

[DOI]

Proceedings of the 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, 2009

2008

Controlling Processes Reassignment in BSP Applications.

[BibT_eX]

[DOI]

Proceedings of the 20th International Symposium on Computer Architecture and High Performance Computing, 2008

3D approach to the visualization of parallel applications and Grid monitoring information.

[BibT_eX]

[DOI]

Proceedings of the 9th IEEE/ACM International Conference on Grid Computing (Grid 2008), Tsukuba, Japan, September 29, 2008

NOC architecture design for multi-cluster chips.

[BibT_eX]

[DOI]

Proceedings of the FPL 2008, 2008

ICE: Managing Multiple Clusters Using Web Services.

[BibT_eX]

[DOI]

Proceedings of the 11th IEEE International Conference on Computational Science and Engineering, 2008

A High-Throughput Multi-cluster NoC Architecture.

[BibT_eX]

[DOI]

Proceedings of the 11th IEEE International Conference on Computational Science and Engineering, 2008

2007

Limits for a feasible speculative trace reuse implementation.

[BibT_eX]

[DOI]

Felipe Maia Galvão França

Bruce R. Childers

Amarildo T. da Costa

Int. J. High Perform. Syst. Archit., 2007

Automatic heart localization in ultrasound fetal images.

[BibT_eX]

Proceedings of the VISAPP 2007: Proceedings of the Second International Conference on Computer Vision Theory and Applications, Barcelona, Spain, March 8-11, 2007, 2007

On-line Scheduling of MPI-2 Programs with Hierarchical Work Stealing.

[BibT_eX]

[DOI]

Proceedings of the 19th Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2007), 2007

Evaluating Network-on-Chip for Homogeneous Embedded Multiprocessors in FPGAs.

[BibT_eX]

[DOI]

Fernanda Lima Kastensmidt

Dalton M. Colombo

Proceedings of the International Symposium on Circuits and Systems (ISCAS 2007), 2007

The Use of Artificial Neural Networks in the Speech Understanding Model - SUM.

[BibT_eX]

[DOI]

Proceedings of the Artificial Neural Networks, 2007

Processing Mesoscale Climatology in a Grid Environment.

[BibT_eX]

[DOI]

Roberto Pinto Souto

M. X. Py

Haroldo F. de Campos Velho

Stephan Stephany

Airam Jonatas Preto

E. S. Almeida

A. W. Gandu

Proceedings of the Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2007), 2007

2006

Metaserver Locality and Scalability in a Distributed NFS.

[BibT_eX]

[DOI]

Everton Hermann

Proceedings of the High Performance Computing for Computational Science, 2006

A Speculative Trace Reuse Architecture with Reduced Hardware Requirements.

[BibT_eX]

[DOI]

Proceedings of the 18th Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2006), 2006

Improving the Dynamic Creation of Processes in MPI-2.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2006

A Model to Computational Speech Understanding.

[BibT_eX]

[DOI]

Proceedings of the Computational Processing of the Portuguese Language, 2006

Scheduling Dynamically Spawned Processes in MPI-2.

[BibT_eX]

[DOI]

Proceedings of the Job Scheduling Strategies for Parallel Processing, 2006

A Connectionist Approach to Speech Understanding.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2006

DIMVisual: Data Integration Model for Visualization of Parallel Programs Behavior.

[BibT_eX]

[DOI]

Benhur de Oliveira Stein

Proceedings of the Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2006), 2006

ICE: A Service Oriented Approach to Uniform the Access and Management of Cluster Environments.

[BibT_eX]

[DOI]

Proceedings of the Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2006), 2006

Computational Model of Speech Understanding.

[BibT_eX]

[DOI]

Proceedings of the IASTED International Conference on Artificial Intelligence and Applications, 2006

2005

Reusing Traces in a Dynamic Conditional Execution Architecture.

[BibT_eX]

[DOI]

Sergio Bampi

Proceedings of the 17th Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2005), 2005

Asynchronous Communication in Java over Infiniband and DECK.

[BibT_eX]

[DOI]

Márcia C. Cera

Marcelo Pasin

Proceedings of the 17th Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2005), 2005

Branch Prediction Topologies for SMT Architectures.

[BibT_eX]

[DOI]

Guilherme Dal Pizzol

Proceedings of the 17th Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2005), 2005

Cluster and network management integration an SNMP-based Solution.

[BibT_eX]

Rodrigo Sanger Alves

Lisandro Zambenedetti Granville

Proceedings of the ICETE 2005, 2005

Evaluating the performance of the dNFSP file system.

[BibT_eX]

[DOI]

Proceedings of the 5th International Symposium on Cluster Computing and the Grid (CCGrid 2005), 2005

2004

Parallel Computational Model with Dynamic Load Balancing in PC Clusters.

[BibT_eX]

[DOI]

André L. Martinotto

Delcino Picinin

Proceedings of the High Performance Computing for Computational Science, 2004

Value Predictors for Reuse through Speculation on Traces.

[BibT_eX]

[DOI]

Felipe Maia Galvão França

Bruce R. Childers

Amarildo T. da Costa

Proceedings of the 16th Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2004), 2004

Performance Evaluation of a Prototype Distributed NFS Server.

[BibT_eX]

[DOI]

Pierre Lombard

Adrien Lebre

Proceedings of the 16th Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2004), 2004

High Performance Cluster Management Based on SNMP: Experiences on Integration Between Network Patterns and Cluster Management Concepts.

[BibT_eX]

[DOI]

Rodrigo Sanger Alves

Lisandro Zambenedetti Granville

Proceedings of the Telecommunications and Networking, 2004

2003

Performance Analysis of DECK Collective Communication Service.

[BibT_eX]

[DOI]

Proceedings of the 15th Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2003), 2003

Complex Branch Profiling for Dynamic Conditional Execution.

[BibT_eX]

[DOI]

Rafael R. dos Santos

Sergio Bampi

Mario Nemirovsky

Proceedings of the 15th Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2003), 2003

Dynamic Load Balancing in PC Clusters: An Application to a Multi-Physics Model.

[BibT_eX]

[DOI]

Proceedings of the 15th Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2003), 2003

Parallelization of Krylov Subspace Methods in Multiprocessor PC Clusters.

[BibT_eX]

Delcino Picinin Jr.

André L. Martinotto

Carlos Amaral Hölbig

Proceedings of the Parallel Computing: Software Technology, 2003

An Oscillatory Neural Network for Image Segmentation.

[BibT_eX]

[DOI]

Dênis Fernandes

Proceedings of the Progress in Pattern Recognition, 2003

2002

Echocardiographic Image Sequence Segmentation and Analysis Using Self-Organizing Maps.

[BibT_eX]

[DOI]

Jacob Scharcanski

J. VLSI Signal Process., 2002

Architecture of Oscillatory Neural Network for Image Segmentation.

[BibT_eX]

[DOI]

Dênis Fernandes

J. Stedile

Proceedings of the 14th Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2002), 2002

Parallelizing Conjugate Gradient Method for Clusters Using MPI and Threads.

[BibT_eX]

Delcino Picinin Jr.

André L. Martinotto

Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2002

An Evaluation of Simple and Efficient Optimization Techniques for Matrix Muliplication.

[BibT_eX]

Diego Fraga Contessa

Rodrigo Sanger Alves

Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2002

Message-passing Over Shared Memory for the SECK Programming Environment.

[BibT_eX]

Caciano Machado

Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2002

Improving SMT Performance Scheduling Processes.

[BibT_eX]

[DOI]

Ronaldo Gonçalves

Proceedings of the 10th Euromicro Workshop on Parallel, 2002

2001

Segmentation of TEM Images Using Oscillatory Neural Networks.

[BibT_eX]

[DOI]

Dênis Fernandes

Paulo Fernando Papaleo Fichtner

Proceedings of the 14th Brazilian Symposium on Computer Graphics and Image Processing (SIBGRAPI 2001), 2001

Evaluating the Effects of Branch Prediction Accuracy on the Performance of SMT Architectures.

[BibT_eX]

[DOI]

Ronaldo Gonçalves

Guilherme Dal Pizzol

Rafael Santos

Proceedings of the Ninth Euromicro Workshop on Parallel and Distributed Processing, 2001

DECK-SCI: High-Performance Communication and Multithreading for SCI Clusters.

[BibT_eX]

[DOI]

Fabio A. D. de Oliveira

Marcos E. Barreto

Proceedings of the 2001 IEEE International Conference on Cluster Computing (CLUSTER 2001), 2001

2000

Fetal Left Atrium Segmentation using Kohonen Maps to Measure the Septum Primum Redundancy Index.

[BibT_eX]

[DOI]

Guilherme Drehmer

Proceedings of the 6th Brazilian Symposium on Neural Networks (SBRN 2000), 2000

DPC++: Object-Oriented Programming Applied to Cluster Computing.

[BibT_eX]

André Silveira

Marcos E. Barreto

Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2000

Distributed Processor Allocation in Mesh-Connected Multicomputers.

[BibT_eX]

Cláudio F. R. Geyer

Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2000

A Selection Mechanism to Group Processes in a Parallel Debugger.

[BibT_eX]

Jacques Chassin de Kergommeaux

Denise Stringhini

Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2000

The MultiCluster Model to the Integrated Use of Multiple Workstation Clusters.

[BibT_eX]

[DOI]

Marcos E. Barreto

Proceedings of the Parallel and Distributed Processing, 2000

Distributed Processor Allocation in Large PC Clusters.

[BibT_eX]

[DOI]

Proceedings of the Ninth IEEE International Symposium on High Performance Distributed Computing, 2000

Distributed Processor Allocation in Multicomputers.

[BibT_eX]

[DOI]

Rose Rose

Proceedings of the 2000 IEEE International Conference on Cluster Computing (CLUSTER 2000), November 28th, 2000

1998

Analysing a Multistreamed Superscalar Speculative Fetch Mechanism.

[BibT_eX]

[DOI]

Rafael R. dos Santos

Proceedings of the Euro-Par '98 Parallel Processing, 1998

1996

High performance with high accuracy laboratory.

[BibT_eX]

Dalcidio Moraes Claudio

Carlos Amaral Hölbig

Ursula A. L. Fernandes

R. L. Sagula

RITA, 1996

1995

Performance evaluation in image processing with GAPP array processor.

[BibT_eX]

[DOI]

Gerson G. H. Cavalheiro

Microprocess. Microprogramming, 1995

Flexible Kernel: The AURORA Approach for Multiprocessor Operating System.

[BibT_eX]

Luiz Carlos Zancanella

Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 1995

1993

Os processos de compilação e execução em AURORA.

[BibT_eX]

[DOI]

Luiz Carlos Zancanella

Proceedings of the 7th Brazilian Symposium on Software Engineering, 1993

1988

SARA: A processor interconnection performance analysis tool.

[BibT_eX]

[DOI]

Paulo Fernandes

Maurizio Tazza

Microprocess. Microprogramming, 1988

1982

SSIP - A Processor Interconnection Simulator.

[BibT_eX]

Raul Weber

Jairo Prezzi

Maurizio Tazza

Proceedings of the Parallel and Large-Scale Computers: Performance, 1982

1980

Data Base Processor MAGE.

[BibT_eX]

[DOI]

Gilles Berger-Sabbatel

Proceedings of the Papers of the Fifth Workshop on Computer Architecture for Non-Numeric Processing, 1980

1979

Processeur base de données MAGE : aspect matériel.

[BibT_eX]

[DOI]