Guido Araujo

Matthew Gaudet

Gleison Souza Diniz Mendonca

Parallel Comput., 2016

Parallel Computation for the All-Pairs Suffix-Prefix Problem.

[BibT_eX]

[DOI]

Proceedings of the String Processing and Information Retrieval, 2016

Automatic Insertion of Copy Annotation in Data-Parallel Programs.

[BibT_eX]

[DOI]

Breno Campos Ferreira Guimarães

Péricles Rafael Oliveira Alves

Fernando Magno Quintão Pereira

Proceedings of the 28th International Symposium on Computer Architecture and High Performance Computing, 2016

Evaluating and Improving Thread-Level Speculation in Hardware Transactional Memories.

[BibT_eX]

[DOI]

Juan Salamanca

Divino Cesar Soares Lucas

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

Task parallel programming model + hardware acceleration = performance advantage.

[BibT_eX]

[DOI]

Tamer Dallou

Lucas Morais

Eduardo Ferreira Barbosa

Michael Frank

Richard Bagley

Raj Sayana

Proceedings of the 2016 IEEE Hot Chips 28 Symposium (HCS), 2016

Cylindrical Reconvergence Physical Unclonable Function.

[BibT_eX]

[DOI]

Proceedings of the 2016 Euromicro Conference on Digital System Design, 2016

2015

Guest Editorial: SBAC-PAD 2013.

[BibT_eX]

[DOI]

Int. J. Parallel Program., 2015

Improving the Statistical Variability of Delay-based Physical Unclonable Functions.

[BibT_eX]

[DOI]

Jefferson Capovilla

Mario Lúcio Côrtes

Proceedings of the 28th Symposium on Integrated Circuits and Systems Design, 2015

Using Hardware Transactional Memory to Enable Speculative Trace Optimization.

[BibT_eX]

[DOI]

Juan Salamanca

Proceedings of the 2015 International Symposium on Computer Architecture and High Performance Computing Workshops, 2015

Serialization Management for Best-Effort Hardware Transactional Memory.

[BibT_eX]

[DOI]

Matthew Gaudet

Proceedings of the 27th International Symposium on Computer Architecture and High Performance Computing, 2015

Performance implications of dynamic memory allocators on transactional memory systems.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015

The Batched DOACROSS loop parallelization algorithm.

[BibT_eX]

[DOI]

Divino Cesar S. Lucas

Proceedings of the 2015 International Conference on High Performance Computing & Simulation, 2015

Computer security by hardware-intrinsic authentication.

[BibT_eX]

[DOI]

Proceedings of the 2015 International Conference on Hardware/Software Codesign and System Synthesis, 2015

2014

Microcode Compression Using Structured-Constrained Clustering.

[BibT_eX]

[DOI]

Maurício Breternitz Jr.

Youfeng Wu

Int. J. Parallel Program., 2014

Cloud-based OpenMP Parallelization Using a MapReduce Runtime.

[BibT_eX]

[DOI]

Rodolfo Wottrich

Proceedings of the 26th IEEE International Symposium on Computer Architecture and High Performance Computing, 2014

Multi-dimensional Evaluation of Haswell's Transactional Memory Performance.

[BibT_eX]

[DOI]

Matthew Gaudet

Proceedings of the 26th IEEE International Symposium on Computer Architecture and High Performance Computing, 2014

Loop-Carried Dependence Verification in OpenMP.

[BibT_eX]

[DOI]

Juan Salamanca

Luis Mattos

Proceedings of the Using and Improving OpenMP for Devices, Tasks, and More, 2014

Measuring Effective Work to Reward Success in Dynamic Transaction Scheduling.

[BibT_eX]

[DOI]

Proceedings of the 43rd International Conference on Parallel Processing, 2014

Wear-out analysis of Error Correction Techniques in Phase-Change Memory.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2014

2013

Extending decoupled software pipeline to parallelize Java programs.

[BibT_eX]

[DOI]

André Loureiro

João Paulo Porto

Softw. Pract. Exp., 2013

Transaction Scheduling Using Dynamic Conflict Avoidance.

[BibT_eX]

[DOI]

Int. J. Parallel Program., 2013

Modeling virtual machines misprediction overhead.

[BibT_eX]

[DOI]

Divino Cesar S. Lucas

Proceedings of the IEEE International Symposium on Workload Characterization, 2013

Transaction scheduling using conflict avoidance and Contention Intensity.

[BibT_eX]

[DOI]

Luiz Eduardo Buzato

Proceedings of the 20th Annual International Conference on High Performance Computing, 2013

Cache-based cross-iteration coherence for speculative parallelization.

[BibT_eX]

[DOI]

Andre Baixo

João Paulo Porto

Proceedings of the 20th Annual International Conference on High Performance Computing, 2013

2012

Data center power and performance optimization through global selection of P-states and utilization rates.

[BibT_eX]

[DOI]

Reinaldo A. Bergamaschi

Sustain. Comput. Informatics Syst., 2012

Computational reflection and its application to platform verification.

[BibT_eX]

[DOI]

Bruno C. Albertini

Sandro Rigo

Des. Autom. Embed. Syst., 2012

Exploring Dynamic Program Behavior with Frames and Phases.

[BibT_eX]

[DOI]

Divino Cesar S. Lucas

Proceedings of the 13th Symposium on Computer Systems, 2012

2011

Structure-Constrained Microcode Compression.

[BibT_eX]

[DOI]

Maurício Breternitz Jr.

Youfeng Wu

Proceedings of the 23rd International Symposium on Computer Architecture and High Performance Computing, 2011

LUTS: A Lightweight User-Level Transaction Scheduler.

[BibT_eX]

[DOI]

Proceedings of the Algorithms and Architectures for Parallel Processing, 2011

2010

ISAMAP: Instruction Mapping Driven by Dynamic Binary Translation.

[BibT_eX]

[DOI]

Maxwell Souza

Proceedings of the Computer Architecture, 2010

Trace Execution Automata in Dynamic Binary Translation.

[BibT_eX]

[DOI]

Proceedings of the Computer Architecture, 2010

Reducing False Aborts in STM Systems.

[BibT_eX]

[DOI]

Proceedings of the Algorithms and Architectures for Parallel Processing, 2010

T-DRE: a hardware trusted computing base for direct recording electronic vote machines.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Sixth Annual Computer Security Applications Conference, 2010

2009

A Multi-Model Engine for High-Level Power Estimation Accuracy Optimization.

[BibT_eX]

[DOI]

Felipe Klein

Roberto Leao

Luiz C. V. dos Santos

IEEE Trans. Very Large Scale Integr. Syst., 2009

Characterizing the Energy Consumption of Software Transactional Memory.

[BibT_eX]

[DOI]

IEEE Comput. Archit. Lett., 2009

On the energy-efficiency of software transactional memory.

[BibT_eX]

[DOI]

Proceedings of the 22st Annual Symposium on Integrated Circuits and Systems Design: Chip on the Dunes, 2009

2008

Instruction Scheduling Based on Subgraph Isomorphism for a High Performance Computer Processor.

[BibT_eX]

[DOI]

Ricardo Santos

J. Univers. Comput. Sci., 2008

2007

A Custom Instruction Approach for Hardware and Software Implementations of Finite Field Arithmetic over F2163 using Gaussian Normal Bases.

[BibT_eX]

[DOI]

Marcio Juliato

Julio César López-Hernández

Ricardo Dahab

J. VLSI Signal Process., 2007

A Flexible Platform Framework for Rapid Transactional Memory Systems Prototyping and Evaluation.

[BibT_eX]

[DOI]

Proceedings of the 18th IEEE International Workshop on Rapid System Prototyping (RSP 2007), 2007

A Methodology and Toolset to Enable SystemC and VHDL Co-simulation.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE Computer Society Annual Symposium on VLSI (ISVLSI 2007), 2007

On the Limitations of Power Macromodeling Techniques.

[BibT_eX]

[DOI]

Luiz C. V. dos Santos

Proceedings of the 2007 IEEE Computer Society Annual Symposium on VLSI (ISVLSI 2007), 2007

A multi-model power estimation engine for accuracy optimization.

[BibT_eX]

[DOI]

Luiz C. V. dos Santos

Proceedings of the 2007 International Symposium on Low Power Electronics and Design, 2007

The Image Forest Transform Architecture.

[BibT_eX]

[DOI]

Fabio Augusto Cappabianco

Alexandre X. Falcão

Proceedings of the 2007 International Conference on Field-Programmable Technology, 2007

A computational reflection mechanism to support platform debugging in SystemC.

[BibT_eX]

[DOI]

Bruno C. Albertini

Sandro Rigo

Edna Barros

Willians Azevedo

Proceedings of the 5th International Conference on Hardware/Software Codesign and System Synthesis, 2007

2006

Offset assignment using simultaneous variable coalescing.

[BibT_eX]

[DOI]

ACM Trans. Embed. Comput. Syst., 2006

Exploiting dynamic reconfiguration techniques: the 2D-VLIW approach.

[BibT_eX]

[DOI]

Ricardo Santos

Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

Clustering-Based Microcode Compression.

[BibT_eX]

[DOI]

Maurício Breternitz Jr.

Youfeng Wu

Proceedings of the 24th International Conference on Computer Design (ICCD 2006), 2006

Software-Based Transparent and Comprehensive Control-Flow Error Detection.

[BibT_eX]

[DOI]

Proceedings of the Fourth IEEE/ACM International Symposium on Code Generation and Optimization (CGO 2006), 2006

2D-VLIW: An Architecture Based on the Geometry of Computation.

[BibT_eX]

[DOI]

Ricardo Santos

Proceedings of the 2006 IEEE International Conference on Application-Specific Systems, 2006

2005

Efficient datapath merging for partially reconfigurable architectures.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2005

Dynamic binary control-flow errors detection.

[BibT_eX]

[DOI]

SIGARCH Comput. Archit. News, 2005

The datapath merging problem in reconfigurable systems: Complexity, dual bounds and heuristic evaluation.

[BibT_eX]

[DOI]

ACM J. Exp. Algorithmics, 2005

The ArchC Architecture Description Language and Tools.

[BibT_eX]

[DOI]

Edna Barros

Int. J. Parallel Program., 2005

Platform designer: An approach for modeling multiprocessor platforms based on SystemC.

[BibT_eX]

[DOI]

Des. Autom. Embed. Syst., 2005

A SystemC-only design methodology and the CINE-IP multimedia platform.

[BibT_eX]

[DOI]

Karina R. G. da Silva

Bruno O. Prado

Manoel Eusébio de Lima

Des. Autom. Embed. Syst., 2005

Design of a decompressor engine on a SPARC processor.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Symposium on Integrated Circuits and Systems Design, 2005

High-Level Switching Activity Prediction Through Sampled Monitored Simulation.

[BibT_eX]

[DOI]

Felipe Klein

Proceedings of the 2005 International Symposium on System-on-Chip, 2005

A custom instruction approach for hardware and software implementations of finite field arithmetic over F263 using Gaussian normal bases.

[BibT_eX]

Marcio Juliato

Julio César López-Hernández

Ricardo Dahab

Proceedings of the 2005 IEEE International Conference on Field-Programmable Technology, 2005

Processor Centric Specification and Modelling of MPSoCs.

[BibT_eX]

[DOI]

Edna Barros

Proceedings of the Forum on specification and Design Languages, 2005

2004

The design of dynamically reconfigurable datapath coprocessors.

[BibT_eX]

[DOI]

ACM Trans. Embed. Comput. Syst., 2004

The Datapath Merging Problem in Reconfigurable Systems: Lower Bounds and Heuristic Evaluation.

[BibT_eX]

[DOI]

Proceedings of the Experimental and Efficient Algorithms, Third International Workshop, 2004

Teaching computer architecture using an architecture description language.

[BibT_eX]

[DOI]

Proceedings of the 2004 workshop on Computer architecture education, 2004

An automatic testbench generation tool for a SystemC functional verification methodology.

[BibT_eX]

[DOI]

Karina R. G. da Silva

Elmar U. K. Melcher

Valdiney Alves Pimenta

Proceedings of the 17th Annual Symposium on Integrated Circuits and Systems Design, 2004

ArchC: A SystemC-Based Architecture Description Language.

[BibT_eX]

[DOI]

Proceedings of the 16th Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2004), 2004

Multi-Profile Instruction Based Compression.

[BibT_eX]

[DOI]

Proceedings of the 16th Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2004), 2004

Optimizations for Compiled Simulation Using Instruction Type Information.

[BibT_eX]

[DOI]

Proceedings of the 16th Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2004), 2004

Fast instruction set custornization.

[BibT_eX]

[DOI]

Proceedings of the 2nd Workshop on Embedded Systems for Real-Time Multimedia, 2004

Modeling and Simulating Memory Hierarchies in a Platform-Based Design Methodology.

[BibT_eX]

[DOI]

Proceedings of the 2004 Design, 2004

Multi-profile based code compression.

[BibT_eX]

[DOI]

Proceedings of the 41th Design Automation Conference, 2004

2003

Address register allocation for arrays in loops of embedded programs.

[BibT_eX]

[DOI]

Guilherme Ottoni

Microelectron. J., 2003

Improving Offset Assignment through Simultaneous Variable Coalescing.

[BibT_eX]

[DOI]

Proceedings of the Software and Compilers for Embedded Systems, 7th International Workshop, 2003

Exploring Memory Hierarchy with ArchC.

[BibT_eX]

[DOI]

Proceedings of the 15th Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2003), 2003

Mixed static/dynamic profiling for dictionary based code compression.

[BibT_eX]

[DOI]

Proceedings of the 2003 International Symposium on System-on-Chip, 2003

2002

Global array reference allocation.

[BibT_eX]

[DOI]

Guilherme Ottoni

Marcelo Silva Cintra

ACM Trans. Design Autom. Electr. Syst., 2002

Datapath Merging and Interconnection Sharing for Reconfigurable Architectures.

[BibT_eX]

[DOI]

Proceedings of the 15th International Symposium on System Synthesis (ISSS 2002), 2002

2001

A retargetable VLIW compiler framework for DSPs withinstruction-level parallelism.

[BibT_eX]

[DOI]

Subramanian Rajagopalan

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2001

Optimal Live Range Merge for Address Register Allocation in Embedded Programs.

[BibT_eX]

[DOI]

Guilherme Ottoni

Sandro Rigo

Subramanian Rajagopalan

Proceedings of the Compiler Construction, 10th International Conference, 2001

Tailoring pipeline bypassing and functional unit mapping to application in clustered VLIW architectures.

[BibT_eX]

[DOI]

Proceedings of the 2001 International Conference on Compilers, 2001

2000

Expression-tree-based algorithms for code compression on embedded RISC architectures.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., 2000

Array Reference Allocation Using SSA-Form and Live Range Growth.

[BibT_eX]

[DOI]

Marcelo Silva Cintra

Proceedings of the Languages, 2000

1999

Compressed Code Execution on DSP Architectures.

[BibT_eX]

[DOI]

Ricardo Pannain

Proceedings of the 12th International Symposium on System Synthesis, 1999

1998

Code generation for fixed-point DSPs.

[BibT_eX]

[DOI]

ACM Trans. Design Autom. Electr. Syst., 1998

Code Compression Based on Operand Factorization.

[BibT_eX]

[DOI]

Proceedings of the 31st Annual IEEE/ACM International Symposium on Microarchitecture, 1998

1996

Instruction Set Design and Optimizations for Address Computation in DSP Architectures.

[BibT_eX]

[DOI]

Ashok Sudarsanam

Proceedings of the 9th International Symposium on System Synthesis, 1996

Using Register-Transfer Paths in Code Generation for Heterogeneous Memory-Register Architectures.

[BibT_eX]

[DOI]

Mike Tien-Chien Lee

Proceedings of the 33st Conference on Design Automation, 1996

1995

Optimal code generation for embedded memory non-homogeneous register architectures.

[BibT_eX]

[DOI]