José González

IEEE Trans. Parallel Distributed Syst., 2012

2011

CROB: Implementing a Large Instruction Window through Compression.

[BibT_eX]

[DOI]

Trans. High Perform. Embed. Archit. Compil., 2011

Thread shuffling: combining DVFS and thread migration toreduce energy consumptions for multi-core systems.

[BibT_eX]

[DOI]

Proceedings of the 2011 International Symposium on Low Power Electronics and Design, 2011

2010

Thread-management techniques to maximize efficiency in multicore and simultaneous multithreaded microprocessors.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2010

Energy efficiency via thread fusion and value reuse.

[BibT_eX]

[DOI]

IET Comput. Digit. Tech., 2010

Elastic cooperative caching: an autonomous dynamically adaptive memory hierarchy for chip multiprocessors.

[BibT_eX]

[DOI]

Proceedings of the 37th International Symposium on Computer Architecture (ISCA 2010), 2010

Power-Efficient Spilling Techniques for Chip Multiprocessors.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2010 - Parallel Processing, 16th International Euro-Par Conference, Ischia, Italy, August 31, 2010

2009

A lossy 3D wavelet transform for high-quality compression of medical video.

[BibT_eX]

[DOI]

J. Syst. Softw., 2009

Dynamic thermal management using thin-film thermoelectric cooling.

[BibT_eX]

[DOI]

Proceedings of the 2009 International Symposium on Low Power Electronics and Design, 2009

2008

Thread fusion.

[BibT_eX]

[DOI]

Proceedings of the 2008 International Symposium on Low Power Electronics and Design, 2008

Efficient resources assignment schemes for clustered multithreaded processors.

[BibT_eX]

[DOI]

Fernando Latorre

Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

A software-hardware hybrid steering mechanism for clustered microarchitectures.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Distributed cooperative caching.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques, 2008

Meeting points: using thread criticality to adapt multicore hardware to parallel regions.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques, 2008

2007

Understanding the Thermal Implications of Multi-Core Architectures.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2007

An efficient implementation of a 3D wavelet transform based encoder on hyper-threading technology.

[BibT_eX]

[DOI]

Ricardo Fernández Pascual

Manuel E. Acacio

Parallel Comput., 2007

Building a large instruction window through ROB compression.

[BibT_eX]

[DOI]

Proceedings of the 2007 workshop on MEmory performance, 2007

2006

Independent front-end and back-end dynamic voltage scaling for a GALS microarchitecture.

[BibT_eX]

[DOI]

Proceedings of the 2006 International Symposium on Low Power Electronics and Design, 2006

2005

Reducing 3D Fast Wavelet Transform Execution Time Using Blocking and the Streaming SIMD Extensions.

[BibT_eX]

[DOI]

J. VLSI Signal Process., 2005

A Two-Level Directory Architecture for Highly Scalable cc-NUMA Multiprocessors.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2005

Distributing the Frontend for Temperature Reduction.

[BibT_eX]

[DOI]

Proceedings of the 11th International Conference on High-Performance Computer Architecture (HPCA-11 2005), 2005

2004

An Architecture for High-Performance Scalable Shared-Memory Multiprocessors Exploiting On-Chip Integration.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2004

Late Allocation and Early Release of Physical Registers.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2004

Architecture of an automatically tuned linear algebra library.

[BibT_eX]

[DOI]

Javier Cuenca

Domingo Giménez

Parallel Comput., 2004

Cache organizations for clustered microarchitectures.

[BibT_eX]

[DOI]

Fernando Latorre

Proceedings of the 3rd Workshop on Memory Performance Issues, 2004

Back-end assignment schemes for clustered multithreaded processors.

[BibT_eX]

[DOI]

Fernando Latorre

Proceedings of the 18th Annual International Conference on Supercomputing, 2004

Frontend Frequency-Voltage Adaptation for Optimal Energy-Delay^2.

[BibT_eX]

[DOI]

Grigorios Magklis

Proceedings of the 22nd IEEE International Conference on Computer Design: VLSI in Computers & Processors (ICCD 2004), 2004

Thermal-Aware Clustered Microarchitectures.

[BibT_eX]

[DOI]

Pedro Chaparro

Proceedings of the 22nd IEEE International Conference on Computer Design: VLSI in Computers & Processors (ICCD 2004), 2004

2003

Empirical Modelling of Parallel Linear Algebra Routines.

[BibT_eX]

[DOI]

Proceedings of the Parallel Processing and Applied Mathematics, 2003

Automatic Optimisation of Parallel Linear Algebra Routines in Systems with Variable Load.

[BibT_eX]

[DOI]

Proceedings of the 11th Euromicro Workshop on Parallel, 2003

Reducing 3D Wavelet Transform Execution Time through the Streaming SIMD Extensions.

[BibT_eX]

[DOI]

Proceedings of the 11th Euromicro Workshop on Parallel, 2003

Dynamic Cluster Resizing.

[BibT_eX]

[DOI]

Proceedings of the 21st International Conference on Computer Design (ICCD 2003), 2003

Power-Aware Control Speculation through Selective Throttling.

[BibT_eX]

[DOI]

Juan L. Aragón

Proceedings of the Ninth International Symposium on High-Performance Computer Architecture (HPCA'03), 2003

2002

Owner prediction for accelerating cache-to-cache transfer misses in a cc-NUMA architecture.

[BibT_eX]

[DOI]

Proceedings of the 2002 ACM/IEEE conference on Supercomputing, 2002

Towards the Design of an Automatically Tuned Linear Algebra Library.

[BibT_eX]

[DOI]

Javier Cuenca

Domingo Giménez

Proceedings of the 10th Euromicro Workshop on Parallel, 2002

Reducing the Latency of L2 Misses in Shared-Memory Multiprocessors through On-Chip Directory Integration.

[BibT_eX]

[DOI]

Proceedings of the 10th Euromicro Workshop on Parallel, 2002

A Novel Approach to Reduce L2 Miss Latency in Shared-Memory Multiprocessors.

[BibT_eX]

[DOI]

Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002

Dual path instruction processing.

[BibT_eX]

[DOI]

Proceedings of the 16th international conference on Supercomputing, 2002

Memory Conscious 3D Wavelet Transform.

[BibT_eX]

[DOI]

Proceedings of the 28th EUROMICRO Conference 2002, 4-6 September 2002, Dortmund, Germany, 2002

The Use of Prediction for Accelerating Upgrade Misses in cc-NUMA Multiprocessors.

[BibT_eX]

[DOI]

Proceedings of the 2002 International Conference on Parallel Architectures and Compilation Techniques (PACT 2002), 2002

2001

Control-Flow Speculation through Value Prediction.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2001

Modeling the Behaviour of Linear Algebra Algorithms with Message-Passing.

[BibT_eX]

[DOI]

Javier Cuenca

Domingo Giménez

Proceedings of the Ninth Euromicro Workshop on Parallel and Distributed Processing, 2001

Selective Branch Prediction Reversal By Correlating with Data Values and Control Flow.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Computer Design (ICCD 2001), 2001

A New Scalable Directory Architecture for Large-Scale Multiprocessors.

[BibT_eX]

[DOI]

Proceedings of the Seventh International Symposium on High-Performance Computer Architecture (HPCA'01), 2001

Confidence Estimation for Branch Prediction Reversal.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing - HiPC 2001, 8th International Conference, 2001

2000

Dynamic Register Renaming Through Virtual-Physical Registers.

[BibT_eX]

[DOI]

J. Instr. Level Parallelism, 2000

1999

Delaying Physical Register Allocation through Virtual-Physical Registers.

[BibT_eX]

[DOI]

Proceedings of the 32nd Annual IEEE/ACM International Symposium on Microarchitecture, 1999

Control-Flow Speculation through Value Prediction for Superscalar Processors.

[BibT_eX]

[DOI]

Proceedings of the 1999 International Conference on Parallel Architectures and Compilation Techniques, 1999

1998

Data value speculation in superscalar processors.

[BibT_eX]

[DOI]

Microprocess. Microsystems, 1998

Limits of Instruction Level Parallelism with Data Value Speculation.

[BibT_eX]

[DOI]

Proceedings of the Vector and Parallel Processing, 1998

The Potential of Data Value Speculation to Boost ILP.

[BibT_eX]

[DOI]

Proceedings of the 12th international conference on Supercomputing, 1998

Virtual-Physical Registers.

[BibT_eX]

[DOI]

Mateo Valero

Proceedings of the Fourth International Symposium on High-Performance Computer Architecture, Las Vegas, Nevada, USA, January 31, 1998

1997

The Design and Performance of a Conflict-Avoiding Cache.

[BibT_eX]

[DOI]

Nigel P. Topham

Proceedings of the Thirtieth Annual IEEE/ACM International Symposium on Microarchitecture, 1997

Speculative Execution via Address Prediction and Data Prefetching.

[BibT_eX]

[DOI]

Proceedings of the 11th international conference on Supercomputing, 1997

Virtual registers.

[BibT_eX]

[DOI]

Proceedings of the Fourth International on High-Performance Computing, 1997

Memory Address Prediction for Data Speculation.

[BibT_eX]

[DOI]