Alexander V. Veidenbaum

Constantine D. Polychronopoulos

Proceedings of the 20th Annual International Conference on Supercomputing, 2006

Fast Speculative Address Generation and Way Caching for Reducing L1 Data Cache Energy.

[BibT_eX]

[DOI]

Babak Salamat

Proceedings of the 24th International Conference on Computer Design (ICCD 2006), 2006

Probablistic Self-Scheduling.

[BibT_eX]

[DOI]

Constantine D. Polychronopoulos

Proceedings of the Euro-Par 2006, Parallel Processing, 12th International Euro-Par Conference, Dresden, Germany, August 28, 2006

Challenges in exploitation of loop parallelism in embedded applications.

[BibT_eX]

[DOI]

Arun Kejariwal

Proceedings of the 4th International Conference on Hardware/Software Codesign and System Synthesis, 2006

2005

Line Size Adaptivity Analysis of Parameterized Loop Nests for Direct Mapped Data Cache.

[BibT_eX]

[DOI]

Paolo D'Alberto

IEEE Trans. Computers, 2005

Decoupled State-Execute Architecture.

[BibT_eX]

[DOI]

Miquel Pericàs

Rubén González

Proceedings of the High-Performance Computing - 6th International Symposium, 2005

Using a Way Cache to Improve Performance of Set-Associative Caches.

[BibT_eX]

[DOI]

Proceedings of the High-Performance Computing - 6th International Symposium, 2005

An asymmetric clustered processor based on value content.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual International Conference on Supercomputing, 2005

A New Pointer-based Instruction Queue Design and Its Power-Performance Evaluation.

[BibT_eX]

[DOI]

Marco Antonio Ramírez

Proceedings of the 23rd International Conference on Computer Design (ICCD 2005), 2005

High performance annotation-aware JVM for Java cards.

[BibT_eX]

[DOI]

Ana Azevedo

Arun Kejariwal

Proceedings of the EMSOFT 2005, 2005

Energy-Effective Instruction Fetch Unit for Wide Issue Processors.

[BibT_eX]

[DOI]

Juan L. Aragón

Proceedings of the Advances in Computer Systems Architecture, 10th Asia-Pacific Conference, 2005

2004

Instructions-Wake-Up mechanism: Power and Timing Evaluation.

[BibT_eX]

[DOI]

Marco A. Ramírez

Res. Comput. Sci., 2004

Guest Editor's Introduction: Application-Specific Processors.

[BibT_eX]

[DOI]

IEEE Micro, 2004

A partitioned instruction queue to reduce instruction wakeup energy.

[BibT_eX]

[DOI]

Marco Antonio Ramírez

Int. J. High Perform. Comput. Netw., 2004

An Optimized Front-End Physical Register File with Banking and Writeback Filtering.

[BibT_eX]

[DOI]

Miquel Pericàs

Rubén González

Proceedings of the Power-Aware Computer Systems, 4th International Workshop, 2004

Caching Values in the Load Store Queue.

[BibT_eX]

[DOI]

Proceedings of the 12th International Workshop on Modeling, 2004

A Content Aware Integer Register File Organization.

[BibT_eX]

[DOI]

Rubén González

Daniel Ortega

Proceedings of the 31st International Symposium on Computer Architecture (ISCA 2004), 2004

Low Energy, Highly-Associative Cache Design for Embedded Processors.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Conference on Computer Design: VLSI in Computers & Processors (ICCD 2004), 2004

Energy-Efficient Design for Highly Associative Instruction Caches in Next-Generation Embedded Processors.

[BibT_eX]

[DOI]

Juan L. Aragón

Ana-Maria Badulescu

Proceedings of the 2004 Design, 2004

2003

Power-Aware Compilation for Register File Energy Reduction.

[BibT_eX]

[DOI]

José L. Ayala

Marisa Luisa López-Vallejo

Int. J. Parallel Program., 2003

Guest Editors' Introduction: Application-Specific Microprocessors.

[BibT_eX]

Alex Orailoglu

IEEE Des. Test Comput., 2003

A Data Cache with Dynamic Mapping.

[BibT_eX]

[DOI]

Paolo D'Alberto

Proceedings of the Languages and Compilers for Parallel Computing, 2003

Reducing data cache energy consumption via cached load/store queue.

[BibT_eX]

[DOI]

Proceedings of the 2003 International Symposium on Low Power Electronics and Design, 2003

A Simple Low-Energy Instruction Wakeup Mechanism.

[BibT_eX]

[DOI]

Marco Antonio Ramírez

Proceedings of the High Performance Computing, 5th International Symposium, 2003

Improving Branch Prediction Accuracy in Embedded Processors in the Presence of Context Switches.

[BibT_eX]

[DOI]

Sudeep Pasricha

Proceedings of the 21st International Conference on Computer Design (ICCD 2003), 2003

Reducing Power Consumption for High-Associativity Data Caches in Embedded Processors.

[BibT_eX]

[DOI]

Marisa Luisa López-Vallejo

Proceedings of the 2003 Design, 2003

Energy Aware Register File Implementation through Instruction Predecode.

[BibT_eX]

[DOI]

José L. Ayala

Carlos A. Lopez

Proceedings of the 14th IEEE International Conference on Application-Specific Systems, 2003

Low Energy Associative Data Caches for Embedded Systems.

[BibT_eX]

[DOI]

Alex Nicolau

Proceedings of the Embedded Software for SoC, 2003

2002

Integrated I-cache Way Predictor and Branch Target Buffer to Reduce Energy Consumption.

[BibT_eX]

[DOI]

Weiyu Tang

Proceedings of the High Performance Computing, 4th International Symposium, 2002

Profile-Based Dynamic Voltage Scheduling Using Program Checkpoints.

[BibT_eX]

[DOI]

Proceedings of the 2002 Design, 2002

2001

Guest Editor's Introduction.

[BibT_eX]

[DOI]

Int. J. Parallel Program., 2001

2000

On Interaction between Interconnection Network Design and Latency Hiding Techniques in Multiprocessors.

[BibT_eX]

[DOI]

J. Supercomput., 2000

Compiler-Directed Cache Assist Adaptivity.

[BibT_eX]

[DOI]

Xiaomei Ji

Proceedings of the High Performance Computing, Third International Symposium, 2000

Compiler-Directed Cache Line Size Adaptivity.

[BibT_eX]

[DOI]

Xiaomei Ji

Proceedings of the Intelligent Memory Systems, Second International Workshop, 2000

1999

Interconnection network organization and its impact on performance and cost in shared memory multiprocessors.

[BibT_eX]

[DOI]

Parallel Comput., 1999

Non-Sequential Instruction Cache Prefetching for Multiple-Issue Processors.

[BibT_eX]

[DOI]

Qingbo Zhao

Abduhl Shameer

Int. J. High Speed Comput., 1999

Adapting cache line size to application behavior.

[BibT_eX]

[DOI]

Proceedings of the 13th international conference on Supercomputing, 1999

1998

Retrospective: The Cedar System.

[BibT_eX]

[DOI]

Constantine D. Polychronopoulos

Pen-Chung Yew

David J. Kuck

David A. Padua

Edward S. Davidson

Kyle A. Gallivan

Proceedings of the 25 Years of the International Symposia on Computer Architecture (Selected Papers)., 1998

1997

Instruction Cache Prefetching Using Multilevel Branch Prediction.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing, International Symposium, 1997

Stride-directed Prefetching for Secondary Caches.

[BibT_eX]

[DOI]

Proceedings of the 1997 International Conference on Parallel Processing (ICPP '97), 1997

The Effect of Limited Network Bandwidth and its Utilization by Latency Hiding Techniques in Large-Scale Shared Memory Systems.

[BibT_eX]

[DOI]

Proceedings of the 1997 Conference on Parallel Architectures and Compilation Techniques (PACT '97), 1997

1995

Combining flow and dependence analyses to expose redundant array accesses.

[BibT_eX]

[DOI]

Int. J. Parallel Program., 1995

On Shortest Path Routing in Single Stage Shuffle-Exchange Networks.

[BibT_eX]

[DOI]

Proceedings of the 7th Annual ACM Symposium on Parallel Algorithms and Architectures, 1995

1994

Scalability of the Cedar system.

[BibT_eX]

[DOI]

Stephen W. Turner

Proceedings of the Proceedings Supercomputing '94, 1994

An Integrated Hardware/Software Data Prefetching Scheme for Shared-Memory Multiprocessors.

[BibT_eX]

[DOI]

Edward H. Gornish

Proceedings of the 1994 International Conference on Parallel Processing, 1994

1993

The Cedar System and an Initial Performance Study.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual International Symposium on Computer Architecture, 1993

Performance Evaluation of Memory Caches in Multiprocessors.

[BibT_eX]

[DOI]

Proceedings of the 1993 International Conference on Parallel Processing, 1993

1992

An Effective Write Policy for Software Coherence Schemes.

[BibT_eX]

[DOI]

Proceedings of the Proceedings Supercomputing '92, 1992

1991

Detecting redundant accesses to array data.

[BibT_eX]

[DOI]

Proceedings of the Proceedings Supercomputing '91, 1991

Comparison and analysis of software and directory coherence schemes.

[BibT_eX]

[DOI]

Proceedings of the Proceedings Supercomputing '91, 1991

Chief: A Parallel Simulation Environment for Parallel Systems.

[BibT_eX]

[DOI]

John D. Bruner

Pen-Chung Yew

Proceedings of the Fifth International Parallel Processing Symposium, Proceedings, Anaheim, California, USA, April 30, 1991

A software coherence scheme with the assistance of directories.

[BibT_eX]

[DOI]

Proceedings of the 5th international conference on Supercomputing, 1991

The Organization of the Cedar System.

[BibT_eX]

Jeff Konicek

Tracy Tilton

Proceedings of the International Conference on Parallel Processing, 1991

An Integrated Hardware/Software Solution for Effective Management of Local Storage in High-Performance Systems.

[BibT_eX]

Proceedings of the International Conference on Parallel Processing, 1991

Preliminary Performance Analysis of the Cedar Multiprocessor Memory System.

[BibT_eX]

Kyle A. Gallivan

William Jalby

Stephen W. Turner

Harry A. G. Wijshoff

Proceedings of the International Conference on Parallel Processing, 1991

1990

Compiler-Directed Cache Management in Multiprocessors.

[BibT_eX]

[DOI]

Computer, 1990

Compiler-directed data prefetching in multiprocessors with memory hierarchies.

[BibT_eX]

[DOI]

Edward H. Gornish

Proceedings of the 4th international conference on Supercomputing, 1990

1989

A version control approach to Cache coherence.

[BibT_eX]

[DOI]

Proceedings of the 3rd international conference on Supercomputing, 1989

1988

A Cache Coherence Scheme With Fast Selective Invalidation.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual International Symposium on Computer Architecture, 1988

Performance of a shared memory system for vector multiprocessors.

[BibT_eX]

[DOI]

Stephen W. Turner

Proceedings of the 2nd international conference on Supercomputing, 1988

Stale Data Detection and Coherence Enforcement Using Flow Analysis.

[BibT_eX]

Proceedings of the International Conference on Parallel Processing, 1988

1987

The Performance of Software-managed Multiprocessor Caches on Parallel Numerical Programs.

[BibT_eX]

[DOI]

Proceedings of the Supercomputing, 1987

1986

A Compiler-Assisted Cache Coherence Solution for Multiprcessors.

[BibT_eX]