Edward S. Davidson

Proceedings of the 12th international conference on Supercomputing, 1998

Evaluating the performance of active cache management schemes.

Edward S. Tam

Vijayalakshmi Srinivasan

Gary S. Tyson

Proceedings of the International Conference on Computer Design: VLSI in Computers and Processors, 1998

Origin 2000 Design Enhancements for Communication Intensive Applications.

Gheith A. Abandah

Proceedings of the 1998 International Conference on Parallel Architectures and Compilation Techniques, 1998

1997

Efficient Formulation for Optimal Modulo Schedulers.

Proceedings of the ACM SIGPLAN '97 Conference on Programming Language Design and Implementation (PLDI), 1997

On High-Bandwidth Data Cache Design for Multi-Issue Processors.

Proceedings of the Thirtieth Annual IEEE/ACM International Symposium on Microarchitecture, 1997

On Effective Data Supply For Multi-Issue Processors.

Edward S. Tam

Proceedings of the 1997 International Conference on Computer Design: VLSI in Computers & Processors, 1997

1996

Performance Issues in Integrating Temporality-Based Caching with Prefetching.

Perform. Evaluation, 1996

Minimizing Register Requirements of a Modulo Schedule via Optimum Stage Scheduling.

Int. J. Parallel Program., 1996

A Reduced Multipipeline Machine Description that Preserves Scheduling Constraints.

Proceedings of the ACM SIGPLAN'96 Conference on Programming Language Design and Implementation (PLDI), 1996

Modeling the Communication Performance of the IBM SP2.

Gheith A. Abandah

Proceedings of IPPS '96, 1996

Profile Driven Weighted Decomposition.

Karen A. Tomko

Proceedings of the 10th international conference on Supercomputing, 1996

Reducing Conflicts in Direct-Mapped Caches with a Temporality-Based Design.

Proceedings of the 1996 International Conference on Parallel Processing, 1996

1995

Maximum rate single-phase clocking of a closed pipeline including wave pipelining, stoppability, and startability.

Chuan-Hua Chang

Karem A. Sakallah

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 1995

Stage scheduling: a technique to reduce the register requirements of a modulo schedule.

Proceedings of the 28th Annual International Symposium on Microarchitecture, Ann Arbor, Michigan, USA, November 29, 1995

Proceedings of the 28th Annual International Symposium on Microarchitecture, Ann Arbor, Michigan, USA, November 29, 1995

Optimum Modulo Schedules for Minimum Register Requirements.

Proceedings of the 9th international conference on Supercomputing, 1995

The resource conflict methodology for early-stage design space exploration of superscalar RISC processors.

John-David Wellman

Proceedings of the 1995 International Conference on Computer Design (ICCD '95), 1995

1994

Minimum register requirements for a modulo schedule.

Proceedings of the 27th Annual International Symposium on Microarchitecture, San Jose, California, USA, November 30, 1994

Optimal local register allocation for a multiple-issue machine.

Waleed Meleis

Proceedings of the 8th international conference on Supercomputing, 1994

Communication in the KSR1 MPP: performance evaluation using synthetic workload experiments.

Eric L. Boyd

Proceedings of the 8th international conference on Supercomputing, 1994

A Hierarchical Approach to Modeling and Improving the Performance of Scientific Applications on the KSR1.

Proceedings of the 1994 International Conference on Parallel Processing, 1994

Grouping Array Layouts to Reduce Communication and Improve Locality of Parallel Programs.

Tien-Pao Shih

Proceedings of the 1994 International Conference on Parallel and Distributed Systems, 1994

1993

Synchronization of pipelines.

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 1993

Approaching a machine-application bound in delivered performance on scientific code.

Tien-Pao Shih

Proc. IEEE, 1993

The Cedar System and an Initial Performance Study.

Proceedings of the 20th Annual International Symposium on Computer Architecture, 1993

Hierarchical Performance Modeling with MACS: A Case Study of the Convex C-240.

Eric L. Boyd

Proceedings of the 20th Annual International Symposium on Computer Architecture, 1993

KSR 1 Multiprocessor: Analysis of Latency Hiding Techniques in a Sparse Solver.

Proceedings of the Seventh International Parallel Processing Symposium, 1993

Evaluating the Communication Performance of MPPs Using Synthetic Sparse Matrix Multiplication Workloads.

Proceedings of the 7th international conference on Supercomputing, 1993

1992

Proceedings of the 6th international conference on Supercomputing, 1992

Using constraint geometry to determine maximum rate pipeline clocking.

Chuan-Hua Chang

Karem A. Sakallah

Proceedings of the 1992 IEEE/ACM International Conference on Computer-Aided Design, 1992

1991

A Performance Comparison of the IBM RS/6000 and the Astronautics ZS-1.

Computer, 1991

An integrated approach to developing manufacturing control software.

Jarir K. Chaar

Richard A. Volz

Proceedings of the 1991 IEEE International Conference on Robotics and Automation, 1991

The Organization of the Cedar System.

Jeff Konicek

Tracy Tilton

Alexander V. Veidenbaum

Proceedings of the International Conference on Parallel Processing, 1991

Optimal Clocking of Circular Pipelines.

Proceedings of the 1991 IEEE International Conference on Computer Design: VLSI in Computer & Processors, 1991

Vector Register Design for Polycyclic Vector Scheduling.

Proceedings of ASPLOS-IV, 1991

1990

Cyclic job shop scheduling using reservation tables.

Jarir K. Chaar

Proceedings of the 1990 IEEE International Conference on Robotics and Automation, 1990

1988

Pairwise Reduction for the Direct, Parallel Solution of Sparse, Unsymmetric Sets of Linear Equations.

Timothy A. Davis

IEEE Trans. Computers, 1988

Polycyclic Vector scheduling vs. Chaining on 1-Port Vector supercomputers.

J. H. Tang

J. Tong

Proceedings of Supercomputing '88, Orlando, FL, USA, November 12-17, 1988

Analysis of Memory Referencing Behavior For Design of Local Memories.

Geoffrey D. McNiven

Proceedings of the 15th Annual International Symposium on Computer Architecture, 1988

An evaluation of Cray X-MP performance on vectorizable Livermore FORTRAN kernels.

J. H. Tang

Proceedings of the 2nd international conference on Supercomputing, 1988

1987

Characterization of Branch and Data Dependencies in Programs for Evaluating Pipeline Performance.

Philip G. Emma

IEEE Trans. Computers, 1987

PSOLVE : A Concurrent Algorithm for Solving Sparse Systems of Linear Equations.

Timothy A. Davis

Proceedings of the International Conference on Parallel Processing, 1987

1986

Highly Concurrent Scalar Processing.

Peter Y.-T. Hsu

Proceedings of the 13th Annual Symposium on Computer Architecture, Tokyo, Japan, June 1986

A Communication Model for Optimizing Hierarchical Multiprocessor Systems.

Proceedings of the International Conference on Parallel Processing, 1986

Features of the Structured Memory Access (SMA) Architecture.

Proceedings of the Spring COMPCON'86, 1986

A Broader Range of Possible Answers to the Issues Raised by RISC.

Proceedings of the Spring COMPCON'86, 1986

1985

An Efficient LISP-Execution Architecture with a New Representation for List Structures.

Gurindar S. Sohi

Proceedings of the 12th Annual Symposium on Computer Architecture, 1985

TIDBITS: Speedup Via Time-Delay Bit-Slicing in ALU Design for VLSI Technology.

Proceedings of the 12th Annual Symposium on Computer Architecture, 1985

A custom-designed integrated circuit for the realization of residue number digital filters.

W. K. Jenkins

D. F. Paul

Proceedings of the IEEE International Conference on Acoustics, 1985

1984

Design of Instruction Set Architectures for Support of High-Level Languages.

Pradip Bose

Proceedings of the 11th Annual Symposium on Computer Architecture, 1984

1983

Shared Cache for Multiple-Stream Computer Systems.

Phil C. C. Yeh

IEEE Trans. Computers, 1983

Performance of Shared Cache for Parallel-Pipelined Computer Systems.

Phil C. C. Yeh

Proceedings of the 10th Annual Symposium on Computer Architecture, 1983

Structured Memory Access Architecture.

Andrew R. Pleszkun

Proceedings of the International Conference on Parallel Processing, 1983

1982

Memory Interference in Synchronous Multiprocessor Systems.

David W. L. Yen

IEEE Trans. Computers, 1982

Evaluating database management systems.

Proceedings of the American Federation of Information Processing Societies: 1982 National Computer Conference, 1982

1981

DMIN: An Algorithm for Computing the Optimal Dynamic Allocation in a Virtual Memory Computer.

IEEE Trans. Software Eng., 1981

A Comparison of Dynamic and Static Virtual Memory Allocation Algorithms.

Robert L. Budzinski

IEEE Trans. Software Eng., 1981

1980

Computer System Design Using a Hierarchical Approach to Performance Evaluation.

B. Kumar

Commun. ACM, 1980

A Multiple Stream Microprocessor Prototype System: AMP-1.

William Joseph Kaminsky Jr.

Proceedings of the 7th Annual Symposium on Computer Architecture, 1980

1979

Special Feature: Developing a Multiple-Instruction-Stream Single-Chip Processor.

Computer, 1979

1978

Performance Evaluation of Highly Concurrent Computers by Deterministic Simulation.

B. Kumar

Commun. ACM, 1978

1977

Organization of Semiconductor Memories for Parallel-Pipelined Processors.

Faye A. Briggs

IEEE Trans. Computers, 1977

Information Content of CPU Memory Referencing Behavior.

Dan W. Hammerstrom

Proceedings of the 4th Annual Symposium on Computer Architecture, 1977

1974

Redundancy Testing in Combinational Networks.

Hsien-Hsin S. Lee

IEEE Trans. Computers, 1974

A multiminiprocessor system implemented through pipelining.

Leonard E. Shar

Computer, 1974

Optimal Searching Algorithms for Parallel Pipelined Computers.

Daniel L. Weller

Parallel Processing, Proceedings of the Sagamore Computer Conference, 1974

1972

Comments on "A Minimization Technique for TANT Networks".

Hsiao-Peng Lee

IEEE Trans. Computers, 1972

A Transform for NAND Network Design.

Hsiao-Peng Lee

IEEE Trans. Computers, 1972

1969

Authors' Reply.

Gernot Metze

IEEE Trans. Computers, 1969

An Algorithm for NAND Decomposition Under Network Constraints.

IEEE Trans. Computers, 1969

1968

An Algorithm for Nand Decomposition of Combinational Switching Functions.

PhD thesis, 1968

Comments on "An Algorithm for Synthesis of Multiple-Output Combinational Logic".
