Edward S. Davidson

Affiliations:
  • University of Michigan, Ann Arbor, USA


According to our database1, Edward S. Davidson authored at least 97 papers between 1968 and 2014.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2014
Author retrospective for optimum modulo schedules for minimum register requirements.
Proceedings of the ACM International Conference on Supercomputing 25th Anniversary Volume, 2014

2008
A freespace crossbar for multi-core processors.
Proceedings of the 22nd Annual International Conference on Supercomputing, 2008

2004
A Prefetch Taxonomy.
IEEE Trans. Computers, 2004

Probabilistic Predicate-Aware Modulo Scheduling.
Proceedings of the 2nd IEEE / ACM International Symposium on Code Generation and Optimization (CGO 2004), 2004

2003
Call graph prefetching for database applications.
ACM Trans. Comput. Syst., 2003

Predicate-Aware Scheduling: A Technique for Reducing Resource Constraints.
Proceedings of the 1st IEEE / ACM International Symposium on Code Generation and Optimization (CGO 2003), 2003

2002
Boosting trace cache performance with nonhead miss speculation.
Proceedings of the 16th international conference on Supercomputing, 2002

TAXI: Trace Analysis for X86 Interpretation.
Proceedings of the 20th International Conference on Computer Design (ICCD 2002), 2002

2001
Evaluating the Use of Register Queues in Software Pipelined Loops.
IEEE Trans. Computers, 2001

Data prefetching by dependence graph precomputation.
Proceedings of the 28th Annual International Symposium on Computer Architecture, 2001

Allocation by Conflict: A Simple Effective Multilateral Cache Management Scheme.
Proceedings of the 19th International Conference on Computer Design (ICCD 2001), 2001

Branch History Guided Instruction Prefetching.
Proceedings of the Seventh International Symposium on High-Performance Computer Architecture (HPCA'01), 2001

2000
Improving BTB performance in the presence of DLLs.
Proceedings of the 33rd Annual IEEE/ACM International Symposium on Microarchitecture, 2000

Instruction overhead and data locality effects in superscalar processors.
Proceedings of the 2000 IEEE International Symposium on Performance Analysis of Systems and Software, 2000

Register Queues: A New Hardware/Software Approach to Efficient Software Pipelining.
Proceedings of the 2000 International Conference on Parallel Architectures and Compilation Techniques (PACT'00), 2000

1999
Active Management of Data Caches by Exploiting Reuse Information.
IEEE Trans. Computers, 1999

Introduction to "The ENIAC".
Proc. IEEE, 1999

Dual-Issue Scheduling with Spills for Binary Trees.
Proceedings of the Tenth Annual ACM-SIAM Symposium on Discrete Algorithms, 1999

1998
Characterizing Distributed Shared Memory Performance: A Case Study of the Convex SPP1000.
IEEE Trans. Parallel Distributed Syst., 1998

mlcache: A Flexible Multi-Lateral Cache Simulator.
Proceedings of the MASCOTS 1998, 1998

Retrospective: The Cedar System.
Proceedings of the 25 Years of the International Symposia on Computer Architecture (Selected Papers)., 1998

Improving the Throughput of a Pipeline by Insertion of Delays.
Proceedings of the 25 Years of the International Symposia on Computer Architecture (Selected Papers)., 1998

The Cedar System and an Initial Performance Study.
Proceedings of the 25 Years of the International Symposia on Computer Architecture (Selected Papers)., 1998

Effects of Architectural and Technological Advances on the HP/Convex Exemplar's Memory and Communication Performance.
Proceedings of the 25th Annual International Symposium on Computer Architecture, 1998

Configuration Independent Analysis for Characterizing Shared-Memory Applications.
Proceedings of the 12th International Parallel Processing Symposium / 9th Symposium on Parallel and Distributed Processing (IPPS/SPDP '98), March 30, 1998

Utilizing Reuse Information in Data Cache Management.
Proceedings of the 12th international conference on Supercomputing, 1998

Evaluating the performance of active cache management schemes.
Proceedings of the International Conference on Computer Design: VLSI in Computers and Processors, 1998

Origin 2000 Design Enhancements for Communication Intensive Applications.
Proceedings of the 1998 International Conference on Parallel Architectures and Compilation Techniques, 1998

1997
Efficient Formulation for Optimal Modulo Schedulers.
Proceedings of the ACM SIGPLAN '97 Conference on Programming Language Design and Implementation (PLDI), 1997

On High-Bandwidth Data Cache Design for Multi-Issue Processors.
Proceedings of the Thirtieth Annual IEEE/ACM International Symposium on Microarchitecture, 1997

On Effective Data Supply For Multi-Issue Processors.
Proceedings of the Proceedings 1997 International Conference on Computer Design: VLSI in Computers & Processors, 1997

1996
Performance Issues in Integrating Temporality-Based Caching with Prefetching.
Perform. Evaluation, 1996

Minimizing Register Requirements of a Modulo Schedule via Optimum Stage Scheduling.
Int. J. Parallel Program., 1996

A Reduced Multipipeline Machine Description that Preserves Scheduling Constraints.
Proceedings of the ACM SIGPLAN'96 Conference on Programming Language Design and Implementation (PLDI), 1996

Modeling the Communication Performance of the IBM SP2.
Proceedings of IPPS '96, 1996

Profile Driven Weighted Decomposition.
Proceedings of the 10th international conference on Supercomputing, 1996

Reducing Conflicts in Direct-Mapped Caches with a Temporality-Based Design.
Proceedings of the 1996 International Conference on Parallel Processing, 1996

1995
Maximum rate single-phase clocking of a closed pipeline including wave pipelining, stoppability, and startability.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 1995

Stage scheduling: a technique to reduce the register requirements of a modulo schedule.
Proceedings of the 28th Annual International Symposium on Microarchitecture, Ann Arbor, Michigan, USA, November 29, 1995

Register allocation for predicated code.
Proceedings of the 28th Annual International Symposium on Microarchitecture, Ann Arbor, Michigan, USA, November 29, 1995

Optimum Modulo Schedules for Minimum Register Requirements.
Proceedings of the 9th international conference on Supercomputing, 1995

The resource conflict methodology for early-stage design space exploration of superscalar RISC processors.
Proceedings of the 1995 International Conference on Computer Design (ICCD '95), 1995

1994
Minimum register requirements for a modulo schedule.
Proceedings of the 27th Annual International Symposium on Microarchitecture, San Jose, California, USA, November 30, 1994

Optimal local register allocation for a multiple-issue machine.
Proceedings of the 8th international conference on Supercomputing, 1994

Communication in the KSR1 MPP: performance evaluation using synthetic workload experiments.
Proceedings of the 8th international conference on Supercomputing, 1994

A Hierarchical Approach to Modeling and Improving the Performance of Scientific Applications on the KSR1.
Proceedings of the 1994 International Conference on Parallel Processing, 1994

Grouping Array Layouts to Reduce Communication and Improve Locality of Parallel Programs.
Proceedings of the Proceedings 1994 International Conference on Parallel and Distributed Systems, 1994

1993
Synchronization of pipelines.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 1993

Approaching a machine-application bound in delivered performance on scientific code.
Proc. IEEE, 1993


Hierarchical Performance Modeling with MACS: A Case Study of the Convex C-240.
Proceedings of the 20th Annual International Symposium on Computer Architecture, 1993

KSR 1 Multiprocessor: Analysis of Latency Hiding Techniques in a Sparse Solver.
Proceedings of the Seventh International Parallel Processing Symposium, 1993

Evaluating the Communication Performance of MPPs Using Synthetic Sparse Matrix Multiplication Workloads.
Proceedings of the 7th international conference on Supercomputing, 1993

1992
Register requirements of pipelined processors.
Proceedings of the 6th international conference on Supercomputing, 1992

Using constraint geometry to determine maximum rate pipeline clocking.
Proceedings of the 1992 IEEE/ACM International Conference on Computer-Aided Design, 1992

1991
A Performance Comparison of the IBM RS/6000 and the Astronautics ZS-1.
Computer, 1991

An integrated approach to developing manufacturing control software.
Proceedings of the 1991 IEEE International Conference on Robotics and Automation, 1991


Optimal Clocking of Circular Pipelines.
Proceedings of the Proceedings 1991 IEEE International Conference on Computer Design: VLSI in Computer & Processors, 1991

Vector Register Design for Polycyclic Vector Scheduling.
Proceedings of the ASPLOS-IV Proceedings, 1991

1990
Cyclic job shop scheduling using reservation tables.
Proceedings of the 1990 IEEE International Conference on Robotics and Automation, 1990

1988
Pairwise Reduction for the Direct, Parallel Solution of Sparse, Unsymmetric Sets of Linear Equations.
IEEE Trans. Computers, 1988

Polycyclic Vector scheduling vs. Chaining on 1-Port Vector supercomputers.
Proceedings of the Proceedings Supercomputing '88, Orlando, FL, USA, November 12-17, 1988, 1988

Analysis of Memory Referencing Behavior For Design of Local Memories.
Proceedings of the 15th Annual International Symposium on Computer Architecture, 1988

An evaluation of Cray X-MP performance on vectorizable Livermore FORTRAN kernels.
Proceedings of the 2nd international conference on Supercomputing, 1988

1987
Characterization of Branch and Data Dependencies in Programs for Evaluating Pipeline Performance.
IEEE Trans. Computers, 1987

PSOLVE : A Concurrent Algorithm for Solving Sparse Systems of Linear Equations.
Proceedings of the International Conference on Parallel Processing, 1987

1986
Highly Concurrent Scalar Processing.
Proceedings of the 13th Annual Symposium on Computer Architecture, Tokyo, Japan, June 1986, 1986

A Communication Model for Optimizing Hierarchical Multiprocessor Systems.
Proceedings of the International Conference on Parallel Processing, 1986

Features of the Structured Memory Access (SMA) Architecture.
Proceedings of the Spring COMPCON'86, 1986

A Broader Range of Possible Answers to the Issues Raised by RISC.
Proceedings of the Spring COMPCON'86, 1986

1985
An Efficient LISP-Execution Architecture with a New Representation for List Structures.
Proceedings of the 12th Annual Symposium on Computer Architecture, 1985

TIDBITS: Speedup Via Time-Delay Bit-Slicing in ALU Design for VLSI Technology.
Proceedings of the 12th Annual Symposium on Computer Architecture, 1985

A custom-designed integrated circuit for the realization of residue number digital filters.
Proceedings of the IEEE International Conference on Acoustics, 1985

1984
Design of Instruction Set Architectures for Support of High-Level Languages .
Proceedings of the 11th Annual Symposium on Computer Architecture, 1984

1983
Shared Cache for Multiple-Stream Computer Systems.
IEEE Trans. Computers, 1983

Performance of Shared Cache for Parallel-Pipelined Computer Systems
Proceedings of the 10th Annual Symposium on Computer Architecture, 1983, 1983

Structured Memory Access Architecture.
Proceedings of the International Conference on Parallel Processing, 1983

1982
Memory Interference in Synchronous Multiprocessor Systems.
IEEE Trans. Computers, 1982

Evaluating database management systems.
Proceedings of the American Federation of Information Processing Societies: 1982 National Computer Conference, 1982

1981
DMIN: An Algorithm for Computing the Optimal Dynamic Allocation in a Virtual Memory Computer.
IEEE Trans. Software Eng., 1981

A Comparison of Dynamic and Static Virtual Memory Allocation Algorithms.
IEEE Trans. Software Eng., 1981

1980
Computer System Design Using a Hierarchical Approach to Performance Evaluation.
Commun. ACM, 1980

A Multiple Stream Microprocessor Prototype System: AMP-1.
Proceedings of the 7th Annual Symposium on Computer Architecture, 1980

1979
Special Feature: Developing a Multiple-Instruction-Stream Single-Chip Processor.
Computer, 1979

1978
Performance Evaluation of Highly Concurrent Computers by Deterministic Simulation.
Commun. ACM, 1978

1977
Organization of Semiconductor Memories for Parallel-Pipelined Processors.
IEEE Trans. Computers, 1977

Information Content of CPU Memory Referencing Behavior.
Proceedings of the 4th Annual Symposium on Computer Architecture, 1977

1974
Redundancy Testing in Combinational Networks.
IEEE Trans. Computers, 1974

A multiminiprocessor system implemented through pipelining.
Computer, 1974

Optimal Searching Algorithms for Parallel Pipelined Computers.
Proceedings of the Parallel Processing, Proceedings of the Sagamore Computer Conference, 1974

1972
Comments on "A Minimization Technique for TANT Networks".
IEEE Trans. Computers, 1972

A Transform for NAND Network Design.
IEEE Trans. Computers, 1972

1969
Authors' Reply<sup>4</sup>.
IEEE Trans. Computers, 1969

An Algorithm for NAND Decomposition Under Network Constraints.
IEEE Trans. Computers, 1969

1968
An Algorithm for Nand Decomposition of Combinational Switching Functions
PhD thesis, 1968

Comments on "An Algorithm for Synthesis of Multiple-Output Combinational Logic".
IEEE Trans. Computers, 1968


  Loading...