We stand with Ukraine

We stand with Ukraine

Olivier Temam

According to our database¹, Olivier Temam authored at least 107 papers between 1992 and 2020.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2020

ParaML: A Polyvalent Multicore Accelerator for Machine Learning.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020

2017

An Accelerator for High Efficient Vision Processing.

[DOI]

,

,

Robert Fasthuber

,

,

,

,

,

,

,

,

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2017

DaDianNao: A Neural Network Supercomputer.

[DOI]

,

,

,

,

,

,

,

,

IEEE Trans. Computers, 2017

Introduction to the workshop on trends in machine learning.

[DOI]

,

Proceedings of the Workshop on Trends in Machine-Learning (and impact on computer architecture), 2017

2016

DianNao family: energy-efficient hardware accelerators for machine learning.

[DOI]

,

,

,

,

Commun. ACM, 2016

Enabling future progress in machine-learning.

[DOI]

Proceedings of the 2016 IEEE Symposium on VLSI Circuits, 2016

2015

Robust Design Space Modeling.

[DOI]

,

,

,

,

,

,

ACM Trans. Design Autom. Electr. Syst., 2015

A Small-Footprint Accelerator for Large-Scale Neural Networks.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

ACM Trans. Comput. Syst., 2015

Leveraging the Error Resilience of Neural Networks for Designing Highly Energy Efficient Accelerators.

[DOI]

,

Avinash Lingamneni

,

,

Krishna V. Palem

,

,

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2015

Statistical Performance Comparisons of Computers.

[DOI]

,

,

,

,

,

,

IEEE Trans. Computers, 2015

Practical Iterative Optimization for the Data Center.

[DOI]

,

,

,

Lieven Eeckhout

,

,

,

,

ACM Trans. Archit. Code Optim., 2015

Alternative Computing Designs and Technologies.

[DOI]

,

IEEE Micro, 2015

A High-Throughput Neural Network Accelerator.

[DOI]

,

,

,

,

,

,

IEEE Micro, 2015

Cluster Cache Monitor: Leveraging the Proximity Data in CMP.

[DOI]

,

,

,

,

Int. J. Parallel Program., 2015

Neuromorphic accelerators: a comparison between neuroscience and machine-learning approaches.

[DOI]

,

Daniel D. Ben-Dayan Rubin

,

,

,

,

,

,

Proceedings of the 48th International Symposium on Microarchitecture, 2015

ShiDianNao: shifting vision processing closer to the sensor.

[DOI]

,

Robert Fasthuber

,

,

,

,

,

,

,

Proceedings of the 42nd Annual International Symposium on Computer Architecture, 2015

Hardware Neural Networks: From Inflated Expectations to Plateau of Productivity.

[DOI]

Proceedings of the Federated Computing Research Conference, 2015

Retraining-based timing error mitigation for hardware neural networks.

[DOI]

,

,

,

,

,

,

,

,

,

,

Proceedings of the 2015 Design, Automation & Test in Europe Conference & Exhibition, 2015

PuDianNao: A Polyvalent Machine Learning Accelerator.

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Twentieth International Conference on Architectural Support for Programming Languages and Operating Systems, 2015

2014

Performance Portability Across Heterogeneous SoCs Using a Generalized Library-Based Approach.

[DOI]

,

,

,

,

,

Lieven Eeckhout

,

,

,

,

ACM Trans. Archit. Code Optim., 2014

DaDianNao: A Machine-Learning Supercomputer.

[DOI]

,

,

,

,

,

,

,

,

,

,

Proceedings of the 47th Annual IEEE/ACM International Symposium on Microarchitecture, 2014

ArchRanker: A ranking approach to design space exploration.

[DOI]

,

,

,

,

,

,

Proceedings of the ACM/IEEE 41st International Symposium on Computer Architecture, 2014

A low-cost memory interface for high-throughput accelerators.

[DOI]

,

,

,

,

,

Proceedings of the 2014 International Conference on Compilers, 2014

The improbable but highly appropriate marriage of 3D stacking and neuromorphic accelerators.

[DOI]

,

Alexandre Valentian

,

,

,

,

Proceedings of the 2014 International Conference on Compilers, 2014

DianNao: a small-footprint high-throughput accelerator for ubiquitous machine-learning.

[DOI]

,

,

,

,

,

,

Proceedings of the Architectural Support for Programming Languages and Operating Systems, 2014

Leveraging the error resilience of machine-learning applications for designing highly energy efficient accelerators.

[DOI]

,

Krishna V. Palem

,

Lingamneni Avinash

,

,

,

Proceedings of the 19th Asia and South Pacific Design Automation Conference, 2014

Advanced technologies for brain-inspired computing.

[DOI]

Fabien Clermidy

,

Rodolphe Héliot

,

Alexandre Valentian

,

Christian Gamrat

,

Olivier Bichler

,

,

,

Proceedings of the 19th Asia and South Pacific Design Automation Conference, 2014

2013

Cluster Cache Monitor.

[DOI]

,

,

,

,

Proceedings of the 25th International Symposium on Computer Architecture and High Performance Computing, 2013

Continuous real-world inputs can open up alternative accelerator designs.

[DOI]

,

Antoine Joubert

,

,

Rodolphe Héliot

,

Proceedings of the 40th Annual International Symposium on Computer Architecture, 2013

Elastic CGRAs.

[DOI]

,

,

,

,

Proceedings of the 2013 ACM/SIGDA International Symposium on Field Programmable Gate Arrays, 2013

Hardware neural network accelerators.

[DOI]

Proceedings of the International Conference on Hardware/Software Codesign and System Synthesis, 2013

2012

Deconstructing iterative optimization.

[DOI]

,

,

,

Lieven Eeckhout

,

,

,

ACM Trans. Archit. Code Optim., 2012

SWAP: Parallelization through Algorithm Substitution.

[DOI]

,

,

,

Lieven Eeckhout

,

,

IEEE Micro, 2012

Configurable conduction delay circuits for high spiking rates.

[DOI]

,

Antoine Joubert

,

,

Rodolphe Héliot

Proceedings of the 2012 IEEE International Symposium on Circuits and Systems, 2012

A defect-tolerant accelerator for emerging high-performance applications.

[DOI]

Proceedings of the 39th International Symposium on Computer Architecture (ISCA 2012), 2012

Hardware spiking neurons design: Analog or digital?

[DOI]

Antoine Joubert

,

,

,

Rodolphe Héliot

Proceedings of the 2012 International Joint Conference on Neural Networks (IJCNN), 2012

BenchNN: On the broad potential application scope of hardware neural network accelerators.

[DOI]

,

,

,

,

,

Mikko H. Lipasti

,

,

,

,

Proceedings of the 2012 IEEE International Symposium on Workload Characterization, 2012

Statistical performance comparisons of computers.

[DOI]

,

,

,

,

,

Proceedings of the 18th IEEE International Symposium on High Performance Computer Architecture, 2012

Capacitance of TSVs in 3-D stacked chips a problem?: not for neuromorphic systems!

[DOI]

Antoine Joubert

,

,

,

,

Rodolphe Héliot

Proceedings of the 49th Annual Design Automation Conference 2012, 2012

Iterative optimization for the data center.

[DOI]

,

,

Lieven Eeckhout

,

,

Proceedings of the 17th International Conference on Architectural Support for Programming Languages and Operating Systems, 2012

2011

Milepost GCC: Machine Learning Enabled Self-tuning Compiler.

[DOI]

,

Yuriy Kashnikov

,

Abdul Wahid Memon

,

Zbigniew Chamski

,

,

Mircea Namolaru

,

,

Bilha Mendelson

,

,

,

François Bodin

,

,

,

Edwin V. Bonilla

,

,

Christopher K. I. Williams

,

Michael F. P. O'Boyle

Int. J. Parallel Program., 2011

How sensitive is processor customization to the workload's input datasets?

[DOI]

Maximilien Breughe

,

,

,

,

,

,

Lieven Eeckhout

Proceedings of the IEEE 9th Symposium on Application Specific Processors, 2011

Automatic abstraction and fault tolerance in cortical microachitectures.

[DOI]

,

,

,

Mikko H. Lipasti

Proceedings of the 38th International Symposium on Computer Architecture (ISCA 2011), 2011

A Very Fast Simulator for Exploring the Many-Core Future.

[DOI]

Olivier Certner

,

,

,

Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Implementation of signal processing tasks on neuromorphic hardware.

[DOI]

,

Rodolphe Héliot

Proceedings of the 2011 International Joint Conference on Neural Networks, 2011

2010

Collective optimization: A practical collaborative approach.

[DOI]

,

ACM Trans. Archit. Code Optim., 2010

ArchExplorer for Automatic Design Space Exploration.

[DOI]

,

,

,

,

IEEE Micro, 2010

CMA: Chip multi-accelerator.

[DOI]

,

,

,

,

Proceedings of the IEEE 8th Symposium on Application Specific Processors, 2010

Transparent sampling.

[DOI]

Taj Muhammad Khan

,

Daniel Gracia Pérez

,

Proceedings of the 2010 International Conference on Embedded Computer Systems: Architectures, 2010

Evaluating iterative optimization across 1000 datasets.

[DOI]

,

,

Lieven Eeckhout

,

,

,

,

Proceedings of the 2010 ACM SIGPLAN Conference on Programming Language Design and Implementation, 2010

ArchExplorer.org: A methodology for facilitating a fair Comparison of research ideas.

[DOI]

,

,

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2010

The rebirth of neural networks.

[DOI]

Proceedings of the 37th International Symposium on Computer Architecture (ISCA 2010), 2010

A memory interface for multi-purpose multi-stream accelerators.

[DOI]

,

,

,

,

Proceedings of the 2010 International Conference on Compilers, 2010

Scalable hardware support for conditional parallelization.

[DOI]

,

Olivier Certner

,

,

Proceedings of the 19th International Conference on Parallel Architectures and Compilation Techniques, 2010

2009

Reconciling specialization and flexibility through compound circuits.

[DOI]

,

,

,

Proceedings of the 15th International Conference on High-Performance Computer Architecture (HPCA-15 2009), 2009

Collective Optimization.

[DOI]

,

Proceedings of the High Performance Embedded Architectures and Compilers, 2009

2008

A Practical Approach for Reconciling High and Predictable Performance in Non-Regular Parallel Programs.

[DOI]

Olivier Certner

,

,

,

,

,

Proceedings of the Design, Automation and Test in Europe, 2008

2007

Quick and Practical Run-Time Evaluation of Multiple Program Optimizations.

[DOI]

,

,

Michael F. P. O'Boyle

,

Trans. High Perform. Embed. Archit. Compil., 2007

High-Performance Embedded Architecture and Compilation Roadmap.

[DOI]

Koen De Bosschere

,

,

Xavier Martorell

,

,

Michael F. P. O'Boyle

,

Dionisios N. Pnevmatikatos

,

,

,

,

,

Trans. High Perform. Embed. Archit. Compil., 2007

Modeling self-developing biological neural networks.

[DOI]

,

Neurocomputing, 2007

UNISIM: An Open Simulation Environment and Library for Complex Architecture Design and Collaborative Development.

[DOI]

David I. August

,

,

,

Daniel Gracia Pérez

,

Gilles Mouchard

,

,

,

Neil Vachharajani

IEEE Comput. Archit. Lett., 2007

MiDataSets: Creating the Conditions for a More Realistic Evaluation of Iterative Optimization.

[DOI]

,

,

Michael F. P. O'Boyle

,

Proceedings of the High Performance Embedded Architectures and Compilers, 2007

Rapidly Selecting Good Compiler Optimizations using Performance Counters.

[DOI]

,

,

Felix V. Agakov

,

Edwin V. Bonilla

,

Michael F. P. O'Boyle

,

Proceedings of the Fifth International Symposium on Code Generation and Optimization (CGO 2007), 2007

Fast compiler optimisation evaluation using code-feature based performance prediction.

[DOI]

Christophe Dubach

,

,

,

,

Michael F. P. O'Boyle

,

Proceedings of the 4th Conference on Computing Frontiers, 2007

2006

A Sampling Method Focusing on Practicality.

[DOI]

Daniel Gracia Pérez

,

,

IEEE Micro, 2006

Load squared: Adding logic close to memory to reduce the latency of indirect loads in embedded and general systems.

[DOI]

,

Jean-Francois Collard

,

J. Embed. Comput., 2006

Semi-Automatic Composition of Loop Transformations for Deep Parallelism and Memory Hierarchies.

[DOI]

,

Nicolas Vasilache

,

Cédric Bastoul

,

,

,

,

Int. J. Parallel Program., 2006

CAPSULE: Hardware-Assisted Parallel Execution of Component-Based Programs.

[DOI]

,

,

Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-39 2006), 2006

Automatic performance model construction for the fast software exploration of new hardware designs.

[DOI]

,

Christophe Dubach

,

Felix V. Agakov

,

Edwin V. Bonilla

,

Michael F. P. O'Boyle

,

,

Proceedings of the 2006 International Conference on Compilers, 2006

2005

Chaos in computer performance

[DOI]

,

Daniel Gracia Pérez

,

CoRR, 2005

Symbiotic Processing: Toward a Better Balance Between Architecture, Compiler and User Efforts.

,

,

Proceedings of the 1st International Workshop on Reconfigurable Communication-centric Systems-on-Chip, 2005

Characterizing Self-developing Biological Neural Networks: A First Step Towards Their Application to Computing Systems.

[DOI]

,

Proceedings of the Computational Intelligence and Bioinspired Systems, 2005

Facilitating the search for compositions of program transformations.

[DOI]

,

,

,

,

,

Nicolas Vasilache

Proceedings of the 19th Annual International Conference on Supercomputing, 2005

A Practical Method for Quickly Evaluating Program Optimizations.

[DOI]

,

,

Michael F. P. O'Boyle

,

Proceedings of the High Performance Embedded Architectures and Compilers, 2005

2004

A fast and accurate method for determining a lower bound on execution time.

[DOI]

,

Michael F. P. O'Boyle

,

,

Concurr. Comput. Pract. Exp., 2004

Towards a Systematic, Pragmatic and Architecture-Aware Program Optimization Process for Complex Processors.

[DOI]

,

,

,

Jean-Marie Verdun

Proceedings of the ACM/IEEE SC2004 Conference on High Performance Networking and Computing, 2004

MicroLib: A Case for the Quantitative Comparison of Micro-Architecture Mechanisms.

[DOI]

Daniel Gracia Pérez

,

Gilles Mouchard

,

Proceedings of the 37th Annual International Symposium on Microarchitecture (MICRO-37 2004), 2004

Load squared: adding logic close to memory to reduce the latency of indirect loads with high miss ratios.

[DOI]

,

Jean-Francois Collard

,

Proceedings of the 2004 workshop on MEmory performance, 2004

From Sequences of Dependent Instructions to Functions: An Approach for Improving Performance without ILP or Speculation.

[DOI]

,

Proceedings of the 31st International Symposium on Computer Architecture (ISCA 2004), 2004

A Polyhedral Approach to Ease the Composition of Program Transformations.

[DOI]

,

,

Proceedings of the Euro-Par 2004 Parallel Processing, 2004

A New Optimized Implemention of the SystemC Engine Using Acyclic Scheduling.

[DOI]

Daniel Gracia Pérez

,

Gilles Mouchard

,

Proceedings of the 2004 Design, 2004

VHC: Quickly Building an Optimizer for Complex Embedded Architectures.

[DOI]

,

,

Proceedings of the 2nd IEEE / ACM International Symposium on Code Generation and Optimization (CGO 2004), 2004

BLOB computing.

[DOI]

Frédéric Gruau

,

,

,

Proceedings of the First Conference on Computing Frontiers, 2004

2003

DiST: a simple, reliable and scalable method to significantly reduce processor architecture simulation time.

[DOI]

,

Gilles Mouchard

,

,

Proceedings of the International Conference on Measurements and Modeling of Computer Systems, 2003

Putting Polyhedral Loop Transformations to Work.

[DOI]

Cédric Bastoul

,

,

,

,

Proceedings of the Languages and Compilers for Parallel Computing, 2003

2002

Increasing hardware data prefetching performance using the second-level cache.

[DOI]

,

Jean-Luc Béchennec

,

J. Syst. Archit., 2002

Digital LC-2: from bits & gates to a little computer.

[DOI]

,

Proceedings of the 2002 workshop on Computer architecture education, 2002

On increasing architecture awareness in program optimizations to bridge the gap between peak and sustained processor performance: matrix-multiply revisited.

[DOI]

,

,

Jean-Marie Verdun

Proceedings of the 2002 ACM/IEEE conference on Supercomputing, 2002

2000

Load Scheduling with Profile Information.

[DOI]

Götz Lindenmaier

,

Kathryn S. McKinley

,

Proceedings of the Euro-Par 2000, Parallel Processing, 6th International Euro-Par Conference, Munich, Germany, August 29, 2000

1999

Quantifying loop nest locality using SPEC'95 and the perfect benchmarks.

[DOI]

Kathryn S. McKinley

,

ACM Trans. Comput. Syst., 1999

An Algorithm for Optimally Exploiting Spatial and Temporal Locality in Upper Memory Levels.

[DOI]

IEEE Trans. Computers, 1999

1998

Dataflow Analysis of Branch Mispredictions and Its Application to Early Resolution of Branch Outcomes.

[DOI]

Alexandre Farcy

,

,

,

Proceedings of the 31st Annual IEEE/ACM International Symposium on Microarchitecture, 1998

Investigating Optimal Local Memory Performance.

[DOI]

Proceedings of the ASPLOS-VIII Proceedings of the 8th International Conference on Architectural Support for Programming Languages and Operating Systems, 1998

1997

A Cache Visualization Tool.

[DOI]

Eric van der Deijl

,

,

,

Elana D. Granston

Computer, 1997

Data Caches for Superscalar Processors.

[DOI]

,

Juan J. Navarro

,

Proceedings of the 11th international conference on Supercomputing, 1997

1996

Improving Single-Process Performance with Multithreaded Processors.

[DOI]

Alexandre Farcy

,

Proceedings of the 10th international conference on Supercomputing, 1996

Streaming Prefetch.

[DOI]

Proceedings of the Euro-Par '96 Parallel Processing, 1996

A Quantitative Analysis of Loop Nest Locality.

[DOI]

Kathryn S. McKinley

,

Proceedings of the ASPLOS-VII Proceedings, 1996

1995

Influence of Cross-Interferences on Blocked Loops: A Case Study with Matric-Vector Multiply

[DOI]

Christine Fricker

,

,

ACM Trans. Program. Lang. Syst., 1995

Software Assistance for Data Caches.

[DOI]

,

Proceedings of the 1st IEEE Symposium on High-Performance Computer Architecture (HPCA 1995), 1995

1994

Cache Interference Phenomena.

[DOI]

,

Christine Fricker

,

Proceedings of the 1994 ACM SIGMETRICS conference on Measurement and modeling of computer systems, 1994

Using virtual lines to enhance locality exploitation.

[DOI]

,

Proceedings of the 8th international conference on Supercomputing, 1994

1993

To copy or not to copy: a compile-time technique for assessing when data copying should be used to eliminate cache conflicts.

[DOI]

,

Elana D. Granston

,

Proceedings of the Proceedings Supercomputing '93, 1993

Speculative Prefetching.

[DOI]

,

Proceedings of the 7th international conference on Supercomputing, 1993

Evaluating the Impact of Cache Interferences on Numerical Codes.

[DOI]

,

Christine Fricker

,

Proceedings of the 1993 International Conference on Parallel Processing, 1993

Fast Enumeration of Solutions for Data Dependence Analysis and Data Locality Optimization.

[DOI]

Christine Eisenbeis

,

,

Harry A. G. Wijshoff

Proceedings of the 1993 International Conference on Parallel Processing, 1993

1992

Characterizing the Behavior of Sparse Algorithms on Caches.

[DOI]

,

Proceedings of the Proceedings Supercomputing '92, 1992

Loading...