Proceedings of the 14th IEEE International Conference on High Performance Computing and Communication & 9th IEEE International Conference on Embedded Software and Systems, 2012

PPMC: Hardware scheduling and memory management support for multi accelerators.

[BibT_eX]

[DOI]

Tassadaq Hussain

Miquel Pericàs

Nacho Navarro

Eduard Ayguadé

Proceedings of the 22nd International Conference on Field Programmable Logic and Applications (FPL), 2012

BSArc: blacksmith streaming architecture for HPC accelerators.

[BibT_eX]

[DOI]

Proceedings of the Computing Frontiers Conference, CF'12, 2012

PPMC: A Programmable Pattern Based Memory Controller.

[BibT_eX]

[DOI]

Proceedings of the Reconfigurable Computing: Architectures, Tools and Applications, 2012

2011

Assessing Accelerator-Based HPC Reverse Time Migration.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2011

Local Memory Design Space Exploration for High-Performance Computing.

[BibT_eX]

[DOI]

Comput. J., 2011

TARCAD: A template architecture for reconfigurable accelerator designs.

[BibT_eX]

[DOI]

Proceedings of the IEEE 9th Symposium on Application Specific Processors, 2011

Design space exploration for aggressive core replication schemes in CMPs.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM International Symposium on High Performance Distributed Computing, 2011

Implementation of a Reverse Time Migration kernel using the HCE High Level Synthesis tool.

[BibT_eX]

[DOI]

Proceedings of the 2011 International Conference on Field-Programmable Technology, 2011

FELI: HW/SW Support for On-Chip Distributed Shared Memory in Multicores.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2011 Parallel Processing - 17th International Conference, 2011

DiDi: Mitigating the Performance Impact of TLB Shootdowns Using a Shared TLB Directory.

[BibT_eX]

[DOI]

Proceedings of the 2011 International Conference on Parallel Architectures and Compilation Techniques, 2011

2010

Multicore: The View from Europe.

[BibT_eX]

[DOI]

Mateo Valero

Nacho Navarro

IEEE Micro, 2010

Decomposable and responsive power models for multicore processors using performance counters.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Supercomputing, 2010

FEM: A Step Towards a Common Memory Layout for FPGA Based Accelerators.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Field Programmable Logic and Applications, 2010

An asymmetric distributed shared memory model for heterogeneous parallel systems.

[BibT_eX]

[DOI]

Proceedings of the 15th International Conference on Architectural Support for Programming Languages and Operating Systems, 2010

2009

Linux Kernel Compaction through Cold Code Swapping.

[BibT_eX]

[DOI]

Trans. High Perform. Embed. Archit. Compil., 2009

REMOTE, a Wireless Sensor Network Based System to Monitor Rowing Performance.

[BibT_eX]

[DOI]

Sensors, 2009

CASES 2007 guest editors' introduction.

[BibT_eX]

[DOI]

Steven S. Lumetta

Nacho Navarro

Des. Autom. Embed. Syst., 2009

High-Performance Reverse Time Migration on GPU.

[BibT_eX]

[DOI]

Proceedings of the 2009 International Conference of the Chilean Computer Science Society, 2009

Cetra: A trace and analysis framework for the evaluation of Cell BE systems.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2009

Predictive Runtime Code Scheduling for Heterogeneous Architectures.

[BibT_eX]

[DOI]

Proceedings of the High Performance Embedded Architectures and Compilers, 2009

Exploiting memory customization in FPGA for 3D stencil computations.

[BibT_eX]

[DOI]

Proceedings of the 2009 International Conference on Field-Programmable Technology, 2009

2008

CUBA: an architecture for efficient CPU/co-processor data communication.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual International Conference on Supercomputing, 2008

2007

High-Performance Embedded Architecture and Compilation Roadmap.

[BibT_eX]

[DOI]

Michael F. P. O'Boyle

Dionisios N. Pnevmatikatos

Trans. High Perform. Embed. Archit. Compil., 2007

Implicitly Parallel Programming Models for Thousand-Core Microprocessors.

[BibT_eX]

[DOI]

Proceedings of the 44th Design Automation Conference, 2007

CIGAR: Application Partitioning for a CPU/Coprocessor Architecture.

[BibT_eX]

[DOI]

Proceedings of the 16th International Conference on Parallel Architectures and Compilation Techniques (PACT 2007), 2007

2006

Beating In-Order Stalls with "Flea-Flicker" Two-Pass Pipelining.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2006

Java Virtual Machine: the key for accurated memory prefetching.

[BibT_eX]

Proceedings of the International Conference on Software Engineering Research and Practice & Conference on Programming Languages and Compilers, 2006

2003

Evaluating the importance of virtual memory for Java.

[BibT_eX]

[DOI]

Proceedings of the 2003 IEEE International Symposium on Performance Analysis of Systems and Software, 2003

2001

Strategies for the efficient exploitation of loop-level parallelism in Java.

[BibT_eX]

[DOI]

Concurr. Comput. Pract. Exp., 2001

2000

NanosCompiler: supporting flexible multilevel parallelism exploitation in OpenMP.

[BibT_eX]

[DOI]

Concurr. Pract. Exp., 2000

DITools: Application-level Support for Dynamic Extension and Flexible Composition.

[BibT_eX]

[DOI]

Albert Serra

Nacho Navarro

Toni Cortes

Proceedings of the General Track: 2000 USENIX Annual Technical Conference, 2000

OpenMP Extensions for Thread Groups and Their Run-Time Support.

[BibT_eX]

[DOI]

Proceedings of the Languages and Compilers for Parallel Computing, 2000

A Tool to Schedule Parallel Applications on Multiprocessors: The NANOS CPU MANAGER.

[BibT_eX]

[DOI]

Xavier Martorell

Julita Corbalán

Dimitrios S. Nikolopoulos

Nacho Navarro

Eleftherios D. Polychronopoulos

Theodore S. Papatheodorou

Jesús Labarta

Proceedings of the Job Scheduling Strategies for Parallel Processing, IPDPS 2000 Workshop, 2000

Towards an efficient exploitation of loop-level parallelism in Java.

[BibT_eX]

[DOI]

José Oliver

Eduard Ayguadé

Nacho Navarro

Proceedings of the ACM 2000 Java Grande Conference, San Francisco, CA, USA, 2000

Applying Interposition Techniques for Performance Analysis of OpenMP Parallel Applications.

[BibT_eX]

[DOI]

Proceedings of the 14th International Parallel & Distributed Processing Symposium (IPDPS'00), 2000

1999

Thread fork/join techniques for multi-level parallelism exploitation in NUMA multiprocessors.

[BibT_eX]

[DOI]

Proceedings of the 13th international conference on Supercomputing, 1999

Exploiting Multiple Levels of Parallelism in OpenMP: A Case Study.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Parallel Processing 1999, 1999

1998

Experiences on implementing PARMACS macros to run the SPLASH-2 suite on multiprocessors.

[BibT_eX]

[DOI]

Proceedings of the Sixth Euromicro Workshop on Parallel and Distributed Processing, 1998

Kernel-level Scheduling for the Nano-threads Programming Model.

[BibT_eX]

[DOI]

Eleftherios D. Polychronopoulos

Xavier Martorell

Dimitrios S. Nikolopoulos

Jesús Labarta

Theodore S. Papatheodorou

Nacho Navarro

Proceedings of the 12th international conference on Supercomputing, 1998

1997

Exploiting Parallelism Through Directives on the Nano-Threads Programming Model.

[BibT_eX]

[DOI]

Proceedings of the Languages and Compilers for Parallel Computing, 1997

Analysis of Several Scheduling Algorithms under the Nano-Thread Programming Model.

[BibT_eX]

[DOI]

Proceedings of the 11th International Parallel Processing Symposium (IPPS '97), 1997

1996

A Library Implementation of the Nano-Threads Programming Model.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par '96 Parallel Processing, 1996

1995

The eXc Model: Scheduler-Activations on Mach 3.0.

[BibT_eX]

Marisa Gil

Xavier Martorell

Nacho Navarro

Proceedings of the Seventh IASTED/ISMM International Conference on Parallel and Distributed Computing and Systems, 1995

Nacho Navarro

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...