Lasse Natvig

Affiliations:
  • Norwegian University of Science and Technology, Norway


According to our database1, Lasse Natvig authored at least 50 papers between 1990 and 2020.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2020
Scalability analysis of AVX-512 extensions.
J. Supercomput., 2020

2018
A vectorized k-means algorithm for compressed datasets: design and experimental analysis.
J. Supercomput., 2018

Proof-of-Concept Examples of Performance-Transparent Programming Models.
CoRR, 2018

Make software harder.
Proceedings of the 15th ACM International Conference on Computing Frontiers, 2018

2017
Efficient array slicing on the Intel Xeon Phi coprocessor.
Proceedings of the 4th ACM SIGPLAN International Workshop on Libraries, 2017

2016
Transient Temperature Prediction for Aging Thermal Sensors Using Artificial Neural Network.
Proceedings of the 24th Euromicro International Conference on Parallel, 2016

A systematic approach to automated construction of power emulation models.
Proceedings of the 2016 Design, Automation & Test in Europe Conference & Exhibition, 2016

2015
Tuning the victim selection policy of Intel TBB.
J. Syst. Archit., 2015

Climbing Mont Blanc - A Training Site for Energy Efficient Programming on Heterogeneous Multicore Processors.
CoRR, 2015

ParVec: vectorizing the PARSEC benchmark suite.
Computing, 2015

Cost-comfort balancing in a smart residential building with bidirectional energy trading.
Proceedings of the 2015 Sustainable Internet and ICT for Sustainability, 2015

V-PFORDelta: Data Compression for Energy Efficient Computation of Time Series.
Proceedings of the 22nd IEEE International Conference on High Performance Computing, 2015

2014
Performance and energy impact of parallelization and vectorization techniques in modern microprocessors.
Computing, 2014

Optimized hardware for suboptimal software: The case for SIMD-aware benchmarks.
Proceedings of the 2014 IEEE International Symposium on Performance Analysis of Systems and Software, 2014

Load scheduling in smart buildings with bidirectional energy trading.
Proceedings of the IEEE 33rd International Performance Computing and Communications Conference, 2014

Victim Selection Policies for Intel TBB: Overheads and Energy Footprint.
Proceedings of the Architecture of Computing Systems - ARCS 2014, 2014

2013
Energy-Efficient Sparse Matrix Autotuning with CSX - A Trade-off Study.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

On the energy footprint of task based parallel applications.
Proceedings of the International Conference on High Performance Computing & Simulation, 2013

Challenges of Reducing Cycle-Accurate Simulation Time for TBP Applications.
Proceedings of the International Conference on Computational Science, 2013

Temperature effects on on-chip energy measurements.
Proceedings of the International Green Computing Conference, 2013

2012
IPM based sparse LP solver on a heterogeneous processor.
Comput. Manag. Sci., 2012

Improving Energy Efficiency through Parallelization and Vectorization on Intel Core i5 and i7 Processors.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Case Studies of Multi-core Energy Efficiency in Task Based Programs.
Proceedings of the ICT as Key Technology against Global Warming, 2012

Performance and Energy Efficiency Analysis of Data Reuse Transformation Methodology on Multicore Processor.
Proceedings of the Euro-Par 2012: Parallel Processing Workshops, 2012

2011
A High Performance Adaptive Miss Handling Architecture for Chip Multiprocessors.
Trans. High Perform. Embed. Archit. Compil., 2011

Storage Efficient Hardware Prefetching using Delta-Correlating Prediction Tables.
J. Instr. Level Parallelism, 2011

Parallel algorithms for the maximum flow problem with minimum lot sizes.
Proceedings of the Operations Research Proceedings 2011, Selected Papers of the International Conference on Operations Research (OR 2011), August 30, 2011

Cache-Aware Matrix Multiplication on Multicore Systems for IPM-based LP Solvers.
Proceedings of the Federated Conference on Computer Science and Information Systems, 2011

Exploring the Prefetcher/Memory Controller Design Space: An Opportunistic Prefetch Scheduling Strategy.
Proceedings of the Architecture of Computing Systems - ARCS 2011, 2011

2010
Implementation of a linear programming solver on the Cell BE processor.
Proceedings of the International Conference on Computational Science, 2010

Computational Computer Architecture Research at NTNU.
ERCIM News, 2010

Mixed-Precision Parallel Linear Programming Solver.
Proceedings of the 22st International Symposium on Computer Architecture and High Performance Computing, 2010

DIEF: An Accurate Interference Feedback Mechanism for Chip Multiprocessor Memory Systems.
Proceedings of the High Performance Embedded Architectures and Compilers, 2010

Multi-level Hardware Prefetching Using Low Complexity Delta Correlating Prediction Tables with Partial Matching.
Proceedings of the High Performance Embedded Architectures and Compilers, 2010

2009
Experimental Validation of the Learning Effect for a Pedagogical Game on Computer Fundamentals.
IEEE Trans. Educ., 2009

A compulsory yet motivating question/answer game to teach computer fundamentals.
Comput. Appl. Eng. Educ., 2009

A Quantitative Study of Memory System Interference in Chip Multiprocessor Architectures.
Proceedings of the 11th IEEE International Conference on High Performance Computing and Communications, 2009

A light-weight fairness mechanism for chip multiprocessor memory systems.
Proceedings of the 6th Conference on Computing Frontiers, 2009

2008
Low-cost open-page prefetch scheduling in chip multiprocessors.
Proceedings of the 26th International Conference on Computer Design, 2008

Towards an Intelligent Environment for Programming Multi-core Computing Systems.
Proceedings of the Euro-Par 2008 Workshops, 2008

2007
An LRU-based replacement algorithm augmented with frequency of access in shared chip-multiprocessor caches.
SIGARCH Comput. Archit. News, 2007

2006
Destructive-read in embedded DRAM, impact on power consumption.
J. Embed. Comput., 2006

A Cache-Partitioning Aware Replacement Policy for Chip Multiprocessors.
Proceedings of the High Performance Computing, 2006

Master-Slave Tasking on Asymmetric Networks.
Proceedings of the Euro-Par 2006, Parallel Processing, 12th International Euro-Par Conference, Dresden, Germany, August 28, 2006

Cache Write-Back Schemes for Embedded Destructive-Read DRAM.
Proceedings of the Architecture of Computing Systems, 2006

2004
Age of computers: game-based teaching of computer fundamentals.
Proceedings of the 9th Annual SIGCSE Conference on Innovation and Technology in Computer Science Education, 2004

2001
Experience from a 450 Students/Year Course on Digital Logic and Computer Fundamentals using FPGAs and mu-Controllers.
Proceedings of the 2001 International Conference on Microelectronics Systems Education, 2001

Simulating Parallel Architectures with BSPlab.
Proceedings of the High-Performance Computing and Networking, 9th International Conference, 2001

1994
Compile and Runtime Padding: An Approach to Realising Synchronous MIMD Execution.
Proceedings of the Technology and Foundations - Information Processing '94, Volume 1, Proceedings of the IFIP 13th World Computer Congress, Hamburg, Germany, 28 August, 1994

1990
Logarithmic time cost optimal parallel sorting is <i>not yet</i> fast in practice!.
Proceedings of the Proceedings Supercomputing '90, New York, NY, USA, November 12-16, 1990, 1990


  Loading...