Huynh Phung Huynh

According to our database1, Huynh Phung Huynh authored at least 23 papers between 2007 and 2018.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2018
Exploiting Sparsity to Accelerate Fully Connected Layers of CNN-Based Applications on Mobile SoCs.
ACM Trans. Embed. Comput. Syst., 2018

2017
Scale-Free Sparse Matrix-Vector Multiplication on Many-Core Architectures.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2017

2016
Efficient Query Processing on Many-core Architectures: A Case Study with Intel Xeon Phi Processor.
Proceedings of the 2016 International Conference on Management of Data, 2016

2015
MrPhi: An Optimized MapReduce Framework on Intel Xeon Phi Coprocessors.
IEEE Trans. Parallel Distributed Syst., 2015

Efficient GPU Spatial-Temporal Multitasking.
IEEE Trans. Parallel Distributed Syst., 2015

Improving Main Memory Hash Joins on Intel Xeon Phi Processors: An Experimental Approach.
Proc. VLDB Endow., 2015

Improving GPGPU energy-efficiency through concurrent kernel execution and DVFS.
Proceedings of the 13th Annual IEEE/ACM International Symposium on Code Generation and Optimization, 2015

2014
Mapping Streaming Applications onto GPU Systems.
IEEE Trans. Parallel Distributed Syst., 2014

2013
Hierarchical Parallel Algorithm for Modularity-Based Community Detection Using GPUs.
Proceedings of the Euro-Par 2013 Parallel Processing, 2013

Optimizing the MapReduce framework on Intel Xeon Phi coprocessor.
Proceedings of the 2013 IEEE International Conference on Big Data (IEEE BigData 2013), 2013

2012
Poster: Automated Mapping Streaming Applications onto GPUs.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Abstract: Mapping Streaming Applications onto GPU Systems.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Scalable framework for mapping streaming applications onto multi-GPU systems.
Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2012

GPGPU for Real-Time Data Analytics.
Proceedings of the 18th IEEE International Conference on Parallel and Distributed Systems, 2012

2011
Automated Architecture-Aware Mapping of Streaming Applications Onto GPUs.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

2010
Design space exploration of instruction set customizable MPSoCs for multimedia applications.
Proceedings of the 2010 International Conference on Embedded Computer Systems: Architectures, 2010

Efficient custom instructions generation for system-level design.
Proceedings of the International Conference on Field-Programmable Technology, 2010

2009
An efficient framework for dynamic reconfiguration of instruction-set customization.
Des. Autom. Embed. Syst., 2009

Runtime Adaptive Extensible Embedded Processors - A Survey.
Proceedings of the Embedded Computer Systems: Architectures, 2009

Runtime reconfiguration of custom instructions for real-time embedded systems.
Proceedings of the Design, Automation and Test in Europe, 2009

Evaluating design trade-offs in customizable processors.
Proceedings of the 46th Design Automation Conference, 2009

2008
Processor customization for wearable bio-monitoring platforms.
Proceedings of the 2008 International Conference on Field-Programmable Technology, 2008

2007
Instruction-set customization for real-time embedded systems.
Proceedings of the 2007 Design, Automation and Test in Europe Conference and Exposition, 2007


  Loading...