Xuhao Chen

Orcid: 0000-0001-6470-3387

Affiliations:
  • Massachusetts Institute of Technology, Cambridge, MA, USA
  • The University of Texas at Austin, TX, USA (2019 - 2020)
  • National University of Defense Technology, College of Computer, Changsha, China (PhD 2014)
  • University of Illinois Urbana-Champaign, IMPACT Research group, Urbana-Champaign, IL, USA (2012 - 2014)


According to our database1, Xuhao Chen authored at least 28 papers between 2011 and 2022.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2022
Efficient and Scalable Graph Pattern Mining on GPUs.
Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022

2021
FlexMiner: A Pattern-Aware Accelerator for Graph Pattern Mining.
Proceedings of the 48th ACM/IEEE Annual International Symposium on Computer Architecture, 2021

Sandslash: a two-level framework for efficient graph pattern mining.
Proceedings of the ICS '21: 2021 International Conference on Supercomputing, 2021

2020
Pangolin: An Efficient and Flexible Graph Mining System on CPU and GPU.
Proc. VLDB Endow., 2020

2019
GARDENIA: A Graph Processing Benchmark Suite for Next-Generation Accelerators.
ACM J. Emerg. Technol. Comput. Syst., 2019

GraphCage: Cache Aware Graph Processing on GPUs.
CoRR, 2019

Architectural Implications in Graph Processing of Accelerator with Gardenia Benchmark Suite.
Proceedings of the 2019 IEEE Intl Conf on Parallel & Distributed Processing with Applications, 2019

DistTC: High Performance Distributed Triangle Counting.
Proceedings of the 2019 IEEE High Performance Extreme Computing Conference, 2019

2018
Orchestrating parallel detection of strongly connected components on GPUs.
Parallel Comput., 2018

Escort: Efficient Sparse Convolutional Neural Networks on GPUs.
CoRR, 2018

2017
GARDENIA: A Domain-specific Benchmark Suite for Next-generation Accelerators.
CoRR, 2017

Efficient and high-quality sparse graph coloring on GPUs.
Concurr. Comput. Pract. Exp., 2017

High Performance Detection of Strongly Connected Components in Sparse Graphs on GPUs.
Proceedings of the 8th International Workshop on Programming Models and Applications for Multicores and Manycores, 2017

Efficient and Portable ALS Matrix Factorization for Recommender Systems.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

2016
Evaluating Multiple Streams on Heterogeneous Platforms.
Parallel Process. Lett., 2016

Shielding STT-RAM Based Register Files on GPUs against Read Disturbance.
ACM J. Emerg. Technol. Comput. Syst., 2016

Efficient and High-quality Sparse Graph Coloring on the GPU.
CoRR, 2016

Streaming Applications on Heterogeneous Platforms.
Proceedings of the Network and Parallel Computing, 2016

Evaluating the Performance Impact of Multiple Streams on the MIC-Based Heterogeneous Platform.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

High Performance Parallel Graph Coloring on GPGPUs.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

An Energy-Efficient Implementation of LU Factorization on Heterogeneous Systems.
Proceedings of the 22nd IEEE International Conference on Parallel and Distributed Systems, 2016

Red-Shield: Shielding Read Disturbance for STT-RAM Based Register Files on GPUs.
Proceedings of the 26th edition on Great Lakes Symposium on VLSI, 2016

Architecting energy-efficient STT-RAM based register file on GPGPUs via delta compression.
Proceedings of the 53rd Annual Design Automation Conference, 2016

2014
Binary compatibility for embedded systems using greedy subgraph mapping.
Sci. China Inf. Sci., 2014

Adaptive Cache Management for Energy-Efficient GPU Computing.
Proceedings of the 47th Annual IEEE/ACM International Symposium on Microarchitecture, 2014

Adaptive Cache Bypass and Insertion for Many-core Accelerators.
Proceedings of the 2nd International Workshop on Many-core Embedded Systems, 2014

2011
GSM: An Efficient Code Generation Algorithm for Dynamic Binary Translator.
Proceedings of the Fourth International Symposium on Parallel Architectures, 2011

Characterizing Fine-Grain Parallelism on Modern Multicore Platform.
Proceedings of the 17th IEEE International Conference on Parallel and Distributed Systems, 2011


  Loading...