Hyojin Sung

Orcid: 0000-0002-3036-6180

According to our database1, Hyojin Sung authored at least 27 papers between 2009 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
XLA-NDP: Efficient Scheduling and Code Generation for Deep Learning Model Training on Near-Data Processing Memory.
IEEE Comput. Archit. Lett., 2023

Multi-Objective Architecture Search and Optimization for Heterogeneous Neuromorphic Architecture.
Proceedings of the IEEE/ACM International Conference on Computer Aided Design, 2023

PRIMO: A Full-Stack Processing-in-DRAM Emulation Framework for Machine Learning Workloads.
Proceedings of the IEEE/ACM International Conference on Computer Aided Design, 2023

PIMFlow: Compiler and Runtime Support for CNN Models on Processing-in-Memory DRAM.
Proceedings of the 21st ACM/IEEE International Symposium on Code Generation and Optimization, 2023

2022
Runtime Support for Accelerating CNN Models on Digital DRAM Processing-in-Memory Hardware.
IEEE Comput. Archit. Lett., 2022

One-shot tuner for deep learning compilers.
Proceedings of the CC '22: 31st ACM SIGPLAN International Conference on Compiler Construction, Seoul, South Korea, April 2, 2022

2021
MetaTune: Meta-Learning Based Cost Model for Fast and Efficient Auto-tuning Frameworks.
CoRR, 2021

Near-Data Processing in Memory Expander for DNN Acceleration on GPUs.
IEEE Comput. Archit. Lett., 2021

Hybrid Register Allocation with Spill Cost and Pattern Guided Optimization.
Proceedings of the Languages and Compilers for Parallel Computing, 2021

2019
Using Structured Input and Modularity for Improved Learning.
CoRR, 2019

POSTER: CogR: Exploiting Program Structures for Machine-Learning Based Runtime Solutions.
Proceedings of the 28th International Conference on Parallel Architectures and Compilation Techniques, 2019

2017
Implementing implicit OpenMP data sharing on GPUs.
Proceedings of the Fourth Workshop on the LLVM Compiler Infrastructure in HPC, 2017

Leveraging OpenMP 4.5 Support in CLANG for Fortran.
Proceedings of the Scaling OpenMP for Exascale Performance and Portability, 2017

Efficient Fork-Join on GPUs Through Warp Specialization.
Proceedings of the 24th IEEE International Conference on High Performance Computing, 2017

2016
Performance Analysis and Optimization of Clang's OpenMP 4.5 GPU Support.
Proceedings of the 7th International Workshop on Performance Modeling, 2016

Offloading Support for OpenMP in Clang and LLVM.
Proceedings of the Third Workshop on the LLVM Compiler Infrastructure in HPC, 2016

Automatic Copying of Pointer-Based Data Structures.
Proceedings of the Languages and Compilers for Parallel Computing, 2016

2015
DeNovo: rethinking the memory hierarchy for disciplined parallelism
PhD thesis, 2015

Integrating GPU support for OpenMP offloading directives into Clang.
Proceedings of the Second Workshop on the LLVM Compiler Infrastructure in HPC, 2015

Performance analysis of OpenMP on a GPU using a CORAL proxy application.
Proceedings of the 6th International Workshop on Performance Modeling, 2015

Eliminating on-chip traffic waste: are we there yet?
Proceedings of the 2015 IEEE International Symposium on Performance Analysis of Systems and Software, 2015

DeNovoSync: Efficient Support for Arbitrary Synchronization without Writer-Initiated Invalidations.
Proceedings of the Twentieth International Conference on Architectural Support for Programming Languages and Operating Systems, 2015

2014
DeNovoND: Efficient Hardware for Disciplined Nondeterminism.
IEEE Micro, 2014

2013
DeNovoND: efficient hardware support for disciplined non-determinism.
Proceedings of the Architectural Support for Programming Languages and Operating Systems, 2013

2011
DeNovo: Rethinking the Memory Hierarchy for Disciplined Parallelism.
Proceedings of the 2011 International Conference on Parallel Architectures and Compilation Techniques, 2011

2010
Parallel SAH k-D tree construction.
Proceedings of the ACM SIGGRAPH/EUROGRAPHICS Conference on High Performance Graphics 2010, 2010

2009
A type and effect system for deterministic parallel Java.
Proceedings of the 24th Annual ACM SIGPLAN Conference on Object-Oriented Programming, 2009


  Loading...