Yuchen Hao

Orcid: 0009-0005-8513-9566

According to our database, Yuchen Hao authored at least 21 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.


Bibliography

2024
Wukong: Towards a Scaling Law for Large-Scale Recommendation.
CoRR, 2024

Disaggregated Multi-Tower: Topology-aware Modeling Technique for Efficient Large-Scale Recommendation.
CoRR, 2024

2023
PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel.
Proc. VLDB Endow., 2023

Rankitect: Ranking Architecture Search Battling World-class Engineers at Meta Scale.
CoRR, 2023

PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel.
CoRR, 2023


Software-Hardware Co-design of Heterogeneous SmartNIC System for Recommendation Models Inference and Training.
Proceedings of the 37th International Conference on Supercomputing, 2023

2022
DHEN: A Deep and Hierarchical Ensemble Network for Large-Scale Click-Through Rate Prediction.
CoRR, 2022

New enhanced clustering algorithms for patient referrals in medical consortia.
Comput. Ind. Eng., 2022


2021
High-performance, Distributed Training of Large-scale Deep Learning Recommendation Models.
CoRR, 2021

2020
Identifying Experts in Community Question Answering Website Based on Graph Convolutional Neural Network.
IEEE Access, 2020

Reconfigurable Accelerator Compute Hierarchy: A Case Study using Content-Based Image Retrieval.
Proceedings of the IEEE International Symposium on Workload Characterization, 2020

2019
In-Depth Analysis on Microarchitectures of Modern Heterogeneous CPU-FPGA Platforms.
ACM Trans. Reconfigurable Technol. Syst., 2019

2018
Architectural Techniques to Enhance the Efficiency of Accelerator-Centric Architectures.
PhD thesis, 2018

Best-Effort FPGA Programming: A Few Steps Can Go a Long Way.
CoRR, 2018

2017
Supporting Address Translation for Accelerator-Centric Architectures.
Proceedings of the 2017 IEEE International Symposium on High Performance Computer Architecture, 2017

2016
A quantitative analysis on microarchitectures of modern CPU-FPGA platforms.
Proceedings of the 53rd Annual Design Automation Conference, 2016

2015
On-chip interconnection network for accelerator-rich architectures.
Proceedings of the 52nd Annual Design Automation Conference, 2015

2014
Hardware Acceleration for an Accurate Stereo Vision System Using Mini-Census Adaptive Support Region.
ACM Trans. Embed. Comput. Syst., 2014

2012
FPGA based memory efficient high resolution stereo vision system for video tolling.
Proceedings of the 2012 International Conference on Field-Programmable Technology, 2012
