Yuhao Zhu

ORCID: 0000-0002-2802-0578

Affiliations:
  • University of Rochester, Rochester, NY, USA


According to our database, Yuhao Zhu authored at least 86 papers between 2010 and 2024.

Bibliography

2024
Exploiting Human Color Discrimination for Memory- and Energy-Efficient Image Encoding in Virtual Reality.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

JUNO: Optimizing High-Dimensional Approximate Nearest Neighbour Search with Sparsity-Aware Algorithm and Ray-Tracing Core Mapping.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

Amanda: Unified Instrumentation Framework for Deep Neural Networks.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023
An Energy Efficient and Runtime Reconfigurable Accelerator for Robotic Localization.
IEEE Trans. Computers, July, 2023

JUNO: Optimizing High-Dimensional Approximate Nearest Neighbour Search with Sparsity-Aware Algorithm and Ray-Tracing Core Mapping.
CoRR, 2023

Autonomy 2.0: The Quest for Economies of Scale.
CoRR, 2023

Fast and Accurate: Video Enhancement Using Sparse Depth.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

FastPoints: A state-of-the-art point cloud renderer for Unity.
Proceedings of the Visualization and Data Analysis 2023, 2023

Teaching color science to EECS students using interactive tutorials: Tools and lessons.
Proceedings of the Visualization and Data Analysis 2023, 2023

Imperceptible Color Modulation for Power Saving in VR/AR.
Proceedings of the ACM SIGGRAPH 2023 Emerging Technologies, 2023

ImaGen: A General Framework for Generating Memory- and Power-Efficient Image Processing Accelerators.
Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023

CAMJ: Enabling System-Level Energy Modeling and Architectural Exploration for In-Sensor Visual Computing.
Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023

OliVe: Accelerating Large Language Models via Hardware-friendly Outlier-Victim Pair Quantization.
Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023

Invited Paper: Learned In-Sensor Visual Computing: From Compression to Eventification.
Proceedings of the IEEE/ACM International Conference on Computer Aided Design, 2023

BLITZCRANK: Factor Graph Accelerator for Motion Planning.
Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

2022
Communication Challenges in Infrastructure-Vehicle Cooperative Autonomous Driving: A Field Deployment Perspective.
IEEE Wirel. Commun., 2022

Color-Perception-Guided Display Power Reduction for Virtual Reality.
ACM Trans. Graph., 2022

Thales: Formulating and Estimating Architectural Vulnerability Factors for DNN Accelerators.
CoRR, 2022

Factor Graph Accelerator for LiDAR-Inertial Odometry.
CoRR, 2022

Real-Time Gaze Tracking with Event-Driven Eye Segmentation.
Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces, 2022

Digital reconstruction of Elmina Castle for mobile virtual reality via point-based detail transfer.
Proceedings of the Visualization and Data Analysis 2022, online, January 15-26, 2022

RTNN: accelerating neighbor search using hardware ray tracing.
Proceedings of the PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2, 2022

ANT: Exploiting Adaptive Numerical Data Type for Low-bit Deep Neural Network Quantization.
Proceedings of the 55th IEEE/ACM International Symposium on Microarchitecture, 2022

Braum: Analyzing and Protecting Autonomous Machine Software Stack.
Proceedings of the IEEE 33rd International Symposium on Software Reliability Engineering, 2022

Crescent: taming memory irregularities for accelerating deep point cloud analytics.
Proceedings of the ISCA '22: The 49th Annual International Symposium on Computer Architecture, New York, New York, USA, June 18, 2022

SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian Approximation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Factor Graph Accelerator for LiDAR-Inertial Odometry (Invited Paper).
Proceedings of the 41st IEEE/ACM International Conference on Computer-Aided Design, 2022

S2TA: Exploiting Structured Sparsity for Energy-Efficient Mobile CNN Acceleration.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2022

Block-Skim: Efficient Question Answering for Transformer.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
The Promise of Dataflow Architectures in the Design of Processing Systems for Autonomous Machines.
CoRR, 2021

ZIPPER: Exploiting Tile- and Operator-level Parallelism for General and Scalable Graph Neural Network Acceleration.
CoRR, 2021

The Matter of Time - A General and Efficient System for Precise Sensor Synchronization in Robotic Computing.
CoRR, 2021

A LiDAR-Guided Framework for Video Enhancement.
CoRR, 2021

Resurrect3D: An Open and Customizable Platform for Visualizing and Analyzing Cultural Heritage Artifacts.
Proceedings of the Web3D '21: The 26th International Conference on 3D Web Technology, Pisa, Italy, November 8, 2021

Brief Industry Paper: The Matter of Time - A General and Efficient System for Precise Sensor Synchronization in Robotic Computing.
Proceedings of the 27th IEEE Real-Time and Embedded Technology and Applications Symposium, 2021

Archytas: A Framework for Synthesizing and Dynamically Optimizing Accelerators for Robotic Localization.
Proceedings of the MICRO '21: 54th Annual IEEE/ACM International Symposium on Microarchitecture, 2021

Characterizing and Demystifying the Implicit Convolution Algorithm on Commercial Matrix-Multiplication Accelerators.
Proceedings of the IEEE International Symposium on Workload Characterization, 2021

3D SceneFlowNet: Self-Supervised 3D Scene Flow Estimation Based on Graph CNN.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Eudoxus: Characterizing and Accelerating Localization in Autonomous Machines (Industry Track Paper).
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2021

2020
Energy-Efficient Video Processing for Virtual Reality.
IEEE Micro, 2020

Eudoxus: Characterizing and Accelerating Localization in Autonomous Machines.
CoRR, 2020

End-to-End Framework for Efficient Deep Learning Using Metasurfaces Optics.
CoRR, 2020

A Survey of FPGA-Based Robotic Computing.
CoRR, 2020

Accelerating sparse DNN models without hardware-support via tile-wise sparsity.
Proceedings of the International Conference for High Performance Computing, 2020

Building the Computing System for Autonomous Micromobility Vehicles: Design Constraints and Architectural Optimizations.
Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, 2020

Ptolemy: Architecture Support for Robust Deep Learning.
Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, 2020

Mesorasi: Architecture Support for Point Cloud Analytics via Delayed-Aggregation.
Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, 2020

A Systematic Methodology for Characterizing Scalability of DNN Accelerators using SCALE-Sim.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2020

Real-Time Spatio-Temporal LiDAR Point Cloud Compression.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Energy-Efficient 360-Degree Video Rendering on FPGA via Algorithm-Architecture Co-Design.
Proceedings of the FPGA '20: The 2020 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2020

Balancing Efficiency and Flexibility for DNN Acceleration via Temporal GPU-Systolic Array Integration.
Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

Automatic Neural Network Compression by Sparsity-Quantization Joint Learning: A Constrained Optimization-Based Approach.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Low-Latency Proactive Continuous Vision.
Proceedings of the PACT '20: International Conference on Parallel Architectures and Compilation Techniques, 2020

2019
Learning Sparsity and Quantization Jointly and Automatically for Neural Network Compression via Constrained Optimization.
CoRR, 2019

SVSoC: Speculative Vision Systems-on-a-Chip.
IEEE Comput. Archit. Lett., 2019

Tail latency in node.js: energy efficient turbo boosting for long latency requests in event-driven web services.
Proceedings of the 15th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments, 2019

Tigris: Architecture and Algorithms for 3D Perception in Point Clouds.
Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture, 2019

ASV: Accelerated Stereo Vision System.
Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture, 2019

Demystifying Bayesian Inference Workloads.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2019

PES: proactive event scheduling for responsive and energy-efficient mobile web computing.
Proceedings of the 46th International Symposium on Computer Architecture, 2019

Energy-Constrained Compression for Deep Neural Networks via Weighted Sparse Projection and Layer Input Masking.
Proceedings of the 7th International Conference on Learning Representations, 2019

ECC: Platform-Independent Energy-Constrained Deep Neural Network Compression via a Bilinear Regression Model.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Adversarial Defense Through Network Profiling Based Path Extraction.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
ECC: Energy-Constrained Deep Neural Network Compression via a Bilinear Regression Model.
CoRR, 2018

SCALE-Sim: Systolic CNN Accelerator.
CoRR, 2018

End-to-End Learning of Energy-Constrained Deep Neural Networks.
CoRR, 2018

Cloud No Longer a Silver Bullet, Edge to the Rescue.
CoRR, 2018

Mobile Machine Learning Hardware at ARM: A Systems-on-Chip (SoC) Perspective.
CoRR, 2018

Euphrates: Algorithm-SoC Co-Design for Low-Power Mobile Continuous Vision.
Proceedings of the 45th ACM/IEEE Annual International Symposium on Computer Architecture, 2018

BitFlow: Exploiting Vector Parallelism for Binary Neural Networks on CPU.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

Semantic-Aware Virtual Reality Video Streaming.
Proceedings of the 9th Asia-Pacific Workshop on Systems, 2018

2017
Optimizing General-Purpose CPUs for Energy-Efficient Mobile Web Computing.
ACM Trans. Comput. Syst., 2017

Cognitive Computing Safety: The New Horizon for Reliability / The Design and Evolution of Deep Learning Workloads.
IEEE Micro, 2017

Research for practice: web security and mobile web computing.
Commun. ACM, 2017

2016
GreenWeb: language extensions for energy-efficient mobile web computing.
Proceedings of the 37th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2016

Mobile CPU's rise to power: Quantifying the impact of generational mobile CPU design trends on performance, energy, and user satisfaction.
Proceedings of the 2016 IEEE International Symposium on High Performance Computer Architecture, 2016

2015
The Role of the CPU in Energy-Efficient Mobile Web Browsing.
IEEE Micro, 2015

Microarchitectural implications of event-driven server-side web applications.
Proceedings of the 48th International Symposium on Microarchitecture, 2015

Mosaic: cross-platform user-interaction record and replay for the fragmented android ecosystem.
Proceedings of the 2015 IEEE International Symposium on Performance Analysis of Systems and Software, 2015

Event-based scheduling for energy-efficient QoS (eQoS) in mobile Web applications.
Proceedings of the 21st IEEE International Symposium on High Performance Computer Architecture, 2015

2014
Exploiting Webpage Characteristics for Energy-Efficient Mobile Web Browsing.
IEEE Comput. Archit. Lett., 2014

WebCore: Architectural support for mobile Web browsing.
Proceedings of the ACM/IEEE 41st International Symposium on Computer Architecture, 2014

2013
High-performance and energy-efficient mobile web browsing on big/little systems.
Proceedings of the 19th IEEE International Symposium on High Performance Computer Architecture, 2013

2011
Massively Parallel Logic Simulation with GPUs.
ACM Trans. Design Autom. Electr. Syst., 2011

Hermes: an integrated CPU/GPU microarchitecture for IP routing.
Proceedings of the 48th Design Automation Conference, 2011

2010
Distributed time, conservative parallel logic simulation on GPUs.
Proceedings of the 47th Design Automation Conference, 2010

