Yu Feng

Orcid: 0000-0002-2192-5737

Affiliations:
  • Shanghai Jiao Tong University, Shanghai, China
  • University of Rochester, Department of Computer Science, Rochester, NY, USA (former)


According to our database1, Yu Feng authored at least 50 papers between 2019 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
ELSA: An ELastic SNN Inference Architecture for Efficient Neuromorphic Computing.
CoRR, May, 2026

Mosaic: Towards Efficient Training of Multimodal Models with Spatial Resource Multiplexing.
CoRR, May, 2026

AB-Sparse: Sparse Attention with Adaptive Block Size for Accurate and Efficient Long-Context Inference.
CoRR, May, 2026

On the (In-)Security of the Shuffling Defense in the Transformer Secure Inference.
CoRR, May, 2026

CODO: An Automated Compiler for Comprehensive Dataflow Optimization.
CoRR, April, 2026

M2XFP: A Metadata-Augmented Microscaling Data Format for Efficient Low-bit Quantization.
CoRR, January, 2026

ORANGE: Exploring Ockham's Razor for Neural Rendering by Accelerating 3DGS on NPUs with GEMM-Friendly Blending and Balanced Workloads.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2026

SPLATONIC: Architectural Support for 3D Gaussian Splatting SLAM via Sparse Processing.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2026

FlashFuser: Expanding the Scale of Kernel Fusion for Compute-Intensive Operators via Inter-Core Connection.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2026

Nebula: Infinite-Scale 3D Gaussian Splatting in VR via Collaborative Rendering and Accelerated Stereo Rasterization.
Proceedings of the 31st ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2026

EARTH: An Efficient MoE Accelerator with Entropy-Aware Speculative Prefetch and Result Reuse.
Proceedings of the 31st ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2026

M<sup>2</sup>XFP: A Metadata-Augmented Microscaling Data Format for Efficient Low-bit Quantization.
Proceedings of the 31st ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2026

2025
Nebula: Enable City-Scale 3D Gaussian Splatting in Virtual Reality via Collaborative Rendering and Accelerated Stereo Rasterization.
CoRR, December, 2025

Splatonic: Architecture Support for 3D Gaussian Splatting SLAM via Sparse Processing.
CoRR, November, 2025

TIMERIPPLE: Accelerating vDiTs by Understanding the Spatio-Temporal Correlations in Latent Space.
CoRR, November, 2025

Justitia: Fair and Efficient Scheduling for LLM Applications.
CoRR, October, 2025

EDAS: Enabling Fast Data Loading for GPU Serverless Computing.
ACM Trans. Archit. Code Optim., September, 2025

ClusterFusion: Expanding Operator Fusion Scope for LLM Inference via Cluster-Level Collective Primitive.
CoRR, August, 2025

Lumina: Real-Time Mobile Neural Rendering by Exploiting Computational Redundancy.
CoRR, June, 2025

Astraea: A GPU-Oriented Token-wise Acceleration Framework for Video Diffusion Transformers.
CoRR, June, 2025

Voyager: Real-Time Splatting City-Scale 3D Gaussians on Your Phone.
CoRR, June, 2025

SeeLe: A Unified Acceleration Framework for Real-Time Gaussian Splatting.
CoRR, March, 2025

PrivateEye: In-Sensor Privacy Preservation Through Optical Feature Separation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

Lumina: Real-Time Neural Rendering by Exploiting Computational Redundancy.
Proceedings of the 52nd Annual International Symposium on Computer Architecture, 2025

An Efficient Private GPT Never Autoregressively Decodes.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

SLTarch: Towards Scalable Point-Based Neural Rendering by Taming Workload Imbalance and Memory Irregularity.
Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2025

M-ANT: Efficient Low-bit Group Quantization for LLMs via Mathematically Adaptive Numerical Type.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2025

VQ-LLM: High-performance Code Generation for Vector Quantization Augmented LLM Inference.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2025

STREAMINGGS: Voxel-Based Streaming 3D Gaussian Splatting with Memory Optimization and Architectural Support.
Proceedings of the 62nd ACM/IEEE Design Automation Conference, 2025

SNAPPIX: Efficient-Coding-Inspired In-Sensor Compression for Edge Vision.
Proceedings of the 62nd ACM/IEEE Design Automation Conference, 2025

MetaSapiens: Real-Time Neural Rendering with Efficiency-Aware Pruning and Accelerated Foveated Rendering.
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025

StreamGrid: Streaming Point Cloud Analytics via Compulsory Splitting and Deterministic Termination.
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025

2024
Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture.
ACM Trans. Archit. Code Optim., December, 2024

vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving.
CoRR, 2024

RTGS: Enabling Real-Time Gaussian Splatting on Mobile Devices Using Efficiency-Guided Pruning and Foveated Rendering.
CoRR, 2024

BlissCam: Boosting Eye Tracking Efficiency with Learned In-Sensor Sparse Sampling.
Proceedings of the 51st ACM/IEEE Annual International Symposium on Computer Architecture, 2024

Cicero: Addressing Algorithmic and Architectural Bottlenecks in Neural Rendering by Radiance Warping and Memory Optimizations.
Proceedings of the 51st ACM/IEEE Annual International Symposium on Computer Architecture, 2024

AutoVCoder: A Systematic Framework for Automated Verilog Code Generation using LLMs.
Proceedings of the 42nd IEEE International Conference on Computer Design, 2024

JUNO: Optimizing High-Dimensional Approximate Nearest Neighbour Search with Sparsity-Aware Algorithm and Ray-Tracing Core Mapping.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

Amanda: Unified Instrumentation Framework for Deep Neural Networks.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023
Fast and Accurate: Video Enhancement Using Sparse Depth.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

CAMJ: Enabling System-Level Energy Modeling and Architectural Exploration for In-Sensor Visual Computing.
Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023

Invited Paper: Learned In-Sensor Visual Computing: From Compression to Eventification.
Proceedings of the IEEE/ACM International Conference on Computer Aided Design, 2023

2022
Real-Time Gaze Tracking with Event-Driven Eye Segmentation.
Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces, 2022

Crescent: taming memory irregularities for accelerating deep point cloud analytics.
Proceedings of the ISCA '22: The 49th Annual International Symposium on Computer Architecture, New York, New York, USA, June 18, 2022

2021
A LiDAR-Guided Framework for Video Enhancement.
CoRR, 2021

2020
Mesorasi: Architecture Support for Point Cloud Analytics via Delayed-Aggregation.
Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, 2020

Real-Time Spatio-Temporal LiDAR Point Cloud Compression.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

2019
ASV: Accelerated Stereo Vision System.
Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture, 2019

PES: proactive event scheduling for responsive and energy-efficient mobile web computing.
Proceedings of the 46th International Symposium on Computer Architecture, 2019


  Loading...