Yu Feng

Orcid: 0000-0002-2192-5737

Affiliations:

Shanghai Jiao Tong University, Shanghai, China
University of Rochester, Department of Computer Science, Rochester, NY, USA (former)

According to our database¹, Yu Feng authored at least 50 papers between 2019 and 2026.

Collaborative distances:

Dijkstra number² of three.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2026

ELSA: An ELastic SNN Inference Architecture for Efficient Neuromorphic Computing.

[BibT_eX]

[DOI]

CoRR, May, 2026

Mosaic: Towards Efficient Training of Multimodal Models with Spatial Resource Multiplexing.

[BibT_eX]

[DOI]

CoRR, May, 2026

AB-Sparse: Sparse Attention with Adaptive Block Size for Accurate and Efficient Long-Context Inference.

[BibT_eX]

[DOI]

CoRR, May, 2026

On the (In-)Security of the Shuffling Defense in the Transformer Secure Inference.

[BibT_eX]

[DOI]

CoRR, May, 2026

CODO: An Automated Compiler for Comprehensive Dataflow Optimization.

[BibT_eX]

[DOI]

CoRR, April, 2026

M2XFP: A Metadata-Augmented Microscaling Data Format for Efficient Low-bit Quantization.

[BibT_eX]

[DOI]

CoRR, January, 2026

ORANGE: Exploring Ockham's Razor for Neural Rendering by Accelerating 3DGS on NPUs with GEMM-Friendly Blending and Balanced Workloads.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2026

SPLATONIC: Architectural Support for 3D Gaussian Splatting SLAM via Sparse Processing.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2026

FlashFuser: Expanding the Scale of Kernel Fusion for Compute-Intensive Operators via Inter-Core Connection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2026

Nebula: Infinite-Scale 3D Gaussian Splatting in VR via Collaborative Rendering and Accelerated Stereo Rasterization.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2026

EARTH: An Efficient MoE Accelerator with Entropy-Aware Speculative Prefetch and Result Reuse.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2026

M<sup>2</sup>XFP: A Metadata-Augmented Microscaling Data Format for Efficient Low-bit Quantization.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2026

2025

Nebula: Enable City-Scale 3D Gaussian Splatting in Virtual Reality via Collaborative Rendering and Accelerated Stereo Rasterization.

[BibT_eX]

[DOI]

CoRR, December, 2025

Splatonic: Architecture Support for 3D Gaussian Splatting SLAM via Sparse Processing.

[BibT_eX]

[DOI]

CoRR, November, 2025

TIMERIPPLE: Accelerating vDiTs by Understanding the Spatio-Temporal Correlations in Latent Space.

[BibT_eX]

[DOI]

CoRR, November, 2025

Justitia: Fair and Efficient Scheduling for LLM Applications.

[BibT_eX]

[DOI]

CoRR, October, 2025

EDAS: Enabling Fast Data Loading for GPU Serverless Computing.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., September, 2025

ClusterFusion: Expanding Operator Fusion Scope for LLM Inference via Cluster-Level Collective Primitive.

[BibT_eX]

[DOI]

CoRR, August, 2025

Lumina: Real-Time Mobile Neural Rendering by Exploiting Computational Redundancy.

[BibT_eX]

[DOI]

CoRR, June, 2025

Astraea: A GPU-Oriented Token-wise Acceleration Framework for Video Diffusion Transformers.

[BibT_eX]

[DOI]

CoRR, June, 2025

Voyager: Real-Time Splatting City-Scale 3D Gaussians on Your Phone.

[BibT_eX]

[DOI]

CoRR, June, 2025

SeeLe: A Unified Acceleration Framework for Real-Time Gaussian Splatting.

[BibT_eX]

[DOI]

CoRR, March, 2025

PrivateEye: In-Sensor Privacy Preservation Through Optical Feature Separation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

Lumina: Real-Time Neural Rendering by Exploiting Computational Redundancy.

[BibT_eX]

[DOI]

Proceedings of the 52nd Annual International Symposium on Computer Architecture, 2025

An Efficient Private GPT Never Autoregressively Decodes.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

SLTarch: Towards Scalable Point-Based Neural Rendering by Taming Workload Imbalance and Memory Irregularity.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2025

M-ANT: Efficient Low-bit Group Quantization for LLMs via Mathematically Adaptive Numerical Type.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2025

VQ-LLM: High-performance Code Generation for Vector Quantization Augmented LLM Inference.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2025

STREAMINGGS: Voxel-Based Streaming 3D Gaussian Splatting with Memory Optimization and Architectural Support.

[BibT_eX]

[DOI]

Proceedings of the 62nd ACM/IEEE Design Automation Conference, 2025

SNAPPIX: Efficient-Coding-Inspired In-Sensor Compression for Edge Vision.

[BibT_eX]

[DOI]

Proceedings of the 62nd ACM/IEEE Design Automation Conference, 2025

MetaSapiens: Real-Time Neural Rendering with Efficiency-Aware Pruning and Accelerated Foveated Rendering.

[BibT_eX]

[DOI]

Weikai Lin

Yu Feng

Yuhao Zhu

Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025

StreamGrid: Streaming Point Cloud Analytics via Compulsory Splitting and Deterministic Termination.

[BibT_eX]

[DOI]

Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025

2024

Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., December, 2024

vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving.

[BibT_eX]

[DOI]

CoRR, 2024

RTGS: Enabling Real-Time Gaussian Splatting on Mobile Devices Using Efficiency-Guided Pruning and Foveated Rendering.

[BibT_eX]

[DOI]

Weikai Lin

Yu Feng

Yuhao Zhu

CoRR, 2024

BlissCam: Boosting Eye Tracking Efficiency with Learned In-Sensor Sparse Sampling.

[BibT_eX]

[DOI]

Proceedings of the 51st ACM/IEEE Annual International Symposium on Computer Architecture, 2024

Cicero: Addressing Algorithmic and Architectural Bottlenecks in Neural Rendering by Radiance Warping and Memory Optimizations.

[BibT_eX]

[DOI]

Proceedings of the 51st ACM/IEEE Annual International Symposium on Computer Architecture, 2024

AutoVCoder: A Systematic Framework for Automated Verilog Code Generation using LLMs.

[BibT_eX]

[DOI]

Proceedings of the 42nd IEEE International Conference on Computer Design, 2024

JUNO: Optimizing High-Dimensional Approximate Nearest Neighbour Search with Sparsity-Aware Algorithm and Ray-Tracing Core Mapping.

[BibT_eX]

[DOI]

Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

Amanda: Unified Instrumentation Framework for Deep Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023

Fast and Accurate: Video Enhancement Using Sparse Depth.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

CAMJ: Enabling System-Level Energy Modeling and Architectural Exploration for In-Sensor Visual Computing.

[BibT_eX]

[DOI]

Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023

Invited Paper: Learned In-Sensor Visual Computing: From Compression to Eventification.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM International Conference on Computer Aided Design, 2023

2022

Real-Time Gaze Tracking with Event-Driven Eye Segmentation.

[BibT_eX]

[DOI]

Yu Feng

Nathan Goulding-Hotta

Asif Khan

Hans Reyserhove

Yuhao Zhu

Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces, 2022

Crescent: taming memory irregularities for accelerating deep point cloud analytics.

[BibT_eX]

[DOI]

Proceedings of the ISCA '22: The 49th Annual International Symposium on Computer Architecture, New York, New York, USA, June 18, 2022

2021

A LiDAR-Guided Framework for Video Enhancement.

[BibT_eX]

[DOI]

CoRR, 2021

2020

Mesorasi: Architecture Support for Point Cloud Analytics via Delayed-Aggregation.

[BibT_eX]

[DOI]

Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, 2020

Real-Time Spatio-Temporal LiDAR Point Cloud Compression.

[BibT_eX]

[DOI]

Yu Feng

Shaoshan Liu

Yuhao Zhu

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

2019

ASV: Accelerated Stereo Vision System.

[BibT_eX]

[DOI]

Yu Feng

Paul N. Whatmough

Yuhao Zhu

Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture, 2019

PES: proactive event scheduling for responsive and energy-efficient mobile web computing.

[BibT_eX]

[DOI]

Yu Feng

Yuhao Zhu

Proceedings of the 46th International Symposium on Computer Architecture, 2019

Yu Feng

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...