Yujun Lin

ORCID: 0000-0001-6314-1722

Affiliations:

Massachusetts Institute of Technology, Cambridge, USA
Tsinghua University, Department of Electronic Engineering, Beijing, China (former)

According to our database, Yujun Lin authored at least 39 papers between 2016 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2025

QeRL: Beyond Efficiency - Quantization-enhanced Reinforcement Learning for LLMs.

CoRR, October, 2025

DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder.

CoRR, September, 2025

DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space.

CoRR, September, 2025

Radial Attention: O(n log n) Sparse Attention with Energy Decay for Long Video Generation.

CoRR, June, 2025

Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation.

CoRR, May, 2025

LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention.

CoRR, February, 2025

Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity.

CoRR, February, 2025

SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Sparse Video-Gen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

SANA: Efficient High-Resolution Text-to-Image Synthesis with Linear Diffusion Transformers.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

SVDQuant: Absorbing Outliers by Low-Rank Component for 4-Bit Diffusion Models.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

LEGO: Spatial Accelerator Generation and Optimization for Tensor Applications.

Yujun Lin

Zhekai Zhang

Song Han

Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2025

2024

Algorithm-System-Hardware Co-Design for Efficient 3D Deep Learning.

World Sci. Annu. Rev. Artif. Intell., 2024

SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models.

CoRR, 2024

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers.

CoRR, 2024

QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving.

CoRR, 2024

2022

Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications.

ACM Trans. Design Autom. Electr. Syst., 2022

TorchSparse: Efficient Point Cloud Inference Engine.

Proceedings of the Fifth Conference on Machine Learning and Systems, 2022

QuantumNAS: Noise-Adaptive Search for Robust Quantum Circuits.

Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2022

2021

Delayed Gradient Averaging: Tolerate the Communication Latency for Federated Learning.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

PointAcc: Efficient Point Cloud Accelerator.

Proceedings of the MICRO '21: 54th Annual IEEE/ACM International Symposium on Microarchitecture, 2021

NAAS: Neural Accelerator Architecture Search.

Yujun Lin

Mengtian Yang

Song Han

Proceedings of the 58th ACM/IEEE Design Automation Conference, 2021

2020

Long Live TIME: Improving Lifetime and Security for NVM-Based Training-in-Memory Systems.

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020

AutoML for Architecting Efficient and Specialized Neural Networks.

IEEE Micro, 2020

Hardware-Centric AutoML for Mixed-Precision Quantization.

Int. J. Comput. Vis., 2020

MCUNet: Tiny Deep Learning on IoT Devices.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Lite Transformer with Long-Short Range Attention.

Proceedings of the 8th International Conference on Learning Representations, 2020

Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution.

Proceedings of the Computer Vision - ECCV 2020, 2020

APQ: Joint Search for Network Architecture, Pruning and Quantization Policy.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Design Automation for Efficient Deep Learning Computing.

CoRR, 2019

Point-Voxel CNN for Efficient 3D Deep Learning.

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

A Fine-Grained Sparse Accelerator for Multi-Precision DNN.

Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2019

A Configurable Multi-Precision CNN Computing Framework Based on Single Bit RRAM.

Proceedings of the 56th Annual Design Automation Conference 2019, 2019

HAQ: Hardware-Aware Automated Quantization With Mixed Precision.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

HAQ: Hardware-Aware Automated Quantization.

CoRR, 2018

Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training.

Proceedings of the 6th International Conference on Learning Representations, 2018

Long live TIME: improving lifetime for training-in-memory engines by structured gradient sparsification.

Proceedings of the 55th Annual Design Automation Conference, 2018

2017

On the Understanding of Interdependency of Mobile App Usage.

Proceedings of the 14th IEEE International Conference on Mobile Ad Hoc and Sensor Systems, 2017

2016

Big Data Driven Mobile Traffic Understanding and Forecasting: A Time Series Approach.

IEEE Trans. Serv. Comput., 2016
