Fei Sun

Affiliations:
  • Meta Systems Inc


According to our database1, Fei Sun authored at least 35 papers between 2002 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Composer: A Search Framework for Hybrid Neural Architecture Design.
CoRR, October, 2025

Q-TempFusion: Quantization-Aware Temporal Multi-Sensor Fusion on Bird's-Eye View Representation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

2024
Is Flash Attention Stable?
CoRR, 2024

Generative AI Beyond LLMs: System Implications of Multi-Modal Generation.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2024

Pruning Foundation Models for High Accuracy without Retraining.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2022
MILAN: Masked Image Pretraining on Language Assisted Representation.
CoRR, 2022

184QPS/W 64Mb/mm<sup>2</sup>3D Logic-to-DRAM Hybrid Bonding with Process-Near-Memory Engine for Recommendation System.
Proceedings of the IEEE International Solid-State Circuits Conference, 2022

Compact Multi-level Sparse Neural Networks with Input Independent Dynamic Rerouting.
Proceedings of the 34th IEEE International Conference on Tools with Artificial Intelligence, 2022

Effective Model Sparsification by Scheduled Grow-and-Prune Methods.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Shfl-BW: accelerating deep neural network inference with tensor-core aware weight pruning.
Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022

CHEX: CHannel EXploration for CNN Model Compression.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

A one-for-all and <i>o</i>(<i>v</i> log(<i>v</i> ))-cost solution for parallel merge style operations on sorted key-value arrays.
Proceedings of the ASPLOS '22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022, 2022

2021
Load-balanced Gather-scatter Patterns for Sparse Deep Neural Networks.
CoRR, 2021

2020
Computation on Sparse Neural Networks: an Inspiration for Future Hardware.
CoRR, 2020

A Unified DNN Weight Compression Framework Using Reweighted Optimization Methods.
CoRR, 2020


INVITED: Computation on Sparse Neural Networks and its Implications for Future Hardware.
Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

Learning in the Frequency Domain.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Low-Power Computer Vision: Status, Challenges, and Opportunities.
IEEE J. Emerg. Sel. Topics Circuits Syst., 2019

Low-Power Computer Vision: Status, Challenges, Opportunities.
CoRR, 2019

Machine Learning at Facebook: Understanding Inference at the Edge.
Proceedings of the 25th IEEE International Symposium on High Performance Computer Architecture, 2019

FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

ChamNet: Towards Efficient Network Design Through Platform-Aware Model Adaptation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Special Session: 2018 Low-Power Image Recognition Challenge and Beyond.
Proceedings of the IEEE International Conference on Artificial Intelligence Circuits and Systems, 2019

2018
2018 Low-Power Image Recognition Challenge.
CoRR, 2018

2012
Automatic generation of functional models for embedded processor extensions.
Proceedings of the 2012 Design, Automation & Test in Europe Conference & Exhibition, 2012

2007
A Synthesis Methodology for Hybrid Custom Instruction and Coprocessor Generation for Extensible Processors.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2007

2006
A Scalable Synthesis Methodology for Application-Specific Processors.
IEEE Trans. Very Large Scale Integr. Syst., 2006

Application-specific heterogeneous multiprocessor synthesis using extensible processors.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2006

Hybrid Custom Instruction and Co-Processor Synthesis Methodology for Extensible Processors.
Proceedings of the 19th International Conference on VLSI Design (VLSI Design 2006), 2006

2005
Synthesis of Application-Specific Heterogeneous Multiprocessor Architectures Using Extensible Processors.
Proceedings of the 18th International Conference on VLSI Design (VLSI Design 2005), 2005

2004
Custom-instruction synthesis for extensible-processor platforms.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2004

2003
A Scalable Application-Specific Processor Synthesis Methodology.
Proceedings of the 2003 International Conference on Computer-Aided Design, 2003

2002
Synthesis of custom processors based on extensible platforms.
Proceedings of the 2002 IEEE/ACM International Conference on Computer-aided Design, 2002


  Loading...