Yuanbo Wen

Orcid: 0000-0002-7775-2724

Affiliations:
  • Chinese Academy of Sciences, Institute of Computing Technology, State Key Laboratory of Processors, Beijing, China


According to our database1, Yuanbo Wen authored at least 35 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
VariPar: Variation-Aware Workload Partitioning in Chiplet-Based DNN Accelerators.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., December, 2025

QiMeng-NeuComBack: Self-Evolving Translation from IR to Assembly Code.
CoRR, November, 2025

QiMeng-Attention: SOTA Attention Operator is generated by SOTA Attention Algorithm.
CoRR, June, 2025

Mutual-Supervised Learning for Sequential-to-Parallel Code Translation.
CoRR, June, 2025

QiMeng: Fully Automated Hardware and Software Design for Processor Chip.
CoRR, June, 2025

QiMeng-TensorOp: Automatically Generating High-Performance Tensor Operators with Hardware Primitives.
CoRR, May, 2025

QiMeng-CPU-v2: Automated Superscalar Processor Design by Learning Data Dependencies.
CoRR, May, 2025

QiMeng-Xpiler: Transcompiling Tensor Programs for Deep Learning Systems with a Neural-Symbolic Approach.
CoRR, May, 2025

MigGPT: Harnessing Large Language Models for Automated Migration of Out-of-Tree Linux Kernel Patches Across Versions.
CoRR, April, 2025

Harmonia: A Unified Architecture for Efficient Deep Symbolic Regression.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., February, 2025

Efficient and Fast High-Performance Library Generation for Deep Learning Accelerators.
IEEE Trans. Computers, January, 2025

AI Computing Systems for Large Language Models Training.
J. Comput. Sci. Technol., January, 2025

Cambricon-SR: An Accelerator for Neural Scene Representation with Sparse Encoding Table.
Proceedings of the 52nd Annual International Symposium on Computer Architecture, 2025

QiMeng-TensorOp: One-Line Prompt is Enough for High-Performance Tensor Operator Generation with Hardware Primitives.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

Automated Superscalar Processor Design by Learning Data Dependencies.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

Mosaic: Exploiting Instruction-Level Parallelism on Deep Learning Accelerators with <i>iTex</i> Tessellation.
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025

QiMeng-Attention: SOTA Attention Operator is generated by SOTA Attention Algorithm.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

QiMeng-GEMM: Automatically Generating High-Performance Matrix Multiplication Code by Exploiting Large Language Models.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
AGON: Automated Design Framework for Customizing Processors from ISA Documents.
CoRR, 2024

Assessing and Understanding Creativity in Large Language Models.
CoRR, 2024

Cambricon-C: Efficient 4-Bit Matrix Unit via Primitivization.
Proceedings of the 57th IEEE/ACM International Symposium on Microarchitecture, 2024

Cambricon-D: Full-Network Differential Acceleration for Diffusion Models.
Proceedings of the 51st ACM/IEEE Annual International Symposium on Computer Architecture, 2024

Prompt-based Visual Alignment for Zero-shot Policy Transfer.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

AutoOS: Make Your OS More Powerful by Exploiting Large Language Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Revisiting Automatic Pipelining: Gate-level Forwarding and Speculation.
Proceedings of the 61st ACM/IEEE Design Automation Conference, 2024

TensorTEE: Unifying Heterogeneous TEE Granularity for Efficient Secure Collaborative Tensor Computing.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023
ANPL: Compiling Natural Programs with Interactive Decomposition.
CoRR, 2023

ANPL: Towards Natural Programming with Interactive Decomposition.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Emergent Communication for Rules Reasoning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Cambricon-R: A Fully Fused Accelerator for Real-Time Learning of Neural Scene Representation.
Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture, 2023

BALTO: fast tensor program optimization with diversity-based active learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Heron: Automatically Constrained High-Performance Library Generation for Deep Learning Accelerators.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022
Enabling One-Size-Fits-All Compilation Optimization for Inference Across Machine Learning Computers.
IEEE Trans. Computers, 2022

BabelTower: Learning to Auto-parallelized Program Translation.
Proceedings of the International Conference on Machine Learning, 2022

2020
Addressing Irregularity in Sparse Neural Networks Through a Cooperative Software/Hardware Approach.
IEEE Trans. Computers, 2020


  Loading...