Zihao Zeng

Orcid: 0009-0005-9467-5520

According to our database1, Zihao Zeng authored at least 18 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Which Data Attributes Stimulate Math and Code Reasoning? An Investigation via Influence Functions.
CoRR, May, 2025

Done Is Better than Perfect: Unlocking Efficient Reasoning by Structured Multi-Turn Decomposition.
CoRR, May, 2025

RealSafe-R1: Safety-Aligned DeepSeek-R1 without Compromising Reasoning Capability.
CoRR, April, 2025

AutoHete: An Automatic and Efficient Heterogeneous Training System for LLMs.
CoRR, March, 2025

SIFT: Grounding LLM Reasoning in Contexts via Stickers.
CoRR, February, 2025

Probability analysis of shallow landslides in varying vegetation zones with random soil grain-size distribution.
Environ. Model. Softw., 2025

Generating invisible adversarial watermarks based on block-matching embedding algorithm.
Digit. Signal Process., 2025

MatryoshkaKV: Adaptive KV Compression via Trainable Orthogonal Projection.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
ExpertFlow: Optimized Expert Activation and Token Allocation for Efficient Mixture-of-Experts Inference.
CoRR, 2024

In-context KV-Cache Eviction for LLMs via Attention-Gate.
CoRR, 2024

AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models.
CoRR, 2024

AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

An End-to-End Multi-modal-based Framework for Visual Identity Inspection System.
Proceedings of the 2024 8th International Conference on Big Data and Internet of Things, 2024

2022
AccTFM: An Effective Intra-Layer Model Parallelization Strategy for Training Large-Scale Transformer-Based Models.
IEEE Trans. Parallel Distributed Syst., 2022

2021
Balancing Performance and Effort in Deep Learning via the Fusion of Real and Synthetic Cultural Heritage Photogrammetry Training Sets.
Proceedings of the 13th International Conference on Agents and Artificial Intelligence, 2021

Training Acceleration for Deep Neural Networks: A Hybrid Parallelization Strategy.
Proceedings of the 58th ACM/IEEE Design Automation Conference, 2021

2020
CoExe: An Efficient Co-execution Architecture for Real-Time Neural Network Services.
Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

2019
K-Means Parallel Acceleration for Sparse Data Dimensions on Flink.
Proceedings of the 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, 2019


  Loading...