Shangyan Zhou

Orcid: 0009-0002-9696-1801

According to our database1, Shangyan Zhou authored at least 7 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference.
CoRR, February, 2026

2025
mHC: Manifold-Constrained Hyper-Connections.
CoRR, December, 2025

DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Nat., 2025

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures.
Proceedings of the 52nd Annual International Symposium on Computer Architecture, 2025

2024
Fire-Flyer AI-HPC: A Cost-Effective Software-Hardware Co-Design for Deep Learning.
CoRR, 2024

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism.
CoRR, 2024



  Loading...