Marco Chen

According to our database, Marco Chen authored at least 6 papers between 2025 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
SageBwd: A Trainable Low-bit Attention.
CoRR, March, 2026

Delving into Muon and Beyond: Deep Analysis and Extensions.
CoRR, February, 2026

SimpleGPT: Improving GPT via A Simple Normalization Strategy.
CoRR, February, 2026

2025
DNT: a Deeply Normalized Transformer that can be trained by Momentum SGD.
CoRR, July, 2025

Not All Attention Heads Are What You Need: Refining CLIP's Image Representation with Attention Ablation.
CoRR, July, 2025

Señorita-2M: A High-Quality Instruction-based Dataset for General Video Editing by Video Specialists.
CoRR, February, 2025

