Zhoutong Wu

Orcid: 0009-0005-6137-5492

According to our database1, Zhoutong Wu authored at least 4 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2025
Improving Model Representation and Reducing KV Cache via Skip Connections with First Value Heads.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

2024
Designing Universally-Approximating Deep Neural Networks: A First-Order Optimization Approach.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2024

Separation and Bias of Deep Equilibrium Models on Expressivity and Learning Dynamics.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

2021
Analysis of Legal Documents via Non-negative Matrix Factorization Methods.
CoRR, 2021


  Loading...