Siyu Chen

Affiliations:
  • Yale University, Department of Statistics and Data Science, New Haven, CT, USA


According to our database1, Siyu Chen authored at least 14 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Taming Polysemanticity in LLMs: Provable Feature Recovery via Sparse Autoencoders.
CoRR, June, 2025

Can Neural Networks Achieve Optimal Computational-statistical Tradeoff? An Analysis on Single-Index Model.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Unveiling the Statistical Foundations of Chain-of-Thought Prompting Methods.
CoRR, 2024

Contractual Reinforcement Learning: Pulling Arms with Invisible Hands.
CoRR, 2024

Implicit Regularization of Gradient Flow on One-Layer Softmax Attention.
CoRR, 2024

Training Dynamics of Multi-Head Softmax Attention for In-Context Learning: Emergence, Convergence, and Optimality.
CoRR, 2024

Unveiling Induction Heads: Provable Training Dynamics and Feature Learning in Transformers.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Training Dynamics of Multi-Head Softmax Attention for In-Context Learning: Emergence, Convergence, and Optimality (extended abstract).
Proceedings of the Thirty Seventh Annual Conference on Learning Theory, June 30, 2024

2023
Actions Speak What You Want: Provably Sample-Efficient Reinforcement Learning of the Quantal Stackelberg Equilibrium from Strategic Feedbacks.
CoRR, 2023

A Unified Framework of Policy Learning for Contextual Bandit with Confounding Bias and Missing Observations.
CoRR, 2023

Learning to Incentivize Information Acquisition: Proper Scoring Rules Meet Principal-Agent Model.
Proceedings of the International Conference on Machine Learning, 2023

2022
Adaptive Model Design for Markov Decision Process.
Proceedings of the International Conference on Machine Learning, 2022

2021
Wasserstein Flow Meets Replicator Dynamics: A Mean-Field Analysis of Representation Learning in Actor-Critic.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021


  Loading...