Siran Yang

Orcid: 0009-0001-0272-3718

According to our database1, Siran Yang authored at least 28 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Complementary Reinforcement Learning.
CoRR, March, 2026

Attack of the Bubbles: Straggler-Resilient Pipeline Parallelism for Large Model Training.
Proceedings of the 23rd USENIX Symposium on Networked Systems Design and Implementation, 2026

RollPacker: Taming Long-Tail Rollouts for RL Post-Training with Tail Batching.
Proceedings of the 23rd USENIX Symposium on Networked Systems Design and Implementation, 2026

2025
AMAP Agentic Planning Technical Report.
CoRR, December, 2025

RollArt: Scaling Agentic RL Training via Disaggregated Infrastructure.
CoRR, December, 2025

RollMux: Phase-Level Multiplexing for Disaggregated RL Post-Training.
CoRR, December, 2025

Reconstructing KV Caches with Cross-layer Fusion For Enhanced Transformers.
CoRR, December, 2025

Part II: ROLL Flash - Accelerating RLVR and Agentic Training with Asynchrony.
CoRR, October, 2025

RollPacker: Mitigating Long-Tail Rollouts for Fast, Synchronous RL Post-Training.
CoRR, September, 2025

RecIS: Sparse to Dense, A Unified Training Framework for Recommendation Models.
CoRR, September, 2025

Ovis2.5 Technical Report.
CoRR, August, 2025

Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning.
CoRR, August, 2025

Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library.
CoRR, June, 2025

Adaptra: Straggler-Resilient Hybrid-Parallel Training with Pipeline Adaptation.
CoRR, April, 2025

CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games.
CoRR, March, 2025

Ebdfes: Post-earthquake building damage and fatality estimation system.
Softw. Impacts, 2025

MESCNN: Magnitude estimation system based on convolutional neural networks.
Softw. Impacts, 2025

GREYHOUND: Hunting Fail-Slows in Hybrid-Parallel Training at Scale.
Proceedings of the 2025 USENIX Annual Technical Conference, 2025

CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Enhancing Large-Scale AI Training Efficiency: The C4 Solution for Real-Time Anomaly Detection and Communication Optimization.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2025

2024
FALCON: Pinpointing and Mitigating Stragglers for Large-Scale Hybrid-Parallel Training.
CoRR, 2024

Boosting Large-scale Parallel Training Efficiency with C4: A Communication-Driven Approach.
CoRR, 2024

FaPES: Enabling Efficient Elastic Scaling for Serverless Machine Learning Platforms.
Proceedings of the 2024 ACM Symposium on Cloud Computing, 2024

2023
Joint Optimization of Ranking and Calibration with Contextualized Hybrid Model.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Rec4Ad: A Free Lunch to Mitigate Sample Selection Bias for Ads CTR Prediction in Taobao.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

2022
PICASSO: Unleashing the Potential of GPU-centric Training for Wide-and-deep Recommender Systems.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

2021
One Model to Serve All: Star Topology Adaptive Recommender for Multi-Domain CTR Prediction.
CoRR, 2021

One Model to Serve All: Star Topology Adaptive Recommender for Multi-Domain CTR Prediction.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021


  Loading...