Stas Bekman

ORCID: 0009-0002-1212-0379

According to our database, Stas Bekman authored at least 9 papers between 2003 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Arctic Long Sequence Training: Scalable And Efficient Training For Multi-Million Token Sequences.
CoRR, June, 2025

Universal Checkpointing: A Flexible and Efficient Distributed Checkpointing System for Large-Scale DNN Training with Reconfigurable Parallelism.
Proceedings of the 2025 USENIX Annual Technical Conference, 2025

2024
Universal Checkpointing: Efficient and Flexible Checkpointing for Large Scale Distributed Training.
CoRR, 2024

The Case for Co-Designing Model Architectures with Hardware.
Proceedings of the 53rd International Conference on Parallel Processing, 2024

2023
OBELISC: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents.
CoRR, 2023

OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
What Language Model to Train if You Have One Million GPU Hours?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021

2003
Practical mod_perl - programming, administration, performance tips.
O'Reilly, ISBN: 978-0-596-00227-5, 2003
