Stas Bekman

Orcid: 0009-0002-1212-0379

According to our database1, Stas Bekman authored at least 10 papers between 2003 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Arctic Long Sequence Training: Scalable And Efficient Training For Multi-Million Token Sequences.
CoRR, June, 2025

Universal Checkpointing: A Flexible and Efficient Distributed Checkpointing System for Large-Scale DNN Training with Reconfigurable Parallelism.
Proceedings of the 2025 USENIX Annual Technical Conference, 2025

2024
Universal Checkpointing: Efficient and Flexible Checkpointing for Large Scale Distributed Training.
CoRR, 2024

The Case for Co-Designing Model Architectures with Hardware.
Proceedings of the 53rd International Conference on Parallel Processing, 2024

2023
OBELISC: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents.
CoRR, 2023

OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model.
CoRR, 2022

What Language Model to Train if You Have One Million GPU Hours?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021

2003
Practical mod_perl - programming, administration, performance tips.
O'Reilly, ISBN: 978-0-596-00227-5, 2003


  Loading...