Mansheej Paul

According to our database, Mansheej Paul authored at least 12 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
μnit Scaling: Simple and Scalable FP8 LLM Training.
CoRR, February, 2025

Soup to go: mitigating forgetting during continual learning with model averaging.
CoRR, January, 2025

Scaling Laws for Precision.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
LoRA Learns Less and Forgets Less.
Trans. Mach. Learn. Res., 2024

Critique-out-Loud Reward Models.
CoRR, 2024

Does your data spark joy? Performance gains from domain upsampling at the end of training.
CoRR, 2024

2023
Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Unmasking the Lottery Ticket Hypothesis: What's Encoded in a Winning Ticket's Mask?
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Lottery Tickets on a Data Diet: Finding Initializations with Sparse Trainable Networks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Deep Learning on a Data Diet: Finding Important Examples Early in Training.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
Deep learning versus kernel learning: an empirical study of loss landscape geometry and the time evolution of the Neural Tangent Kernel.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
