Gurpreet Gosal

According to our database, Gurpreet Gosal authored at least 8 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Power Lines: Scaling Laws for Weight Decay and Batch Size in LLM Pre-training.
CoRR, May, 2025

Llama-3-Nanda-10B-Chat: An Open Generative Large Language Model for Hindi.
CoRR, April, 2025

Llama-3.1-Sherkala-8B-Chat: An Open Large Language Model for Kazakh.
CoRR, March, 2025

Straight to Zero: Why Linearly Decaying the Learning Rate to Zero Works Best for LLMs.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Bilingual Adaptation of Monolingual Foundation Models.
CoRR, 2024

Med42 - Evaluating Fine-Tuning Strategies for Medical LLMs: Full-Parameter vs. Parameter-Efficient Approaches.
CoRR, 2024

2023
Improving Resnet-9 Generalization Trained on Small Datasets.
CoRR, 2023

Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
CoRR, 2023
