Simin Fan

ORCID: 0000-0002-1490-9413

According to our database, Simin Fan authored at least 14 papers between 2022 and 2026.

Bibliography

2026
The Magic Correlations: Understanding Knowledge Transfer from Pretraining to Supervised Fine-Tuning.
CoRR, February, 2026

2025
Semantic uncertainty in advanced decoding methods for LLM generation.
CoRR, June, 2025

GRAPE: Optimize Data Mixture for Group Robust Multi-target Adaptive Pretraining.
CoRR, May, 2025

NeuralGrok: Accelerate Grokking by Neural Gradient Transformation.
CoRR, April, 2025

Task-Adaptive Pretrained Language Models via Clustered-Importance Sampling.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
HyperINF: Unleashing the HyperPower of the Schulz's Method for Data Influence Estimation.
CoRR, 2024

Dynamic Gradient Alignment for Online Data Mixing.
CoRR, 2024

Could ChatGPT get an Engineering Degree? Evaluating Higher Education Vulnerability to AI Assistants.
CoRR, 2024

Deep Grokking: Would Deep Neural Networks Generalize Better?
CoRR, 2024

DOGE: Domain Reweighting with Generalization Estimation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
MEDITRON-70B: Scaling Medical Pretraining for Large Language Models.
CoRR, 2023

Irreducible Curriculum for Language Model Pretraining.
CoRR, 2023

ReadingQuizMaker: A Human-NLP Collaborative System that Supports Instructors to Design High-Quality Reading Quiz Questions.
Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

2022
Towards Process-Oriented, Modular, and Versatile Question Generation that Meets Educational Needs.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

