Simin Fan

Orcid: 0000-0002-1490-9413

According to our database1, Simin Fan authored at least 13 papers between 2022 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Semantic uncertainty in advanced decoding methods for LLM generation.
CoRR, June, 2025

GRAPE: Optimize Data Mixture for Group Robust Multi-target Adaptive Pretraining.
CoRR, May, 2025

NeuralGrok: Accelerate Grokking by Neural Gradient Transformation.
CoRR, April, 2025

Task-Adaptive Pretrained Language Models via Clustered-Importance Sampling.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
HyperINF: Unleashing the HyperPower of the Schulz's Method for Data Influence Estimation.
CoRR, 2024

Dynamic Gradient Alignment for Online Data Mixing.
CoRR, 2024

Could ChatGPT get an Engineering Degree? Evaluating Higher Education Vulnerability to AI Assistants.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, 2024

Deep Grokking: Would Deep Neural Networks Generalize Better?
CoRR, 2024

DOGE: Domain Reweighting with Generalization Estimation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
MEDITRON-70B: Scaling Medical Pretraining for Large Language Models.
CoRR, 2023

Irreducible Curriculum for Language Model Pretraining.
CoRR, 2023

ReadingQuizMaker: A Human-NLP Collaborative System that Supports Instructors to Design High-Quality Reading Quiz Questions.
Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

2022
Towards Process-Oriented, Modular, and Versatile Question Generation that Meets Educational Needs.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022


  Loading...