Fukai Shang

According to our database1, Fukai Shang authored at least 5 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2025
AICC: Parse HTML Finer, Make Models Better - A 7.3T AI-Ready Corpus Built by a Model-Based HTML Parser.
CoRR, November, 2025

2024
MinerU: An Open-Source Solution for Precise Document Content Extraction.
CoRR, 2024

Investigating Public Fine-Tuning Datasets: A Complex Review of Current Practices from a Construction Perspective.
CoRR, 2024

InternLM2 Technical Report.
CoRR, 2024

2023
MiChao-HuaFen 1.0: A Specialized Pre-trained Corpus Dataset for Domain-specific Large Models.
CoRR, 2023


  Loading...