Towards Next-Generation LLM Training: From the Data-Centric Perspective | ScienceToStartup | ScienceToStartup