Towards Next-Generation LLM Training: From the Data-Centric Perspective | Signal Canvas | ScienceToStartup