Group Relative Policy Optimization (GRPO)

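Group Relative Policy Optimization (GRPO) is a reinforcement learning algorithm for fine-tuning language models, tracked in AI research papers. It is a variant of Proximal Policy Optimization (PPO) that drops the learned value function and instead scores each sampled response relative to the other responses drawn for the same prompt.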
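
The "group relative" part of the name can be illustrated with a small sketch. It assumes the commonly described formulation in which several responses are sampled for one prompt and each response's reward is normalized by the mean and standard deviation of its group. The function name, the `eps` term, and the reward values are hypothetical and chosen only for illustration; this is not a full training loop.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each reward against its own group (illustrative sketch).

    rewards: scalar rewards for responses sampled from the same prompt.
    eps: small constant (an assumption) to avoid division by zero when
         every response in the group receives the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population standard deviation of the group
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical rewards for four responses sampled for one prompt.
advantages = group_relative_advantages([1.0, 0.0, 0.0, 1.0])
```

Responses that score above their group's average get a positive advantage and those below get a negative one, so in this sketch no separate value model is needed to supply a baseline.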