DeepSeek's February 2024 GRPO paper predates OpenAI's o1, suggesting its reasoning models stem from independent research instead of distillation · Digg