Have an idea.
Implement it. Test it at 1b. Compare to the proper SOTA baselines. Do not mess up the evals. Normalize flops. Normalize mem. Test at 8b. Test at 32b. Test at 200b.
1:14 AM · Jun 25, 2026 · 8.1K Views
Have an idea.
Implement it. Test it at 1b. Compare to the proper SOTA baselines. Do not mess up the evals. Normalize flops. Normalize mem. Test at 8b. Test at 32b. Test at 200b.
No Digg Deeper questions have been answered for this story yet.
you basically lose 3/4 of ideas at every step.
Have an idea.
Implement it. Test it at 1b. Compare to the proper SOTA baselines. Do not mess up the evals. Normalize flops. Normalize mem. Test at 8b. Test at 32b. Test at 200b.