OpenDeepThink Scales LLM Reasoning Via Parallel Population Evolution
This is pretty fun vibecoded an implementation of the paper, added a webui and freeform mode, plus model split (deepseek-Flash for generation, Pro for judge). Still chokes on hard problems, but would be trivial to chain with decompositions/other hacks. GPT-Pro on a budget?
yess… I love this
This is pretty fun vibecoded an implementation of the paper, added a webui and freeform mode, plus model split (deepseek-Flash for generation, Pro for judge). Still chokes on hard problems, but would be trivial to chain with decompositions/other hacks. GPT-Pro on a budget?
OK this is schizo enough

yess… I love this
Now we're talking. Bradley–Terry-powered silliness-mazimizer! of course this is a fully general technique.

OK this is schizo enough



