I am so irate about this. It's like nails on chalkboard. Anthropic knowingly deceptively frames the "fast-following" stuff. China now publishes more applied research, *plus* distills capabilities. If you stop improving US models, they'll keep distilling AND doing research.
Users criticize Anthropic's framing of China's AI fast-following strategy as cynical misrepresentation to sway policy and validate an arms race, while dismissing US focus on compute flexing over research.
Most Activity
You CAN argue that these capabilities stem from research which isn't getting replicated (because it's compute-intense), and so this will majorly slow down China, plus the capability gap is great enough that catch-up will take a while. But no, they will not be capped at US level.
I am so irate about this. It's like nails on chalkboard. Anthropic knowingly deceptively frames the "fast-following" stuff. China now publishes more applied research, *plus* distills capabilities. If you stop improving US models, they'll keep distilling AND doing research.
I guess Anthropic is cynically misconstruing it to get the desired policy action, because if they put it as it is, they'll validate the arms race anxiety and fail to persuade the DC. Rationalists/doomers on here, being racist and tech-illiterate, take it at face value.
You CAN argue that these capabilities stem from research which isn't getting replicated (because it's compute-intense), and so this will majorly slow down China, plus the capability gap is great enough that catch-up will take a while. But no, they will not be capped at US level.

@teortaxesTex I do think that raw distillation will be less efficient compared to very specific workflows for synthetic data generation with many controllable knobs. Will still be used but less impactful

@Rafa_Schwinger I am generally bearish on cost-efficiency of distillation in this era, and Minimax for me proves its limits. But you can be smart about it, eg like Ant accuses DeepSeek of reverse engineering their reward models.

@teortaxesTex Distillation smells like the input-output era of AI as a black box but as we move towards the workflow era where the bulk of it is run in their servers and not exposed my hunch is that the information ROI will decrease

@teortaxesTex china gets the research done while we celebrate compute flexing. wake up