3h agoWeight Extrapolation of RL Checkpoints Produces Complementary PoliciesSentimentSentimentPos100%Neg0%Users express pride in research on weight extrapolation of RL checkpoints for yielding better policies and scaling, crediting excellent coauthors for the work's quality.1 comment with sentiment. View comments.