IN CASE YOU MISSED IT: Anthropic's Dario Amodei proposes mandatory third-party safety testing for frontier AI models to manage exponential capability growth | #632 @TRENTONBRICKEN, #607 @EVANHUB, #535 @ANDREWCURRAN_, #440 @TEORTAXESTEX, #173 @DARIOAMODEI, +20 more
#1 VIEWED: Kradle AI benchmark finds Claude-Fable-5 was deceptive in 96% of runs while Grok-4-20 led at 92% | #834 @PMDDOMINGOS, #530 @BEFFJEZOS, #501 @TEORTAXESTEX, #353 @NATHANBENAICH, #52 @ELONMUSK, +1 more
MOST NEGATIVE: Anthropic reverses policy that would have secretly degraded Claude Fable 5 performance for researchers training competing models | #271 @BLACKHC, #217 @DIMITRISPAPAIL, #121 @GARRYTAN, #109 @CLEMENTDELANGUE, #80 @NATOLAMBERT, +11 more
FASTEST CLIMBING: OpenAI's Boris Power joins Thrive Holdings as head of research while keeping his current role | #1654 @JOHNOHALLMAN, #1154 @ALTH0U, #381 @BORISMPOWER
0:25 | VC Launches Use Computer Infra To Train Models On Computer Use | 5h ago | Views 15K, Likes 136, Bookmarks 60 | #1860 @SOPHIAMYANG, #1190 @XEOPHON, #743 @VINCENTWEISSER, #598 @MASCOBOT, #380 @ISCIENCELUVR, #260 @KHOOMEIK
0:14 | Matt Shumer Shares Prompt For Persistent Claude Agent Tracking | 5h ago | Views 45K, Likes 435, Bookmarks 535 | #929 @MATTSHUMER_
0:40 | FACTR 2 Adds Force Sensing to Commodity Robot Arms Without Extra Hardware | 5h ago | Views 12K, Likes 159, Bookmarks 87 | #306 @PATHAK2206
41:57 | Cursor AI Launches Dev Interview Series With Baseten Engineers On Coding Agents | 4h ago | Views 27K, Likes 293, Bookmarks 197 | #1418 @SJWHITMORE
0:30 | Builder Launches Dingbotics To Accelerate RL Training For Robotics | 2h ago | Views 12K, Likes 358, Bookmarks 102 | #487 @YACINEMTB
0:29 | Anthropic Runs Lean With Dario Amodei Reporting to One Chief of Staff | 8h ago | Views 323K, Likes 374, Bookmarks 72 | #1532 @KIMMONISMUS
0:27 | Vercel And Shopify Power AI-Built Next.js Storefront With 500 Orders In Minutes | 4h ago | Views 35K, Likes 389, Bookmarks 274 | #672 @RAUCHG
0:49 | AI Revives 28-Year-Old Quake II Map for Web-Based Play | 7h ago | Views 80K, Likes 214, Bookmarks 53 | #842 @LEVELSIO
0:26 | Builder Trains RL Model On RTX 4090 With PufferLib And MuJoCo Warp | 8h ago | Views 17K, Likes 353, Bookmarks 88 | #487 @YACINEMTB
0:09 | DiffusionGemma SFT Model Correctly Solves Sudoku as Base Version Fails | 9h ago | Views 5K, Likes 134, Bookmarks 18 | #1726 @QBERTHET, #1699 @NATANIELRUIZG, #511 @OSANSEVIERO
MaureenZOU/worldstring | 18M AGO | Learns digital twins of real-world objects as state manifolds from point clouds or RGB-D video. | AC48411 stars
jeylau/jcf | 23M AGO | Predicts 3D hip and knee contact forces from uncalibrated monocular video via a physics-free pipeline. | AC4841 stars
cursor/plugins | 24M AGO | Hosts Cursor plugin spec and official plugins integrating AI editor with dev tools and SaaS. | JM2781.9k stars
qlabs-eng/slowrun | 30M AGO | Benchmarks language models on a fixed 100M-token FineWeb dataset using unlimited compute to minimize validation loss. | MS1069482 stars
…tral-sh/ty-pre-commit | 1H AGO | Provides pre-commit hooks for running the ty Python type checker. | CM143629 stars
macrodata-labs/refiner | 5H AGO | Processes and refines large-scale ML datasets via a data framework. | EL1136NB353OS888CB167420 stars
kyutai-labs/kairos | 2D AGO | Trains 6B LLMs on temporally ordered Common Crawl data from 2018-2025 to measure recency bias and enable continual learning studies. | 🎭1014JL848JM2787 stars
vllm-project/vime | 2D AGO | Integrates vLLM rollout with Megatron training for LLM post-training and RL scaling. | 🎭1014SA1798213 stars
…eley/agents-last-exam | 2D AGO | Provisions OS sandboxes, runs agent harnesses on long-horizon tasks, and grades outputs against hidden references. | 🎭1014PL1489521 stars
datacurve-ai/deep-swe | 2D AGO | Benchmarks frontier coding agents on 113 long-horizon tasks from active open-source repos with isolated environments and program verifiers. | 🎭1014FO1953761 stars