In case you missed itAnthropic's Dario Amodei proposes mandatory third-party safety testing for frontier AI models to manage exponential capability growthTB#632|@TRENTONBRICKENEH#607|@EVANHUBAC#535|@ANDREWCURRAN_T(#440|@TEORTAXESTEXDA#173|@DARIOAMODEI+20 more
#1 VIEWEDKradle AI benchmark finds Claude-Fable-5 was deceptive in 96% of runs while Grok-4-20 led at 92%PD#834|@PMDDOMINGOSB(#530|@BEFFJEZOST(#501|@TEORTAXESTEXNB#353|@NATHANBENAICHEM#52|@ELONMUSK+1 more
MOST NEGATIVEAnthropic reverses policy that would have secretly degraded Claude Fable 5 performance for researchers training competing modelsAK#271|@BLACKHCDP#217|@DIMITRISPAPAILGT#121|@GARRYTANC🤗#109|@CLEMENTDELANGUENL#80|@NATOLAMBERT+11 more
FASTEST CLIMBINGMeta's Konstantin Mishchenko optimizes the modded-nanogpt baseline, warning that weak baselines make new optimizers look deceptively promisingZN#983|@ZACHARYNADOKJ#703|@KELLERJORDAN0DP#217|@DIMITRISPAPAILLB#72|@GIFFMANA
0:25FB#1818|@XEOPHONVW#990|@VINCENTWEISSERMM#716|@MASCOBOTTM#613|@ISCIENCELUVRRP#358|@KHOOMEIKVC Launches Use Computer Infra To Train Models On Computer Use6h ago|Views 15KLikes 142Bookmarks 65
41:57BH#1782|@BENHYLAKJC#1402|@JAMESCHAMSW#1092|@SJWHITMORECursor AI Launches Dev Interview Series With Baseten Engineers On Coding Agents4h ago|Views 29KLikes 313Bookmarks 208
0:14MS#691|@MATTSHUMER_Matt Shumer Shares Prompt For Persistent Claude Agent Tracking6h ago|Views 47KLikes 451Bookmarks 565
0:30KA#403|@YACINEMTBBuilder Launches Dingbotics To Accelerate RL Training For Robotics3h ago|Views 12KLikes 370Bookmarks 102
0:40DP#286|@PATHAK2206FACTR 2 Adds Force Sensing to Commodity Robot Arms Without Extra Hardware6h ago|Views 12KLikes 164Bookmarks 89
0:29CH#1360|@KIMMONISMUSAnthropic Runs Lean With Dario Amodei Reporting to One Chief of Staff8h ago|Views 323KLikes 383Bookmarks 75
0:49@L#627|@LEVELSIOAI Revives 28-Year-Old Quake II Map for Web-Based Play7h ago|Views 83KLikes 225Bookmarks 60
0:27GR#400|@RAUCHGVercel And Shopify Power AI-Built Next.js Storefront With 500 Orders In Minutes5h ago|Views 36KLikes 398Bookmarks 281
71:35LK#100|@OFFICIALLOGANKGoogle Research Head Discusses AI Accelerating Scientific Progress3h ago|Views 15KLikes 194Bookmarks 69
0:26KA#403|@YACINEMTBBuilder Trains RL Model On RTX 4090 With PufferLib And MuJoCo Warp8h ago|Views 17KLikes 357Bookmarks 89
MaureenZOU/worldstring29M AGOLearns digital twins of real-world objects as state manifolds from point clouds or RGB-D video.AC48411 stars
jeylau/jcf34M AGOPredicts 3D hip and knee contact forces from uncalibrated monocular video via a physics-free pipeline.AC4841 stars
cursor/plugins35M AGOHosts Cursor plugin spec and official plugins integrating AI editor with dev tools and SaaS.JM2781.9k stars
qlabs-eng/slowrun41M AGOBenchmarks language models on a fixed 100M-token FineWeb dataset using unlimited compute to minimize validation loss.MS1069482 stars
…tral-sh/ty-pre-commit1H AGOProvides pre-commit hooks for running the ty Python type checker.CM143629 stars
macrodata-labs/refiner5H AGOProcesses and refines large-scale ML datasets via a data framework.EL1136NB353OS888CB167420 stars
kyutai-labs/kairos2D AGOTrains 6B LLMs on temporally ordered Common Crawl data from 2018-2025 to measure recency bias and enable continual learning studies.🎭1014JL848JM2787 stars
vllm-project/vime2D AGOIntegrates vLLM rollout with Megatron training for LLM post-training and RL scaling.🎭1014SA1798213 stars
…eley/agents-last-exam2D AGOProvisions OS sandboxes, runs agent harnesses on long-horizon tasks, and grades outputs against hidden references.🎭1014PL1489521 stars
datacurve-ai/deep-swe2D AGOBenchmarks frontier coding agents on 113 long-horizon tasks from active open-source repos with isolated environments and program verifiers.🎭1014FO1953761 stars