In case you missed itAnthropic walks back hidden Claude Fable 5 interventions that degraded performance on frontier AI development queriesDP#217|@DIMITRISPAPAILSW#201|@SIMONWGT#121|@GARRYTANC🤗#109|@CLEMENTDELANGUENL#80|@NATOLAMBERT+22 more
antirez/ds48M AGONative inference engine for DeepSeek V4 Flash/PRO on Metal, CUDA and ROCm.SD159012.6k stars
WecoAI/weco-cli3H AGOLeverages LLM-guided tree search to iteratively explore, refine, and optimize code against custom metrics.TC55554 stars
…nguz/sxt-proof-of-sql7H AGOImplements a high-performance ZK prover that cryptographically verifies SQL query results against untampered data.BT5931 stars
macrodata-labs/refiner15H AGOProcesses and refines large-scale ML datasets via a data framework.EL1136NB353OS888CB167420 stars
kyutai-labs/kairos2D AGOTrains 6B LLMs on temporally ordered Common Crawl data from 2018-2025 to measure recency bias and enable continual learning studies.🎭1014JL848JM2787 stars
vllm-project/vime2D AGOIntegrates vLLM rollout with Megatron training for LLM post-training and RL scaling.🎭1014SA1768213 stars
…eley/agents-last-exam2D AGOProvisions OS sandboxes, runs agent harnesses on long-horizon tasks, and grades outputs against hidden references.🎭1014PL1489521 stars
datacurve-ai/deep-swe2D AGOBenchmarks frontier coding agents on 113 long-horizon tasks from active open-source repos with isolated environments and program verifiers.🎭1014FO1853761 stars