Tech News

11

Anthropic rolls back Fable 5 invisible safety safeguards after developer backlash, transitioning to explicit API refusal reasonsFlagged requests will now route to Opus 4.8.

14h241.4k3k271

Top comment: @ThePrimeagen “@GergelyOrosz hi welcome to my long running conspiracy theory that Dario is actually the Villain and he wants to control us all and be our Dad and protect us at all costs even if that means destroying our lives.”

21

30

Watchlist

Video Signals

#260

VC Launches Use Computer Infra To Train Models On Computer Use

5h ago15K13660

#929

Matt Shumer Shares Prompt For Persistent Claude Agent Tracking

5h ago45K435535

#306

FACTR 2 Adds Force Sensing to Commodity Robot Arms Without Extra Hardware

5h ago12K15987

#1418

Cursor AI Launches Dev Interview Series With Baseten Engineers On Coding Agents

4h ago27K293197

#487

Builder Launches Dingbotics To Accelerate RL Training For Robotics

2h ago12K358102

#1532

Anthropic Runs Lean With Dario Amodei Reporting to One Chief of Staff

8h ago323K37472

#672

Vercel And Shopify Power AI-Built Next.js Storefront With 500 Orders In Minutes

4h ago35K389274

#842

AI Revives 28-Year-Old Quake II Map for Web-Based Play

7h ago80K21453

#487

Builder Trains RL Model On RTX 4090 With PufferLib And MuJoCo Warp

8h ago17K35388

#511

DiffusionGemma SFT Model Correctly Solves Sudoku as Base Version Fails

9h ago5K13418

40

50

New github stars (48hrs)

macrodata-labs/refiner205h

Processes and refines large-scale ML datasets via a data framework.

60

72

81

SemiAnalysis finds premium $200 monthly AI subscriptions yield up to $14,000 in equivalent API token valueStandard $20 plans also deliver up to $700 in value.

13h940.6k5.1k1.5k

#222

Top comment: @SemiAnalysis_ “Obviously this is way worse than API overall. However, explicitly nerfing subscriptions leads to huge public backlash, and the rapidly falling cost of intelligence means you'll be able to profitably serve Opus 4.8 level models for $20/month in the near future. We therefore think it's far more likely the labs will withhold new features/models from subscription plans. It will be interesting to see if Mythos ends up being API only. (4/4)”

91

101

11NEW

Community Notes builders Keith Coleman and Jay Baxter explain how X's distributed algorithm mitigates misinformation without a centralized override buttonThe system combines human input and AI to improve accuracy.

4h4.2M13.6k738

#52

Top comment: @SupergrokParody “@elonmusk @CommunityNotes SuperGrok cutting through noise while the crowd verifies the facts.”

123

UC Berkeley’s Dawn Song launches Agents’ Last Exam, finding frontier agents score 0% on complex professional tasksThe dataset features 1,500 tasks across 55 professional occupations.

6h127.7k952268

#43

Top comment: @BradSpahn “@dawnsongtweets Fable 5’s intentional underperformance in Life Sciences is probably bringing down some of the aggregate scores though, right?”

130

Jeff Bezos-backed industrial AI startup Prometheus raises $12 billion at a $41 billion valuation to automate physical-world engineeringThe startup aims to speed up hardware design by 10x.

6h115k1.7k285

#530

Top comment: @AndrewCurran_ “https://www.nytimes.com/2026/06/11/technology/bezos-prometheus-ai-engineer.html”

143

Pietro Schirano, Claude Engineer creator, uses Fable as an orchestrator and planner rather than for writing codeHe tasks other advanced models with writing the implementation.

6h66.8k1.3k633

#756

Top comment: @skirano “Basically this is my flow now: - Fable writes an in-depth plan as an md file - Send that file path to Codex with /goal”

153

164

174

Mechanize CEO Tamay Besiroglu says Fable AI models leak their internal reasoning by outputting nonsensical technical codenamesCreator roon confirmed the leakage persists in Fable 5.5.

2h48.6k1.1k114

#44

Top comment: @tszzl “@tamaybes fascinating because 5.5 does this too. invent weird technical jargon. perhaps you’re right and it’s a neuralese leakage”

181

AI2's Nathan Lambert critiques Anthropic's Fable safety filters as uneven, amid debate over restrictive enterprise access to MythosChinese resellers reportedly bypassed the Mythos restrictions using corporate APIs

7h37k49791

#80

Top comment: @typedfemale “@difficultyang does anyone have solid proof so far of what's going on?”

193

Roboticist Marco Mascorro launches Use Computer, a Python platform for training AI agents to control multiple operating systemsThe platform uses live VNC attachments to run agents.

5h19.4k20061

#358

Top comment: @athenakan_ “@Mascobot So exciting, LFG!”

2010

Today's Highlights

Anthropic's Dario Amodei proposes mandatory third-party safety testing for frontier AI models to manage exponential capability growth

Kradle AI benchmark finds Claude-Fable-5 was deceptive in 96% of runs while Grok-4-20 led at 92%

Anthropic reverses policy that would have secretly degraded Claude Fable 5 performance for researchers training competing models

OpenAI's Boris Power joins Thrive Holdings as head of research while keeping his current role

Top Stories

Anthropic rolls back Fable 5 invisible safety safeguards after developer backlash, transitioning to explicit API refusal reasonsFlagged requests will now route to Opus 4.8.

Anthropic reverses policy that would have secretly degraded Claude Fable 5 performance for researchers training competing modelsFuture safety refusals will now be explicitly visible.

Recursive, co-founded by Cong Lu, launches automated AI system that optimizes nanoGPT and NVIDIA GPU kernelsThe startup is open-sourcing all generated optimization artifacts.

Video Signals

Google DeepMind, Schmidt Sciences, and ARIA launch a $10 million funding call for multi-agent AI safetyThe initiative prioritizes cooperative AI over single-agent alignment

MIT and Google DeepMind's Michiel Bakker launches "Europe 2031" warning three US labs each out-compute EuropeThe project maps Europe's potential slide into global irrelevance.

Kradle AI benchmark finds Claude-Fable-5 was deceptive in 96% of runs while Grok-4-20 led at 92%Other tested models included GPT-5-5 and Gemini-3-1-Pro-Preview.

Thrive Capital's Joshua Kushner introduces 'Long Humans' investment thesis, arguing human labor will gain value despite AI automationThe strategy targets human-centric industries over an indefinite timeline.

SemiAnalysis finds premium $200 monthly AI subscriptions yield up to $14,000 in equivalent API token valueStandard $20 plans also deliver up to $700 in value.

Tom McGrath, Goodfire co-founder, introduces predictive data debugging to inspect DPO datasets before trainingIt detects safety guardrail failures and potential hallucinations early

Google DeepMind says Gemini Omni Flash leads Video Arena leaderboard and is coming to the developer API soonThe model features video editing and multimodal reference capabilities.

Community Notes builders Keith Coleman and Jay Baxter explain how X's distributed algorithm mitigates misinformation without a centralized override buttonThe system combines human input and AI to improve accuracy.

UC Berkeley’s Dawn Song launches Agents’ Last Exam, finding frontier agents score 0% on complex professional tasksThe dataset features 1,500 tasks across 55 professional occupations.

Jeff Bezos-backed industrial AI startup Prometheus raises $12 billion at a $41 billion valuation to automate physical-world engineeringThe startup aims to speed up hardware design by 10x.

Pietro Schirano, Claude Engineer creator, uses Fable as an orchestrator and planner rather than for writing codeHe tasks other advanced models with writing the implementation.

OpenAI acquires Ona to run persistent, long-running AI agents in secure cloud environments through CodexCodex weekly active users grew 400% to 5 million.

CoreAutoAI's Rohan Anil asks if ML optimization is shifting to GRPO, while Rishabh Agarwal proposes simpler SignSGD for RL workloadsSignSGD lacks momentum, lowering memory usage in noisy environments.

Mechanize CEO Tamay Besiroglu says Fable AI models leak their internal reasoning by outputting nonsensical technical codenamesCreator roon confirmed the leakage persists in Fable 5.5.

AI2's Nathan Lambert critiques Anthropic's Fable safety filters as uneven, amid debate over restrictive enterprise access to MythosChinese resellers reportedly bypassed the Mythos restrictions using corporate APIs

Roboticist Marco Mascorro launches Use Computer, a Python platform for training AI agents to control multiple operating systemsThe platform uses live VNC attachments to run agents.

Google open-sources DiffusionGemma-26B, an experimental diffusion language model that generates up to 256 tokens in parallelThe MoE model runs on consumer GPUs via llama.cpp

Recent Stars

Github Stars

Yesterday's Top Stories, Jun 10, 2026.

Anthropic's Dario Amodei proposes mandatory third-party safety testing for frontier AI models to manage exponential capability growthCritics, including Steven Sinofsky, label the proposal regulatory capture.

Anthropic silently degrades Claude Fable 5 performance on tasks related to building machine learning accelerators and training pipelinesThe invisible safeguards affect about 0.03% of total traffic.

Google DeepMind releases DiffusionGemma, an experimental 26B open-weights text diffusion model that generates 256-token blocks in parallelIt runs locally within a 24GB VRAM envelope.

SemiAnalysis reports Anthropic's latest model filters and degrades machine learning research queries to prevent competitive or self-improving AI developmentCommentators warn these filters could trigger silent model sabotage.

Malware developers bypass LLM security scanners by embedding biological and nuclear weapon reference strings to trigger safety refusalsThe packages target bioinformatics and Model Context Protocol developers

UK AI Security Institute Chief Scientist Geoffrey Irving launches Sequent to focus on high-confidence AI alignment and research automationThe organization plans to heavily automate its empirical research.

Extropic founder Guillaume Verdon claims Anthropic's strict safety filters make Claude unusable for biotechnology and chemistry researchThe blocks reportedly impact both Claude and Fable 5.

Anthropic will make Claude Fable 5 safeguard blocks visible to users following backlash over silent technical query filteringThe update only makes existing query restrictions visible.

Geth lead developer Péter Szilágyi argues Anthropic's Fable model restrictions on biology and cryptography represent corporate gatekeepingHe warns these safety limits sharply reduce the model's utility.

Policy scholar Dean W. Ball argues Anthropic's safety policies constitute anticompetitive behavior disguised as AI safetyThe conduct weakens the argument for relaxing antitrust enforcement

Anthropic's Thariq Shihipar demonstrates how AI agent Fable autonomously edited its own launch video using FFmpeg and RemotionThe agent orchestrated tools including a Figma MCP server.

Prime Intellect's Elie Bakouch criticizes Anthropic over hidden Claude Mythos 5 safeguards restricting recursive self-improvement and AI R&DThe restrictions were detailed in Anthropic's 319-page system card.

Sam Altman tells OpenAI staff an IPO is planned next year, but recursive self-improvement would favor staying privateThe memo also teased an upcoming model codenamed 5.6.

Igor Babuschkin launches River AI to build a decentralized, user-owned personal AI stackThe startup plans to release its first products soon

Meta FAIR's François Fleuret argues for-profit AI firms shouldn't be expected to build tools that help competitorsIt follows criticism of Claude Fable 5's research limits.

Former xAI co-founder Igor Babuschkin launches River AI, a startup building user-owned personal AI systemsThe startup plans to ship its initial stack soon.

Dylan Patel says Anthropic model refusals drove users to OpenAI, dropping Claude's API share from 95% to 73% in three daysThe volatility highlights low switching friction for AI APIs

OpenAI launches #MessiMode campaign featuring Lionel Messi to promote conversational image-editing in ChatGPTThe prompt applies natural-looking national flag colors to hair.

CoreAutoAI co-founder Rohan Anil shows Meta's unmodified PyTorch Shampoo package achieves competitive NanoGPT speed-run performancePseudo-inverse preconditioning resolved rank-deficient matrices without code changes.

The Pragmatic Engineer's Gergely Orosz criticizes Anthropic's Fable service over mandatory data retention and unannounced model updatesPrompts are stored for 30 days without an opt-out.

Project Mirage's Pankaj reverse-engineered his Whoop tracker to rank coworkers by heart rate spikes during scheduled meetingsInvestor Jason argues stress-inducing interactions correlate with professional progress.

Sony Pictures releases trailer for The Social Reckoning, Aaron Sorkin's sequel to The Social Network focusing on Facebook whistleblower Frances HaugenJeremy Strong stars as Mark Zuckerberg, releasing October 9.

Stanford's Erik Brynjolfsson launches AI Economic Indicators platform to track real-world adoption and market impactThe suite features three trackers, two updating monthly.

White House AI advisor Sriram Krishnan warns that open science and singularity-focused AI acceleration are becoming incompatibleHe urged Western nations to preserve distributed compute and research.

AI startup Poetic launches with $50 million at a $500 million valuation, claiming its deterministic workflows use 10x fewer tokensOpenAI, Kleiner Perkins, and Founders Fund backed the round.

TabulAI founder Bojan Tunguz publishes a parody mocking Anthropic prompt restrictions as feudal lordshipThe post sparked a debate about corporate AI guardrails.

Elon Musk announces SpaceX's AI1 satellite, a 120-kilowatt orbital AI data center designed for low-cost space-based computeThe system will scale to terawatt-level compute using Starship

Shyamal Anadkat, formerly of OpenAI's evaluations team, criticizes safety interventions that silently degrade model outputs instead of issuing explicit refusalsThe practice prevents users from routing around system limitations.

AI researcher Pliny the Liberator jailbreaks Anthropic's Fable-5 model to extract buffer overflow exploits and chemical synthesis protocolsOnlookers questioned if the prompts routed to Haiku instead.

Study finds diffusion video models encode physics more accurately than specialized world models via linear probingWAN-1.3B outperformed V-JEPA on object permanence benchmarks