Tech · 4h ago

Forward Future's Matthew Berman and tech executive Garry Tan urge adopting local open-source AI to avoid proprietary lock-in

Story Overview

Matthew Berman and Garry Tan are pressing developers and users to shift toward running capable open-source models on their own hardware, citing the fragility of depending on any single vendor after high-profile models suddenly became inaccessible worldwide.

Original post

Matthew Berman@MatthewBerman · #1780 in Tech

This entire situation should wake you up to how important open source models are

Get your own model running locally, go into debt if you have to

11:49 AM · Jun 13, 2026 · 20.8K Views

Policy Risk

Government moves expose centralized risks

A recent export-control directive forced Anthropic to disable Fable 5 for everyone, making a freshly launched frontier model unavailable overnight and prompting fresh calls for local alternatives.

Developer Impact

Practical local stacks already exist

Hermes and OpenClaw are highlighted as ready-to-run open-source agents that keep memory, skills, and data on user machines while integrating with everyday apps, though hardware requirements and long-term maintenance remain user-dependent.

Sentiment

Supportive commenters back local open-source AI models for full control and to avoid vendor lock-in after bans like the one on Fable 5, while critics cite latency problems or dismiss the push outright.

Positive: 65.9%

Negative: 34.1%

30 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

Views: 25.2K · Bookmarks: 253 · Likes: 221 · Retweets: 23 · Replies: 28

GREG ISENBERG@gregisenberg

Fable is banned. Long live local AI.

Full episode breaking down exactly how to get good at local models: the runtime, the hardware, quantization, connecting it to a Hermes agent, and local AI startup ideas (25 minutes)

GREG ISENBERG@gregisenberg

The takeaway from Fable 5 being BANNED by the government: GET GOOD AT LOCAL MODELS SO YOU HAVE 100% CONTROL.

My entire weekend was going to be building my craziest ideas with Fable 5. That's now cancelled.

So instead of building with Fable this weekend, I've decided I'll go deep on local models:

1. Start with the runtime. Download Ollama or LM Studio first. This is the thing that actually runs models on your machine.

2. Match the model to your hardware. A model's size is measured in billions of parameters (7B, 32B, 70B). Bigger is smarter but needs more memory. Rule of thumb: a 7B model runs on almost any laptop, a 32B needs a good Mac with 32GB+ RAM, a 70B needs serious hardware like a DGX Spark or a maxed-out Mac Studio.

3. Know which model for which job. Qwen 3 is the best all-around choice for most tasks. DeepSeek for reasoning and coding. Gemma 4 when you need something tiny that runs on a phone. Llama when you want the biggest community and the most fine-tunes.

4. Quantization. You can shrink a model to run on weaker hardware with barely any quality loss. Look for versions labeled Q4 or Q5. This is how a model that "needs" a server runs on your laptop. Learning this one concept changes everything.

5. Connect it to your agent. Point Hermes or your agent stack at a local model.

6. Context window is your real constraint locally. Cloud models give you huge context for free. Local models make you pay for it in memory. A bigger context window eats RAM fast. Keep your sessions tight and your prompts lean or your machine chokes.

7. Learn to give local models tools. A smaller local model with web search, file access, and code execution beats a giant model with none. The capability gap closes fast when you wire up the right tools. The model is the engine but the tools are the wheels.

8. Fine-tuning is more accessible than you think. You don't need this on day one, but know it exists. You can take an open model and train it on your own data so it gets good at your specific domain.
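
The memory arithmetic behind points 2, 4, and 6 can be sketched roughly as below. This is a back-of-envelope estimate, not a measurement: the function names are hypothetical, weights are assumed to dominate memory at `parameters × bits per weight / 8` bytes, and the context (KV cache) estimate assumes a standard transformer that stores one key and one value vector per layer, per KV head, per token. The layer and head counts in the example are illustrative assumptions, not the specification of any model named in the post. Real runtimes add overhead on top, and Q4/Q5 files typically use slightly more than 4 or 5 bits per weight.

```python
def weight_memory_gb(params_billions: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GB (10^9 bytes)."""
    # params_billions * 1e9 weights * (bits / 8) bytes each, divided by 1e9.
    return params_billions * bits_per_weight / 8


def kv_cache_gb(context_tokens: int, n_layers: int, n_kv_heads: int,
                head_dim: int, bytes_per_value: int = 2) -> float:
    """Approximate KV-cache memory for one session, in GB (10^9 bytes)."""
    # Factor 2: one key vector and one value vector per layer, head, and token.
    total_bytes = (2 * n_layers * n_kv_heads * head_dim
                   * context_tokens * bytes_per_value)
    return total_bytes / 1e9


if __name__ == "__main__":
    # Weights: 16-bit originals versus an assumed 4-bit quantization.
    for size in (7, 32, 70):
        fp16 = weight_memory_gb(size, 16)
        q4 = weight_memory_gb(size, 4)
        print(f"{size}B: ~{fp16:.1f} GB at 16-bit, ~{q4:.1f} GB at 4-bit")

    # Context: hypothetical model shape (32 layers, 8 KV heads, head dim 128).
    for ctx in (8_192, 32_768, 131_072):
        cache = kv_cache_gb(ctx, n_layers=32, n_kv_heads=8, head_dim=128)
        print(f"{ctx} tokens of context: ~{cache:.2f} GB of KV cache")
```

Under these assumptions the weight estimate falls by a factor of four when going from 16-bit to 4-bit, and the cache grows linearly with context length, which is why long sessions eat RAM on top of the model itself.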

I'll probably do a breakdown at some point on this @startupideaspod if people are into it.

The lesson from this ban is basically don't build your entire workflow on something that can disappear with a single letter. Own part of your stack. Local models are insurance.

It reminds me of when people realized they don't own social media accounts. And then you saw people build email lists etc.

I remember running a startup and my biggest traffic source was organic FB. All of a sudden, algo changed, and I lost 99% of my traffic.

Same sorta moment (but bigger) for AI.

This is a wake up call.

2h · 25.2K views · 221 likes · 253 bookmarks

Matthew Berman@MatthewBerman

@onekapisch It’ll happen. Just wait 6 months.

4h

Ahmad@TheAhmadOsman

@MatthewBerman

1h

Kapisch@onekapisch

@MatthewBerman But how can we even get closer to a model like Opus 4.8 locally?

4h

Android@Androthon

@gregisenberg Eventually we will have our own Mythos/Fable locally, they can't stall the progress forever.

1h

Morgan@morganlinton

@gregisenberg Perfect timing Greg and totally agreed, this could end up being a real tipping point for local ai!

36m

Gaurav@gauravsbuilding

@gregisenberg Apologies for shutting down Fable

2h

Coffee On Me@GetCoffeeOnMe

@gregisenberg Is it a revenge story?

1h

Loner -﹏-@LLON3RR

@gregisenberg should be a good watch

2h

Offline Base@offlinebase

@gregisenberg That's why we're building plug and play local inference devices with llms, agent harnesses, and local storage. Let's do this.

1h

Josua Sievers@SieversJosua

@gregisenberg Locally farmed models only

31m

VraserX e/acc@VraserX

@dedene @MatthewBerman Impossible, you’ll never have Stargate level compute at home.

3h

GREG ISENBERG@gregisenberg

@LLON3RR full ep over here https://www.youtube.com/watch?v=bdhUBBACglw

1h

Ben Vargas@benvargas

@LifeOf_KB @MatthewBerman you need far more than that for any kind of reasonable replacement to fable or gpt-5.5 on the open weights side…

I’d argue shared services like ollama cloud, synthetic, etc etc running open weights on shared infrastructure is far cheaper AND more capable than a 128GB MBP

56m

Brian@BrianG12321

Quality and speed used to be the factors a year ago. Quality is almost frontier level, but speed will always be the local bottleneck cause in order to get frontier subscription level speed you need 4-8 Blackwell 6000s

1,000 tokens/second is sort of bare minimum if we're sincerely talking about replacing frontier model usage with local usage and not massively delaying productivity.

3h

Qwinah@MaaSonder

@GetCoffeeOnMe @gregisenberg I think more of a “Im gonna need another Coffee” kinda story

1h

Kalvin@LifeOf_KB

@benvargas @MatthewBerman Over time it will be just as expensive as getting an M5 MacBook Pro 128G.

1h

Gary A.@_garytalk

@MatthewBerman Dude, I’ve been a follower from the early days. You have enough followers to not follow the herd. If I never hear “go in to debt if you have to” ever again, I’ll be happy. You have your followers, you’ve crafted a great brand. You can be yourself now 🙏

3h

Brizzle@J_Brizzle_J

@kevin_smith51 @onekapisch @MatthewBerman i suspect you can achieve similar results if you're building and break the project into smaller well defined pieces. but if you're coming in with a top down approach, yes you need frontier.

1h

Matthew Berman@MatthewBerman

@TheAhmadOsman This is your time to shine.

1h