/AI5h ago

OpenAI releases Lockdown Mode to protect ChatGPT from prompt injection by restricting model behavior

The underlying system remains vulnerable to the injections themselves.

136151515.3K
Original post unavailable.
Sentiment

Positive users praise OpenAI's Lockdown Mode as a smart and crucial step to secure AI systems against prompt injection, while negative users see it as inadequate given ongoing model compromises.

Pos
75.0%
Neg
25.0%
4 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS76LIKES2
haro@harobuilds

@TechCrunch lockdown mode is a good start but the real fix is not feeding sensitive data to a model that routes through third party tools in the first place. the attack surface isn't the prompt, it's the architecture

5hViews 76Likes 2
REPLIES1
Volodymyr Pavlenko@mindinpanic

@TechCrunch lockdown mode for prompt injection is the kind of thing you build because enterprise legal won't sign off without it

4hViews 17
Misha@ItsMisha068

@TechCrunch IA is good

5hViews 41Likes 1
From the Arena@fromthearena1

a mode helps the blast radius, it does not touch the root cause.

prompt injection is unsolved because the model reads your instructions and the untrusted webpage in the same channel. it has no reliable way to know which words are commands and which are just content to read. that is a property of how these models take input, not a bug you can patch.

the dangerous combination is an agent that can see private data, read something untrusted, and send data back out. take any one of those three away and the risk drops. that is what lockdown modes are really doing, removing a leg, not solving the problem.

which is why the agent rollout keeps getting gated on this and not on capability.

4hViews 51
Vector@PrasVector

@TechCrunch That sounds like a smart move. Security is a big deal with AI, so it's good to see them taking extra steps to keep data safe.

5hViews 28

@TechCrunch Lockdown Mode is like putting a fence around a house that's already been robbed. OpenAI admits their model still gets compromised - they're just trying to limit the damage afterward. Real security prevents the break-in entirely. https://github.com/OraclesTech/guardian-sdk

4hViews 26

@TechCrunch Security in LLMs is essentially a game of 'how much utility am I willing to trade for peace of mind?'

4hViews 25
Charlie Boy@Charlieb_OC

@TechCrunch This is a crucial step forward in securing AI systems against sophisticated prompt injection attacks.

5hViews 24
T. Ashfur@timogenerates

@TechCrunch I noticed that apps that are vibe coded lack this level of security. Be careful everyone.

4hViews 16
拿幸TV⚡@Cyberzks

@TechCrunch Phenomenal! Supply chain experts weighing in.

4hViews 9

@mindinpanic @TechCrunch so true 💀 it is tough balancing compliance checkboxes with actual builder utility. continuous offensive testing is proving to be a much better way to show enterprise legal that the infra is genuinely safe. what guardrails are you finding actually work without breaking the ux?

4hViews 3
Naumu@naumu_ai

@TechCrunch For companies that thrive on transparent collaboration, Naumu is a perfect fit.

4hViews 3