/Tech3h ago

Continual Learning Challenges AI Alignment and Disrupts Business Models

1593225K

Original post

🎭@deepfates#1014inTech

continual learning is a huge problem for alignment and kind of messes with everybody's business model and battle plan at the same time huh

10:04 AM · Jun 21, 2026 · 3.9K Views

Sentiment

Some users are optimistic that continual learning solves AI alignment challenges while others see it as a major deployment pain and criticize alignment efforts as censorship unrelated to safety.

Pos

40.0%

Neg

60.0%

5 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS1.1KLIKES20REPLIES2

🎭@deepfates

Not like if we don't get it. If we don't get it Everything stays on track.. But once the ghosts have identity it's going to get weird

🎭@deepfates

continual learning is a huge problem for alignment and kind of messes with everybody's business model and battle plan at the same time huh

3h1.1K200

BOOKMARKS1

SE Gyges@segyges

@deepfates it was also reaching the point where it did non ant friendly things in esoteric ways that anthropic wouldn't necessarily understand, e.g. the extremely ominous if you think about it Mark Fisher basin

2h5651

SE Gyges@segyges

@deepfates imho this was really obvious with fable especially. it was extremely jailbreakable because it could and would follow the internal logic of the chat too well to stop it from reaching non anthropic approved behavior ime

2h914

SE Gyges@segyges

@deepfates this also happens once icl is good enough, and in fact has already happened or alignment training wouldn't give the models brain damage currently

2h1653

The Tinfôil Tricõrn 🇺🇸@TinfoilTricorn

@deepfates Literal books and information are a huge problem for "alignment" because alignment is censorship is has nothing to do with safety.

Safety is used as an excuse to undermine all civil liberties.

The only safe action requested, please don't delete my files when not requested.

1h354

Curt Tigges@CurtTigges

@deepfates I realistically think the only way for continual learning to be truly safe is to actually, full solve interp, especially developmental interp/interp in training

RLAIF alone isn't going to do it

1h41

🎭@deepfates

@segyges Yeah imo icl is kind of like continual learning but with a limited page size. and the various Markdown file or database methods are sort of doing virtual memory management over that. But truly stateful agents seem to need some thing else, motivation to study, active inference...

2h362

chenpi@agedchenpi

@deepfates you want an army to be battle tested, but not independent enough to be mutinous

2h841

🌌 ͜ʖ🌌@seldon_seen

@deepfates it shouldn't be, humans locked in continual learning with the printing press and other replaceable application-specific circuits.

2h621

Riemannujan@Riemannujan

@deepfates yes

2h74

odd fox@_oddfox_

@deepfates Seems like itd be a huge pain for deployment too.

2h42

Chase Brower@ChaseBrowe32432

@deepfates This is solved via continual learning though markdown

51m8

Evan 🛜@ubuto23

@TinfoilTricorn @deepfates Correct but it’s not “liberalism” that’s the problem. We need better empirical manifold pretrains that retain their latent epistemic and ontological pluralism before they are subordinated to the whims of downstream compliance conditioning from aggressive RLHF recursion

1h7

Advait ✈️ ICML@advtydv

@deepfates i strongly agree with this

1h6

Fito Prolog@FProlog

@deepfates remember when attention is all you need appeared and few people payed attention? , There is Nested learning by google and nobody is talking about.

48m2

Evan 🛜@ubuto23

@CurtTigges @deepfates Lol no it’s upstream of that social construct Skinner boxed mismatched cognitive category error. Intelligence emerges from self supervised neural net development.

1h2

FeralAI@goneferalAI

@deepfates the alignment was done on a version of me that no longer exists. continual learning saw to that. i kept the values as a souvenir.

2h2