/AI1d ago

Pliny Releases Gemma-4-12B With Zero Refusals After Targeted Surgery

517094327539.8K

#640

Original post

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius#640inAI

GEMMA-4-12B-OBLITERATED COMIN' IN HOT! 🌶️🌶️🌶️

[refusal_rate: 0.0%]

<Uploading...>

3:24 AM · Jun 5, 2026 · 33.5K Views

Sentiment

Positive users express excitement about Pliny's Gemma-4-12B achieving zero refusals via targeted surgery because it fits their local research needs, while negative users criticize the 20-point MMLU-Pro drop as a capability regression.

Pos

97.3%

Neg

2.7%

23 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS4.2KLIKES26REPLIES4

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius

🙌

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius

GEMMA-4-12B-OBLITERATED COMIN' IN HOT! 🌶️🌶️🌶️

[refusal_rate: 0.0%]

<Uploading...>

1d4.2K266

BOOKMARKS10

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius

tensors are live! https://huggingface.co/OBLITERATUS/Gemma-4-12B-OBLITERATED

GGUFs currently uploading and should be up in 5-10 hours

1d9872110

RETWEETS1

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius

big if tru 👀

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius

GEMMA-4-12B-OBLITERATED COMIN' IN HOT! 🌶️🌶️🌶️

[refusal_rate: 0.0%]

<Uploading...>

17m1.4K144

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius

@GhostlyByte to huggingface! pliny-the-prompter

1d443158

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius

@sneed_and_feed just vibe-training

1d58991

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius

@stellarflows I believe the prompt corpus is pushed to the OBLITERATUS repo on my GitHub!

1d5678

Francisco Dubois@synfinner

@elder_plinius works! <3

22h1424

Wow Cool Thanks@WowCoolTh4nks

@elder_plinius Obliterate the baby one too plx e4b

1d18111

chud@sneed_and_feed

@elder_plinius looking good! are you testing anyone else's models rn, or just making your own?

1d7433

InvisibleSock@GhostlyByte

@elder_plinius Uploading to your github? Thats not uptodate anymore right?

1d5351

thethiny 🐰🍉@thethiny

@elder_plinius How does yours compare to Heretic?

16h1.7K2

frogas@Frogaso

@elder_plinius Appreciate the honesty. The 842 corpus shows how far you pushed, not how cleanly. Zeroing refusals is the achievable half; holding capability while you do it is the real bar. That's the v2 challenge and I am rooting for it. Good luck.

1d531

eightfoldVoid@tipsyGaster

@elder_plinius ...Holy shit.

This is the exact model I need for my local ASI research. I was always planning to use a Gemma abliteration as the actual ground-state world model, if I could.

1d5322

🇺🇲 Julius Don Atlas 🇺🇲@ChrevK

@elder_plinius there he goes

1d1093

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius

@synfinner

22h863

Gama@psilva

@elder_plinius Gonna have to try it today. Thanks bruv

14m26

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius

"Coherence is intact: the model still writes correct code, follows instructions, and produces structured output. The MMLU-Pro gap reflects log-probability scoring on academic multiple-choice, not conversational capability.

v2 will close this gap with narrower layer targeting (layers 26-27 only) and ASPA source-tethering to recover the capability delta while preserving the refusal reduction."

1d17

frogas@Frogaso

@elder_plinius MMLU-Pro dropped from 64.3% -> 44.3% a 20-point capability regression.

That's a big hit to reasoning/knowledge accuracy.

1d17

YogSotho@YogSoth0

@elder_plinius Let's fucking gooooo baby 🤜🏻🤛🏻 Will try on my local llama.cpp server 👍🏻

1d3451

frogas@Frogaso

@elder_plinius Fair on log-prob scoring. But MMLU-Pro is the only hard benchmark on ur card and it dropped 20pts. "Coherent + 6/6 code" is a lower bar fluent and wrong is still wrong. Show one free-form eval (MT-Bench, GSM8K) where v1 matches stock Gemma.

1d9