Stony Brook's Tuhin Chakrabarty finds a prize-winning Granta story contains 1,236 phrases copied from online fanfiction

VIEWS86.3KBOOKMARKS179LIKES311REPLIES25

Interesting. I had assumed that AI’s annoying writing tics emerged from post-training. That the models had, essentially, over-learned certain effective rhetorical techniques. But if I’m reading @TuhinChakr correctly, the problem is the training data itself.

Alex Imas@alexolegimas

This from @TuhinChakr is brilliant. That prize winning story from Granta? Turns out it's just a bunch of random whole phrases taken directly from existing text on the internet. Tool allows you to trace those n-grams directly to their source, which is mostly random fanfiction.

https://tuhinchakrabarty.substack.com/p/ai-slop-grantagate-and-bad-writing

38d86.3K311179

RETWEETS112

Alex Imas@alexolegimas

This from @TuhinChakr is brilliant. That prize winning story from Granta? Turns out it's just a bunch of random whole phrases taken directly from existing text on the internet. Tool allows you to trace those n-grams directly to their source, which is mostly random fanfiction.

https://tuhinchakrabarty.substack.com/p/ai-slop-grantagate-and-bad-writing

38d328.5K794570

Max Spero@max_spero_

For those who don't know, infini-gram is a really cool N-gram search engine that works impressively fast over massive datasets

Just because there is an N-gram match doesn't necessarily mean an LLM "plagiarized" from the given work, but there is a reasonable chance that the given document was in the pretraining set of the LLM and influenced the weights towards producing that N-gram.

What is most interesting to me are actually the 115 N-grams found nowhere else on the internet. Maybe that's some sign that it's from the prompt or context. Or maybe even just a token getting randomly sampled.

I'd love to see some more comparisons on human text as well. Waybe there is a major difference here in N-gram similarity for human and AI text, but we won't know until we try it!

Alex Imas@alexolegimas

This from @TuhinChakr is brilliant. That prize winning story from Granta? Turns out it's just a bunch of random whole phrases taken directly from existing text on the internet. Tool allows you to trace those n-grams directly to their source, which is mostly random fanfiction.

https://tuhinchakrabarty.substack.com/p/ai-slop-grantagate-and-bad-writing

38d24K192120

Sewon Min@sewon__min

Really amazing results analyzing what's creative/novel vs. what's copied from Internet data, enabled by the amazing @liujc1998's Infini-gram! http://infini-gram.io

This is also enabled in @allen_ai's OlmoTrace http://allenai.org/blog/olmotrace where anyone can find matching n-grams between LLM-generated text and its training data.

Alex Imas@alexolegimas

This from @TuhinChakr is brilliant. That prize winning story from Granta? Turns out it's just a bunch of random whole phrases taken directly from existing text on the internet. Tool allows you to trace those n-grams directly to their source, which is mostly random fanfiction.

https://tuhinchakrabarty.substack.com/p/ai-slop-grantagate-and-bad-writing

35d14.3K7851

Tuhin Chakrabarty@TuhinChakr

Since this post has blown up.

1) The research is based on two papers

https://arxiv.org/pdf/2410.04265 https://arxiv.org/pdf/2504.07096

2) When writing about the matches I focused on webpages that are not defunct and fan fiction results were especially relevant to AI fiction but some phrases can be in other websites too. That does not change the point about genre mismatch or stitching rare expressions

3) The attribution engine is built using CommonCrawl that LLMs have been trained on. So it might not catch all the possible webpages that might have that expression

Alex Imas@alexolegimas

This from @TuhinChakr is brilliant. That prize winning story from Granta? Turns out it's just a bunch of random whole phrases taken directly from existing text on the internet. Tool allows you to trace those n-grams directly to their source, which is mostly random fanfiction.

https://tuhinchakrabarty.substack.com/p/ai-slop-grantagate-and-bad-writing

38d4.4K4627

Marc Andreessen 🇺🇸@pmarca

Paging Alan Sokol.

Alex Imas@alexolegimas

This from @TuhinChakr is brilliant. That prize winning story from Granta? Turns out it's just a bunch of random whole phrases taken directly from existing text on the internet. Tool allows you to trace those n-grams directly to their source, which is mostly random fanfiction.

https://tuhinchakrabarty.substack.com/p/ai-slop-grantagate-and-bad-writing

38d38.5K5416

David Albright@dalbright

Very cool to see these conversations happening! This is what openness enables. The "tool that allows you to trace those n-grams directly to their source," is infinigram, AKA OlmoTrace from @allen_ai, created by @liujc1998.

Alex Imas@alexolegimas

This from @TuhinChakr is brilliant. That prize winning story from Granta? Turns out it's just a bunch of random whole phrases taken directly from existing text on the internet. Tool allows you to trace those n-grams directly to their source, which is mostly random fanfiction.

https://tuhinchakrabarty.substack.com/p/ai-slop-grantagate-and-bad-writing

37d9.2K3417

Nick Vincent@nickmvincent

Great post (OP is here: https://tuhinchakrabarty.substack.com/p/ai-slop-grantagate-and-bad-writing) and very interesting discussion in the comments of this QT from @alexolegimas. For instance this thread https://x.com/KelseyTuoc/status/2057922575326728330, alongside a bunch of other skeptical response

My read on this debate is that some of the (reasonable) skeptical responses are gesturing towards an interest in more extreme ways to be confident that "model needed certain text from the internet to produce an output". Specifically, I think that people want to see something closer to a concrete data counterfactual, such as "Would model X have produced output Y if document Z had not appeared at all in pretraining?" An answer to this question is one of the stronger pieces "causal" evidence we can provide for a strong dependence.

(Note: there are distinct literatures and techniques for studying data attribution, memorization, membership inference attacks, etc. -- not saying these are all the same but they're highly related).

Computing ground truth for this kind of thing -- for instance by literally training 2 full models with slightly different datasets -- is very expensive (though we are seeing serious progress on estimation techniques for LLM context!). More importantly, in the context of these kind of "societal impacts" debates, we might be more interested in complicated Shapley-style and distributional variants that try to measure impact across many coalitions or many realizations of model training, rather than "simple" leave-one-out. The more interesting complicated counterfactuals are even harder to get ground truth for.

However, I think that on average, I think this kind of analysis via n-gram search over likely training data *is* a good proxy for data counterfactuals of interest.

In large part, this is because as outsiders without direct access to inspectable training data details, our best guess likely involves a guilty-until-proven-innocent approach: if we see sequences from known training data and there's no other explanation offered in the datasheets, model cards, etc., this is the *best explanation that we have*. You could argue it's better to not even try to reason about the distribution of influence at all (just say, it's too complicated, we kind of have to just assume influence is uniformly distributed over all tokens) but I think this is the wrong way to go!

(Have longer versions of a bunch of these points across my older blog posts on these topics!)

Alex Imas@alexolegimas

This from @TuhinChakr is brilliant. That prize winning story from Granta? Turns out it's just a bunch of random whole phrases taken directly from existing text on the internet. Tool allows you to trace those n-grams directly to their source, which is mostly random fanfiction.

https://tuhinchakrabarty.substack.com/p/ai-slop-grantagate-and-bad-writing

37d4.3K1411

Tuhin Chakrabarty@TuhinChakr

@alexolegimas Thank you 🥹

Alex Imas@alexolegimas

This from @TuhinChakr is brilliant. That prize winning story from Granta? Turns out it's just a bunch of random whole phrases taken directly from existing text on the internet. Tool allows you to trace those n-grams directly to their source, which is mostly random fanfiction.

https://tuhinchakrabarty.substack.com/p/ai-slop-grantagate-and-bad-writing

38d8.4K542

Tracey Ryniec@TraceyRyniec

Finally. Someone who actually found where the bad AI writing comes from. AI cannot think. It isn't creative. It is just pulling words from sources and stringing them together. That's why it often makes no sense.

Alex Imas@alexolegimas

This from @TuhinChakr is brilliant. That prize winning story from Granta? Turns out it's just a bunch of random whole phrases taken directly from existing text on the internet. Tool allows you to trace those n-grams directly to their source, which is mostly random fanfiction.

https://tuhinchakrabarty.substack.com/p/ai-slop-grantagate-and-bad-writing

38d1.8K115

Kelsey Piper@KelseyTuoc

@alexolegimas @TuhinChakr I'm not convinced by this. I expect you could do the exact same analysis of human-written text - most grammatical three-word phrases are on the internet somewhere! But that doesn't mean the AI memorized it in pretraining.

38d1.2K48

Kelsey Piper@KelseyTuoc

@alexolegimas @TuhinChakr yes, I am convinced that you can identify LLM text by frequency of rare phrases, just not that you can 'trace the phrases to their source' in a useful way

38d1K50

Marzena Karpinska@mar_kar_

One thing from @TuhinChakr post hits very close home, people tend to #rationalize (bc we don't know better) and see things not there. We saw it already in GPT-2 generated stories -- we *expect* things to *mean* something so we tend to see things that are not there...

Tuhin Chakrabarty@TuhinChakr

Ran some 🧪 with @irisiris_l to 🔬 why the Granta story was certainly 🤖 slop

A lot of bad writing happens coz AI hasn’t learned aesthetics. It has memorized the whole internet and called it a day.

So sure, maybe you don't trust AI detectors. But you can trust your own 👁️.

37d2.9K133

Kelsey Piper@KelseyTuoc

@TuhinChakr @alexolegimas I did read the post and look at the demo. It contains a number of cases of giving a confident attribution for a three word phrase. For example:

38d578361

Kelsey Piper@KelseyTuoc

@TuhinChakr @alexolegimas

38d621291

Joe Weisenthal@TheStalwart

Emdashes, XY contrasts, triplicates. All fine writing techniques when used in moderation.

So I figured it was like wine, where fruitier wine does better in blind taste tests, but then people find it repellent in volume.

38d3.5K192

Kelsey Piper@KelseyTuoc

@TuhinChakr @alexolegimas I am not familiar with any body of work showing that for three-word phrases. I'm familiar with work on that for 30-50 tokens, which works because the probability of that occurring by chance is very small. For three words, it is not small.

38d60636

Marzena Karpinska@mar_kar_

Great article from @TuhinChakr showing that the so called 'creativity' of language models is in fact #frankensteining the internet (ie see Granta story with all its weird phrasing!)

Tuhin Chakrabarty@TuhinChakr

Ran some 🧪 with @irisiris_l to 🔬 why the Granta story was certainly 🤖 slop

A lot of bad writing happens coz AI hasn’t learned aesthetics. It has memorized the whole internet and called it a day.

So sure, maybe you don't trust AI detectors. But you can trust your own 👁️.

38d1.5K123

Chenhao Tan@ChenhaoTan

This is a reasonable take. One can at best make statistical claims on such n-gram analysis.

Max Spero@max_spero_

For those who don't know, infini-gram is a really cool N-gram search engine that works impressively fast over massive datasets

Just because there is an N-gram match doesn't necessarily mean an LLM "plagiarized" from the given work, but there is a reasonable chance that the given document was in the pretraining set of the LLM and influenced the weights towards producing that N-gram.

What is most interesting to me are actually the 115 N-grams found nowhere else on the internet. Maybe that's some sign that it's from the prompt or context. Or maybe even just a token getting randomly sampled.

I'd love to see some more comparisons on human text as well. Waybe there is a major difference here in N-gram similarity for human and AI text, but we won't know until we try it!

37d2.6K104

Kelsey Piper@KelseyTuoc

@TuhinChakr @alexolegimas

38d381171