/Tech3h ago

AI Text Detectors Like Pangram Draw Criticism Over Fallibility and Misuse

95013215.2K

Original post unavailable.

Sentiment

Many users criticized Pangram and similar AI text detectors for high false positive rates that cause unwarranted damage to individuals and waste developer effort while dismissing the marketing claims as unrealistic.

Pos

0.0%

Neg

100.0%

5 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS450LIKES11

Ethan@torchcompiled

Firstly, false positive rate differs wildly by scenario of how the text was created. the 1-in-10,000 metric happens under the most ideal, sterile cases. Real-life scenarios and mixed text have far worse reliability

17h45011

BOOKMARKS1

Ethan@torchcompiled

The witch-hunting and call outs are doing unwarranted damage, and arguably this benefits publicity of the product.

17h8161

REPLIES2

Ethan@torchcompiled

@akhmxt Wait it’s paywalled?

17h63

Ethan@torchcompiled

The model is trained and validated on in-house text datasets of human text pre-llm era against "lab-grown-AI" text

17h19010

Ethan@torchcompiled

There's research papers citing that human spoken language patterns takes on Ai characteristics, after filtering for scripted works. Naturally we mimic the culture we're exposed to and adapt our language The training and validation on pre-llm human text doesn't account for that

17h927

kalim@akhmxt

@torchcompiled Should make the piece free / remove paywall for wider distribution

17h1942

Ethan@torchcompiled

Failure rates are conditional on the genre of text, and then a big one: the reported failure rate is a population average reflecting a heterogenous and imbalanced population. Some writers are paying the cost of the worst-case scenario, while others less, FPR is just an average.

17h1016

Ethan@torchcompiled

The classifier output is an inference, given we see text with XYZ patterns, what is the probability that it came from an LLM vs a human?

17h746

Ethan@torchcompiled

The Taylor Lorenz Case

17h2211

Ethan@torchcompiled

Model updates preserve similar or better false positive rates in average, but don't reveal how individual decisions change. There's a risk something scans as AI on monday but gets flagged on human on friday, and this can be the difference between a case and a nothingburger.

17h715

Ethan@torchcompiled

https://open.substack.com/pub/ethansmith2000/p/ai-text-detection-arms-dealers-in?r=jsutr&utm_medium=ios

17h2703

Ethan@torchcompiled

The evidence tab suffers from confirmation bias and the multiple hypothesis bias (when testing many things one is likely to come back true)

17h644

Ethan@torchcompiled

The studies of the metric on external datasets, APT, Grammarly, and BEEMO show that a mixed text can basically end up anywhere on the scale of AI to human. So the person who did light AI polish/editor work can easily be flagged human or fully AI

17h583

Ethan@torchcompiled

The majority of benchmarks are over internal datasets, the validation set which matches the qualities of the train set, basically a better suggestion of "did we avoid memorizing" than does this extrapolate to in-the-wild usage. External audits often follow same pattern

17h583

Ethan@torchcompiled

for mixed authorship/AI-assistance detection, most benchmarks not only use their own crafted datasets but they also create the labels, because there is really no ground truth for how much "AI-ness" a text has. There is high variance and disagreement with human evals here.

17h563

Ethan@torchcompiled

Ironically a Pangram blog incidentally reinforces this idea without saying it

17h542

Ethan@torchcompiled

A paper by Garland, reminds that this kind of classification, population averages of FPR over a whole validation set don't recognize that some cases are more challenging than others, and some folks are worse off than others for false positives

17h542

Ethan@torchcompiled

We may often use AI text detection to infer things like effort and the writing process, but - AI created article, doing all the ideation, rewritten by human might go under the radar - a light edit pass would likely hit the detection

17h502

Ethan@torchcompiled

Error reporting may have a survivorship bias

17h482

Ethan@torchcompiled

The "skin in the game" asymmetry

17h472