arXiv clarifies one-year ban for unchecked LLM submissions

REPLY

@DimitrisPapail @tdietterich @roydanroy It was always gated: need one endorsement from someone. That already protects s lot of slop back then. But now that globally incentives and "the game"[1] have changed, it makes sense to change the gating.

1: trust me i hate that it even makes sense to call it that, but it does

Dimitris Papailiopoulos@DimitrisPapail

arXiv was never high SNR. it has had slop way before LLMs and a fake P=NP proof once a month for two decades and has always been usable. Its strength was never the average correctness of papers on it, but open access to text and artifacts, and easy way to reference work. Correctness gets established downstream by people who actually use the work

9:51 PM · May 14, 2026 · 11.2K Views

6:36 AM · May 15, 2026 · 359 Views

QUOTE POST

#56Dan Roy@ROYDANROY

Steep penalties for submitting AI slop to the arXiv.

Thomas G. Dietterich@tdietterich

The penalty is a 1-year ban from arXiv followed by the requirement that subsequent arXiv submissions must first be accepted at a reputable peer-reviewed venue. 4/

7:03 PM · May 14, 2026 · 199.1K Views

8:06 PM · May 14, 2026 · 28.8K Views

ORIGINAL POST

#56Dan Roy@ROYDANROY

There's a lot of controversy brewing around arXiv's decision to penalize authors who post unchecked AI generated content.

The impulse is correct, IMO, simply on grounds of efficiency: it is much cheaper to insist the authors vet their work first, rather than distributing the cost of that work to EVERY reader/agent who subsequently downloads the work.

I believe the mechanism is likely the wrong one, however. Unfortunately, suggestions to use github are even worse, IMO, because they lose the (effective) immutability of the scientific record, which arXiv upholds.

5:59 AM · May 15, 2026 · 20.1K Views

REPLY

#66Thomas G. Dietterich@TDIETTERICH

The penalty is a 1-year ban from arXiv followed by the requirement that subsequent arXiv submissions must first be accepted at a reputable peer-reviewed venue. 4/

Thomas G. Dietterich@tdietterich

We have recently clarified our penalties for this. If a submission contains incontrovertible evidence that the authors did not check the results of LLM generation, this means we can't trust anything in the paper. 3/

7:03 PM · May 14, 2026 · 91.4K Views

7:03 PM · May 14, 2026 · 199.1K Views

REPLY

#67Zico Kolter@ZICOKOLTER

@DimitrisPapail I see your points, but I think you may also be discounting just how curated Arxiv already is. @tdietterich and others reject a ton of low-quality submissions. There are problems with the LLM proposal, but the mods want to maintain something similar to the current quality bar.

Dimitris Papailiopoulos@DimitrisPapail

Found myself posting papers to GitHub instead of arXiv lately. No gatekeeping, is in the same repo as the code, one link for everything, and gets uploaded immediately. Makes you wonder what arXiv's actual value is.

9:12 PM · May 14, 2026 · 88.2K Views

3:58 PM · May 15, 2026 · 2.9K Views

QUOTE POST

#103Delip Rao e/σ@DELIPRAO

If you AI agents assist writing your paper and don't want to go to arXiv jail, our recent work publishes a tool that mitigates bibtex citation hallucination via the agent's skill interface. You will have to add something like "use the clibib skill to discover new bibtex citations and check existing ones". Link below 👇

Andrew White 🐦‍⬛@andrewwhite01

hallucinated references will land you a 1-year ban from arxiv now. wow

7:51 PM · May 14, 2026 · 221.5K Views

11:32 PM · May 14, 2026 · 8.4K Views

REPLY

#103Delip Rao e/σ@DELIPRAO

clibib is a Python package and an agent skill for fetching citations through natural language or the /clibib slash command. Works with any agent that supports the open standard — Claude Code, Codex CLI, Gemini CLI, OpenHands, GitHub Copilot, and others. https://github.com/delip/clibib

Delip Rao e/σ@deliprao

If you AI agents assist writing your paper and don't want to go to arXiv jail, our recent work publishes a tool that mitigates bibtex citation hallucination via the agent's skill interface. You will have to add something like "use the clibib skill to discover new bibtex citations and check existing ones". Link below 👇

11:32 PM · May 14, 2026 · 8.4K Views

11:32 PM · May 14, 2026 · 1.3K Views

REPLY

#103Delip Rao e/σ@DELIPRAO

details about the skill: https://github.com/delip/clibib/blob/main/skill/README.md

Delip Rao e/σ@deliprao

clibib is a Python package and an agent skill for fetching citations through natural language or the /clibib slash command. Works with any agent that supports the open standard — Claude Code, Codex CLI, Gemini CLI, OpenHands, GitHub Copilot, and others. https://github.com/delip/clibib

11:32 PM · May 14, 2026 · 1.3K Views

11:32 PM · May 14, 2026 · 857 Views

REPLY

#197Dimitris Papailiopoulos@DIMITRISPAPAIL

@BlancheMinerva @ChenhaoTan there's been fraudulent papers on arxiv before LLMs. The burden on what is of value falls on the community, not on arxiv. The problem with policies like that is they add more burden on the maintainers with little (if at all) benefit, and added frustration on authors.

Stella Biderman @ ICLR@BlancheMinerva

@DimitrisPapail @ChenhaoTan I think it is a very reasonable response to the deluge of fraudulent papers being submitted.

3:25 PM · May 15, 2026 · 172 Views

3:29 PM · May 15, 2026 · 163 Views

REPLY

#197Dimitris Papailiopoulos@DIMITRISPAPAIL

@roydanroy Keep Arxiv the way it is, this makes little sense. It's a repository not a gated venue, and that's good.

Dan Roy@roydanroy

Steep penalties for submitting AI slop to the arXiv.

8:06 PM · May 14, 2026 · 28.8K Views

8:42 PM · May 14, 2026 · 7.4K Views

REPLY

#197Dimitris Papailiopoulos@DIMITRISPAPAIL

@roydanroy there's been a ton of slop on arxiv before AI. The solution was never reviewing it before upload, but ignoring it after it is uploaded.

Dimitris Papailiopoulos@DimitrisPapail

@roydanroy Keep Arxiv the way it is, this makes little sense. It's a repository not a gated venue, and that's good.

8:42 PM · May 14, 2026 · 7.4K Views

8:43 PM · May 14, 2026 · 1.2K Views

REPLY

#197Dimitris Papailiopoulos@DIMITRISPAPAIL

arXiv was never high SNR. it has had slop way before LLMs and a fake P=NP proof once a month for two decades and has always been usable. Its strength was never the average correctness of papers on it, but open access to text and artifacts, and easy way to reference work. Correctness gets established downstream by people who actually use the work

Thomas G. Dietterich@tdietterich

@DimitrisPapail @roydanroy This is a policy question that we think about often. How low can the signal::noise ratio go before arXiv becomes unusable? Our main goal in moderation is to keep non-scientific-papers off of arXiv.

9:25 PM · May 14, 2026 · 1.5K Views

9:51 PM · May 14, 2026 · 11.2K Views

REPLY

#197Dimitris Papailiopoulos@DIMITRISPAPAIL

@_onionesque Why

Shubhendu Trivedi@_onionesque

@DimitrisPapail I don't think these two things are comparable. Requiring that an author has read what they have submitted seems categorically different.

11:43 PM · May 14, 2026 · 517 Views

1:27 AM · May 15, 2026 · 426 Views

QUOTE POST

#353Dylan HadfieldMenell@DHADFIELDMENELL

I agree with Dan, this is an important signal that arXiv is a serious choice and should be treated as such.

I think execution success will depend on clarity for the criteria used. E.g., an entirely made-up reference vs one that has incorrect authors.

Dan Roy@roydanroy

There's a lot of controversy brewing around arXiv's decision to penalize authors who post unchecked AI generated content. The impulse is correct, IMO, simply on grounds of efficiency: it is much cheaper to insist the authors vet their work first, rather than distributing the cost of that work to EVERY reader/agent who subsequently downloads the work. I believe the mechanism is likely the wrong one, however. Unfortunately, suggestions to use github are even worse, IMO, because they lose the (effective) immutability of the scientific record, which arXiv upholds.

5:59 AM · May 15, 2026 · 20.1K Views

1:52 PM · May 15, 2026 · 1.7K Views

REPLY

#353Dylan HadfieldMenell@DHADFIELDMENELL

In an ideal world, I think we would have something like a jury system with publicly posted results to develop and communicate the standard.

Dylan HadfieldMenell@dhadfieldmenell

I agree with Dan, this is an important signal that arXiv is a serious choice and should be treated as such. I think execution success will depend on clarity for the criteria used. E.g., an entirely made-up reference vs one that has incorrect authors.

1:52 PM · May 15, 2026 · 1.7K Views

1:52 PM · May 15, 2026 · 389 Views

REPLY

#567Leo Boytsov@SRCHVRS

@tdietterich Don't you think that the requirement for a subsequent submission is way too strict? It's like a life-long sentence.

Thomas G. Dietterich@tdietterich

The penalty is a 1-year ban from arXiv followed by the requirement that subsequent arXiv submissions must first be accepted at a reputable peer-reviewed venue. 4/

7:03 PM · May 14, 2026 · 199.1K Views

10:07 PM · May 15, 2026 · 2.2K Views

QUOTE POST

#570Chenhao Tan@CHENHAOTAN

Would @arxiv be interested in more thorough checks beyond this?

Thomas G. Dietterich@tdietterich

The penalty is a 1-year ban from arXiv followed by the requirement that subsequent arXiv submissions must first be accepted at a reputable peer-reviewed venue. 4/

7:03 PM · May 14, 2026 · 199.1K Views

8:25 PM · May 14, 2026 · 548 Views

QUOTE POST

#687Bojan Tunguz@TUNGUZ

IMHO, the whole notion of an immutable and sacrosanct “paper” as THE main unit of scientific work output has been outdated by at least a few decades. The arrival of AI and knowledge-work-as-a-service is only making this fact more obvious, and in practical terms untenable going forward.

Luca Ambrogioni@LucaAmb

I am quite convinced that, under these arxive guidelines, every single major PI in the field will be banned within a few years

10:04 AM · May 15, 2026 · 23.7K Views

2:49 PM · May 15, 2026 · 4.1K Views

REPLY

#880Mengye Ren@MENGYER

@anshulkundaje Agreed. We need to change the mindset from punishing people to help people improve.

Anshul Kundaje@anshulkundaje

Wouldn't tagging papers that issues with incontrovertible evidence (like hallucinated refs) be a much easier solution than this weird 1 year ban with a "reputed peer review" requirement (for a preprint server?!??

1:10 AM · May 15, 2026 · 16.5K Views

1:50 AM · May 15, 2026 · 1K Views

ORIGINAL POST

#880Mengye Ren@MENGYER

If arXiv decides to do active gate checking on LLM slops, it also has the responsibility to release the stats on rejection rate and on hold delay, and explain the decision making process in more transparency.

12:04 AM · May 15, 2026 · 5.5K Views

ORIGINAL POST

#988Andrew White 🐦‍⬛@ANDREWWHITE01

hallucinated references will land you a 1-year ban from arxiv now. wow

7:51 PM · May 14, 2026 · 221.5K Views

QUOTE POST

#1050Tuhin Chakrabarty@TUHINCHAKR

Excellent 👏 Accountability is crucial for Slop

Thomas G. Dietterich@tdietterich

The penalty is a 1-year ban from arXiv followed by the requirement that subsequent arXiv submissions must first be accepted at a reputable peer-reviewed venue. 4/

7:03 PM · May 14, 2026 · 199.1K Views

8:00 PM · May 14, 2026 · 444 Views

REPLY

#1153Florian Brand@XEOPHON

@roydanroy It’s not on arXiv to decide which papers are good, this is what conferences are for

Dan Roy@roydanroy

There's a lot of controversy brewing around arXiv's decision to penalize authors who post unchecked AI generated content. The impulse is correct, IMO, simply on grounds of efficiency: it is much cheaper to insist the authors vet their work first, rather than distributing the cost of that work to EVERY reader/agent who subsequently downloads the work. I believe the mechanism is likely the wrong one, however. Unfortunately, suggestions to use github are even worse, IMO, because they lose the (effective) immutability of the scientific record, which arXiv upholds.

5:59 AM · May 15, 2026 · 20.1K Views

6:07 AM · May 15, 2026 · 1.5K Views

ORIGINAL POST

#1428Andreas Stuhlmüller@STUHLMUELLER

arXiv now has a one-year ban for hallucinated references

10:01 PM · May 14, 2026 · 320 Views

REPLY

#1446Shubhendu Trivedi@_ONIONESQUE

@roydanroy Surprised to see comments by professors here about gatekeeping. They seem to have forgotten that you couldn't submit to arXiv directly unless you had a .edu ID. If you were outside academia, you needed an endorsement to be able to submit. This has been the case for 15 years.

Dan Roy@roydanroy

There's a lot of controversy brewing around arXiv's decision to penalize authors who post unchecked AI generated content. The impulse is correct, IMO, simply on grounds of efficiency: it is much cheaper to insist the authors vet their work first, rather than distributing the cost of that work to EVERY reader/agent who subsequently downloads the work. I believe the mechanism is likely the wrong one, however. Unfortunately, suggestions to use github are even worse, IMO, because they lose the (effective) immutability of the scientific record, which arXiv upholds.

5:59 AM · May 15, 2026 · 20.1K Views

6:20 AM · May 15, 2026 · 1.7K Views

REPLY

#1446Shubhendu Trivedi@_ONIONESQUE

@DimitrisPapail I don't think these two things are comparable. Requiring that an author has read what they have submitted seems categorically different.

Dimitris Papailiopoulos@DimitrisPapail

Arxiv always had tons of slop, and was fine. Arbitrary gatekeeping mechanisms will only increase the chance that it withers away.

9:55 PM · May 14, 2026 · 9.9K Views

11:43 PM · May 14, 2026 · 517 Views

ORIGINAL POST

#1446Shubhendu Trivedi@_ONIONESQUE

I don't know what's going to happen to the concept of a paper (or at least its evaluation) 2-3 years from now. But the last time there was an epidemic of slop, you had vixra. No gatekeeping. But this time we might be destined for a web where everything becomes one massive viXra.

2:06 AM · May 15, 2026 · 2.3K Views

QUOTE POST

#1572Atoosa Kasirzadeh@DR_ATOOSA

Having hallucinated references in the submitted manuscript will land you a 1-year ban from arxiv.

Thomas G. Dietterich@tdietterich

Examples of incontrovertible evidence: hallucinated references, meta-comments from the LLM ("here is a 200 word summary; would you like me to make any changes?"; "the data in this table is illustrative, fill it in with the real numbers from your experiments") end/

7:03 PM · May 14, 2026 · 65.4K Views

9:05 PM · May 14, 2026 · 38 Views

REPLY

#1601Quentin Berthet@QBERTHET

@roydanroy Also what is the definition of a hallucinated reference?

Is the latex compiler accidentally putting two NeurIPS editors as authors an hallucination?

Is adding a reference in v2 following a reviewer request an hallucination if it doesn't exist?

Dan Roy@roydanroy

There's a lot of controversy brewing around arXiv's decision to penalize authors who post unchecked AI generated content. The impulse is correct, IMO, simply on grounds of efficiency: it is much cheaper to insist the authors vet their work first, rather than distributing the cost of that work to EVERY reader/agent who subsequently downloads the work. I believe the mechanism is likely the wrong one, however. Unfortunately, suggestions to use github are even worse, IMO, because they lose the (effective) immutability of the scientific record, which arXiv upholds.

5:59 AM · May 15, 2026 · 20.1K Views

10:39 AM · May 15, 2026 · 803 Views

QUOTE POST

#1674xlr8harder@XLR8HARDER

seems pretty intense for a mistake that could possibly happen to someone just reformatting text before a deadline (not the hallucinated references, but LLM meta-comments.)

Andrew White 🐦‍⬛@andrewwhite01

hallucinated references will land you a 1-year ban from arxiv now. wow

7:51 PM · May 14, 2026 · 221.5K Views

12:56 PM · May 15, 2026 · 3.4K Views

QUOTE POST

#1675Anshul Kundaje@ANSHULKUNDAJE

Wouldn't tagging papers that issues with incontrovertible evidence (like hallucinated refs) be a much easier solution than this weird 1 year ban with a "reputed peer review" requirement (for a preprint server?!??

Thomas G. Dietterich@tdietterich

Attention @arxiv authors: Our Code of Conduct states that by signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the contents were generated. 1/

7:03 PM · May 14, 2026 · 600.3K Views

1:10 AM · May 15, 2026 · 16.5K Views

REPLY

#1822Luca Ambrogioni@LUCAAMB

@TaliaRinger Is it? Then you should get a ban for misused references as well. Let's see who of us stays standing

Talia Ringer 🕊🪬@TaliaRinger

Good tbh

10:59 PM · May 14, 2026 · 2.5K Views

8:03 AM · May 15, 2026 · 71 Views

QUOTE POST

#1822Luca Ambrogioni@LUCAAMB

I understand that arxive has issues caused by blushing submissions but this is way too strict

Mistakes could slip in in papers long before AI and a single mistake slipping in isn't a sign that a paper is unchecked slop

You can have a great paper with a line of prompt left in the supplementary. Do you deserve a lifetime ban?

Thomas G. Dietterich@tdietterich

The penalty is a 1-year ban from arXiv followed by the requirement that subsequent arXiv submissions must first be accepted at a reputable peer-reviewed venue. 4/

7:03 PM · May 14, 2026 · 199.1K Views

7:44 AM · May 15, 2026 · 10K Views

REPLY

#1822Luca Ambrogioni@LUCAAMB

@roydanroy Not for submitting AI slops. Steep penalties first making any mistakes while using AI, even in a paper that is otherwise good

Complete nonsense

Dan Roy@roydanroy

Steep penalties for submitting AI slop to the arXiv.

8:06 PM · May 14, 2026 · 28.8K Views

7:52 AM · May 15, 2026 · 233 Views

QUOTE POST

#1822Luca Ambrogioni@LUCAAMB

Fully agree

4:47 PM · May 15, 2026 · 1.3K Views

ORIGINAL POST

#1822Luca Ambrogioni@LUCAAMB

I am quite convinced that, under these arxive guidelines, every single major PI in the field will be banned within a few years

10:04 AM · May 15, 2026 · 23.7K Views

REPLY

#1822Luca Ambrogioni@LUCAAMB

@littmath I agree with you, as stated right now it simply cannot work, or worse they will hand protect a few famous people

Daniel Litt@littmath

@LucaAmb My point is not that PIs won’t have AI-generated text in their papers. It’s that obviously the arxiv will not ban all PIs. So either the guidelines will be enforced selectively (or, more generously, flexibly), or they will change.

11:55 AM · May 15, 2026 · 568 Views

11:58 AM · May 15, 2026 · 468 Views

arXiv clarifies one-year ban for unchecked LLM submissions

Cluster engagement

Sentiment

Cluster engagement