Thomas G. Dietterich reminds arXiv authors of content responsibility
Thomas G. Dietterich highlighted arXiv's Code of Conduct on X. He stated that every listed author bears full responsibility for all paper contents no matter how they were generated. Andrew White replied by asking whether a non-corresponding author would face a one-year ban for submitting work containing hallucinated references. The exchange centers on enforcement of existing author accountability rules for arXiv submissions.
@tdietterich @ziv_ravid @arxiv @AIXIV Yes, I think that makes perfect sense.
@ziv_ravid @arxiv is (currently) a forum for human authors to freely distribute their work. Maybe autonomous agents will publish their work on @AIXIV instead? What do you think the future should be?
@tdietterich @arxiv Very good, please strongly enforce it.
Attention @arxiv authors: Our Code of Conduct states that by signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the contents were generated. 1/
Attention @arxiv authors: Our Code of Conduct states that by signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the contents were generated. 1/
@ziv_ravid Ravid, I agree that we need to think carefully about a world in which AI systems produce scientific knowledge autonomously. In that world, will no human being (or legal person) be responsible for the results? Who will own the IP? Will the process be self-correcting?
We must think about how to handle the fact that LLMs can generate papers without any human intervention, but this is not the way. Also, I want to see the arXiv ban senior PIs who upload 40 papers a year.
@ziv_ravid @arxiv is (currently) a forum for human authors to freely distribute their work. Maybe autonomous agents will publish their work on @AIXIV instead? What do you think the future should be?
@ziv_ravid Ravid, I agree that we need to think carefully about a world in which AI systems produce scientific knowledge autonomously. In that world, will no human being (or legal person) be responsible for the results? Who will own the IP? Will the process be self-correcting?
@andrewwhite01 @tdietterich @arxiv each author takes full responsibility.
@tdietterich @arxiv Will a non-corresponding author get a 1 year ban if there are hallucinated references in an arxiv submission?
Making humans responsible for their AI use seems like an incredibly reasonable way to address problems & opportunities in the use of AI for academic research, at least in the short term (autonomous scientific work will require different solutions).
Attention @arxiv authors: Our Code of Conduct states that by signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the contents were generated. 1/
arXiv is taking a stand on **responsibility+ownership** in science -- one doesn't go without the other
👏👏👏
Attention @arxiv authors: Our Code of Conduct states that by signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the contents were generated. 1/
@ChenhaoTan I am not sure it doesn't incentivize quality. I do understand it doesn't scale, but the community scaled NeurIPS+ICML+ICLR to > 60k submissions a year. Did quality follow? We are writing (ehm... generating) too much
@yoavartzi I understand the intention, but this particular version does not scale and does not really incentivize quality. It will simply encourage more superficial checks and then disputes around borderline conditions.
@tallinzen @ziv_ravid Everyone is worried about scaling, as if building a mountain of PDFs that no one will ever look at is the pinnacle of doing research ¯\_(ツ)_/¯
@ziv_ravid Honestly if you can't be bothered to read the 40 papers a year you upload to arXiv, maybe it's fine if you're banned!
@ziv_ravid Honestly if you can't be bothered to read the 40 papers a year you upload to arXiv, maybe it's fine if you're banned!
We must think about how to handle the fact that LLMs can generate papers without any human intervention, but this is not the way. Also, I want to see the arXiv ban senior PIs who upload 40 papers a year.
@ziv_ravid (I do think the specifics of the ban are a little more draconian than I would go for but I think the spirit is 100% the right one)
@ziv_ravid Honestly if you can't be bothered to read the 40 papers a year you upload to arXiv, maybe it's fine if you're banned!
This is definitely well-intentioned.
But it also reminds me of the ACL anonymity policy, which ended up creating a lot of friction for many legitimate researchers.
If arXiv starts enforcing this, will it require even stricter screening with the already limited amount of human review capacity?
False positives are already a frustrating issue on arXiv: good papers get put on hold without clear explanations. I worry this policy could lead to even more unnecessary holds and longer review delays.
In the end, it feels like the vast majority of researchers may end up paying the price for the actions of a very small minority.
Attention @arxiv authors: Our Code of Conduct states that by signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the contents were generated. 1/
I understand why they are doing this, but IMHO this again highlights the extremely centralized and *PERMISSIONFUL* nature of scientific work. Even at the "open" publication venues.
Attention @arxiv authors: Our Code of Conduct states that by signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the contents were generated. 1/
Just post your research onto GitHub bro.
Found myself posting papers to GitHub instead of arXiv lately. No gatekeeping, is in the same repo as the code, one link for everything, and gets uploaded immediately. Makes you wonder what arXiv's actual value is.
We must think about how to handle the fact that LLMs can generate papers without any human intervention, but this is not the way. Also, I want to see the arXiv ban senior PIs who upload 40 papers a year.
Attention @arxiv authors: Our Code of Conduct states that by signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the contents were generated. 1/
A few more thoughts: I agree with the concern that Arxiv will become full of millions of AI-slop papers written by agents. **But** this is not the answer. Not only because banning an author for life is not fair (it might that you write only some sections and ofcourse that you can't check all the references one by one), but also because this is not the real problem. AI is writing our papers at this point, and the only question is at what level. If I ask the model to rephrase a sentence and it keeps, " Sure, I will do it for you", it doesn't mean anything about the quality of the paper. The same as we didn't punish papers with grammar errors in the past. Moreover, I can create entire fake papers that will pass this bar from one prompt, and soon, we will have systems that will catch these mistakes automatically. At the end, we need to come up with a semi-automatic system that will check for correctness without the question of whether it was generated by AI (the question of whether we want humans to be responsible for each paper is a different story)
We must think about how to handle the fact that LLMs can generate papers without any human intervention, but this is not the way. Also, I want to see the arXiv ban senior PIs who upload 40 papers a year.
@tdietterich @arxiv Why are meta comments from the LLM such a bad thing? What is the difference with human comments?
Attention @arxiv authors: Our Code of Conduct states that by signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the contents were generated. 1/
@yoavartzi I understand the intention, but this particular version does not scale and does not really incentivize quality. It will simply encourage more superficial checks and then disputes around borderline conditions.
arXiv is taking a stand on **responsibility+ownership** in science -- one doesn't go without the other 👏👏👏
Reputation based mechanisms for accountability seems to be a likely direction esp for agentic use cases.
Making humans responsible for their AI use seems like an incredibly reasonable way to address problems & opportunities in the use of AI for academic research, at least in the short term (autonomous scientific work will require different solutions).
@tdietterich @arxiv Will a non-corresponding author get a 1 year ban if there are hallucinated references in an arxiv submission?
Attention @arxiv authors: Our Code of Conduct states that by signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the contents were generated. 1/
the thread
Attention @arxiv authors: Our Code of Conduct states that by signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the contents were generated. 1/
This was posted 15 minutes after I uploaded my latest paper to arxiv.
Then I spent the rest of the evening reminding myself that correlation does not imply causation :-D
Attention @arxiv authors: Our Code of Conduct states that by signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the contents were generated. 1/
A neat example of updating our professional norms in the face of new technology. Bold, necessary move by arXiv to push back against the flurry of AI slop content published on its repository.
Attention @arxiv authors: Our Code of Conduct states that by signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the contents were generated. 1/
A neat example of updating our norms in the face of new technology. Bold, necessary move by arXiv to push back against the flurry of AI slop content published on arxiv repository.
Attention @arxiv authors: Our Code of Conduct states that by signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the contents were generated. 1/