3h ago

Aaron Roth flags phantom NeurIPS 2026 citation for 'Why Do Multi-Agent LLM Systems Fail?' that appeared in OpenReview and Google Scholar BibTeX exports

Paper presents MAST-Data of 1,600 multi-agent traces and 14-mode failure taxonomy.

0
Original post

A clearly hallucinated citation! NeurIPS 2026 decisions aren't out yet. But wait --- the hallucination is also present in the bibtex entries from openreview https://openreview.net/forum?id=fAjbYBmonr and Google Scholar https://scholar.googleusercontent.com/scholar.bib?q=info:E9WlfpJ70l0J:scholar.google.com/&output=citation&scisdr=CljIHctUENiogcez9JA:AFyMTJUAAAAAag217JCmKMR6XCw1ojwmwzHCQqA&scisig=AFyMTJUAAAAAag217DvWNBtTL4On8NPG9y8LQII&scisf=4&ct=citation&cd=-1&hl=en

6:21 AM · May 20, 2026 View on X

@Aaroth At first I thought this was a regular NeurIPS error (they have tended up to mess up dates on their proceedings, and also the volume number), and then saw the date. This happens with Openreview for in review papers quite frequently.

Aaron RothAaron Roth@Aaroth

A clearly hallucinated citation! NeurIPS 2026 decisions aren't out yet. But wait --- the hallucination is also present in the bibtex entries from openreview https://openreview.net/forum?id=fAjbYBmonr and Google Scholar https://scholar.googleusercontent.com/scholar.bib?q=info:E9WlfpJ70l0J:scholar.google.com/&output=citation&scisdr=CljIHctUENiogcez9JA:AFyMTJUAAAAAag217JCmKMR6XCw1ojwmwzHCQqA&scisig=AFyMTJUAAAAAag217DvWNBtTL4On8NPG9y8LQII&scisf=4&ct=citation&cd=-1&hl=en

1:21 PM · May 20, 2026 · 10K Views
2:13 PM · May 20, 2026 · 939 Views
Aaron Roth flags phantom NeurIPS 2026 citation for 'Why Do Multi-Agent LLM Systems Fail?' that appeared in OpenReview and Google Scholar BibTeX exports · Digg