Aaron Roth flags phantom NeurIPS 2026 citation for 'Why Do Multi-Agent LLM Systems Fail?' that appeared in OpenReview and Google Scholar BibTeX exports
The paper presents MAST-Data, a dataset of 1,600 multi-agent traces, and a 14-mode failure taxonomy.
@Aaroth At first I thought this was a regular NeurIPS error (they have tended to mess up the dates on their proceedings, and also the volume number), and then I saw the date. This happens quite frequently with OpenReview for in-review papers.
A clearly hallucinated citation! NeurIPS 2026 decisions aren't out yet. But wait: the hallucination is also present in the BibTeX entries from OpenReview https://openreview.net/forum?id=fAjbYBmonr and Google Scholar https://scholar.googleusercontent.com/scholar.bib?q=info:E9WlfpJ70l0J:scholar.google.com/&output=citation&scisdr=CljIHctUENiogcez9JA:AFyMTJUAAAAAag217JCmKMR6XCw1ojwmwzHCQqA&scisig=AFyMTJUAAAAAag217DvWNBtTL4On8NPG9y8LQII&scisf=4&ct=citation&cd=-1&hl=en