17h ago

AVERI Executive Director Miles Brundage warns of overconfidence in the Pangram system in a brief, no-detail tease

The post provides no technical benchmarks or performance data.

0
Original post

Many of you are vastly overconfident in Pangram

11:09 PM · May 25, 2026 View on X
Reposted by

@TuhinChakr I don't think that situation was caused by humility re: Pangram, but by people who didn't even bother running it and hadn't spent much time using AI at all in order to notice the warning signs (also see my replies elsewhere re: my more detailed thoughts)

Tuhin ChakrabartyTuhin Chakrabarty@TuhinChakr

@Miles_Brundage Can you empirically show why ? I feel this kind of rhetoric doesn’t help and leads to embarrassing situations like the AI short story awarded the commonwealth prize

7:23 PM · May 26, 2026 · 164 Views
7:25 PM · May 26, 2026 · 144 Views

@TuhinChakr You are not really the target audience here, if you already know there are limitations / that it is suggestive. Not everyone does

Tuhin ChakrabartyTuhin Chakrabarty@TuhinChakr

@Miles_Brundage I found your replies unsatisfying. Watermarking has lots of limitations plus enforcing it across all models is a policy problem. I think definitive or suggestive is an interesting angle. FWIW if you don't use AI to write Pangram wont flag. If you do on the contrary idk

7:28 PM · May 26, 2026 · 50 Views
7:46 PM · May 26, 2026 · 37 Views

@Miles_Brundage Can you empirically show why ? I feel this kind of rhetoric doesn’t help and leads to embarrassing situations like the AI short story awarded the commonwealth prize

Miles BrundageMiles Brundage@Miles_Brundage

Many of you are vastly overconfident in Pangram

6:09 AM · May 26, 2026 · 32.1K Views
7:23 PM · May 26, 2026 · 164 Views

@Miles_Brundage I found your replies unsatisfying. Watermarking has lots of limitations plus enforcing it across all models is a policy problem. I think definitive or suggestive is an interesting angle. FWIW if you don't use AI to write Pangram wont flag. If you do on the contrary idk

Miles BrundageMiles Brundage@Miles_Brundage

@TuhinChakr I don't think that situation was caused by humility re: Pangram, but by people who didn't even bother running it and hadn't spent much time using AI at all in order to notice the warning signs (also see my replies elsewhere re: my more detailed thoughts)

7:25 PM · May 26, 2026 · 144 Views
7:28 PM · May 26, 2026 · 50 Views