3h ago

Victor Taelin, Bend language creator, claims GPT models trail Mythos on compiler and proof language tasks

Jason (@jxnlco) replied by asking whether there is something evals could be run in to target the performance gaps.

Original post

@jxnlco @BjarturTomas I just wish GPT catches up with Mythos. It isn't quite there yet, there is a LOT to improve, and some stuff are taking longer to improve than I wish. If anything, I'd just like to help fix its bad outputs on compiler / proof lang work more directly than just waiting and hoping

9:56 PM · May 23, 2026

@VictorTaelin @BjarturTomas Is there something we could run evals in

4:58 AM · May 24, 2026 · 676 Views

@jxnlco @BjarturTomas Yes!! But I'd need you to tell me what kind of format would work best. I have only a rough idea, but there's a lot I could provide. For example, would a massive dataset of mined theorem/proof pairs work? How granular steps should be? Would making it interactive help? Etc.

5:05 AM · May 24, 2026 · 659 Views