GPU Forecasters uses LLMs as selective surrogates to predict GPU kernel runtimes, achieving near-optimal performance with 90% fewer hardware runs · Digg