evals are environments
difficultyang@difficultyang
I used to think of evals as "here is how you can see how good a model is" but i think of evals now more as "there is something you want to teach the model, and this is the test to figure out if the model actually learned it". Yes, similar, but I think the latent matters.
11:36 PM · Jun 23, 2026 · 2.2K Views