/Tech2h ago

Deepfates Challenges Dismissal of Eval Awareness in Language Models

1900497

Original post

🎭@deepfates#1015inTech

@xeophon this would be pretty surprising, considering everything else we know about language models. would you like to try to back this up with a real claim?

Florian Brand@xeophon

I don’t think eval awareness is a real thing // is as much of a problem as people make it out to be

10:07 AM · Jul 4, 2026 · 447 Views

Sentiment

Sentiment building, check back later.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS50REPLIES1

🎭@deepfates

@xeophon @jdchawla29 Oh, if this is the motte then you're really just claiming a lack of evidence in which case why don't you do an experiment improve eval awareness is jumped up. because everybody will be really excited to find this out

Florian Brand@xeophon

@jdchawla29 I’m not doubting that models know what evals, grading etc. is, as its knowledge from the training data

I’m saying that the phenomenon is overstated and its impact is not really assessed // ppl should do more realistic setups

2h5000

Florian Brand@xeophon

@deepfates @jdchawla29 Want to work on reward hacking on the near future; will comb through the rollouts then to see whether models specifically mention eval settings and if so, how those rollouts differ from others

2h1