Evals Expert Florian Brand Joins Live Session On LLM Benchmarking

Original post

Yacine Mahdid@yacinelearning

alright folks we're talking to big eval master @xeophon in 3h tune in to ask sensible questions like:

- how to run your own evals without going insane?
- are evals and environments kind of the same??
- why most benchmarks are janked???
- why LLM cheats?????

Yacine Mahdid@yacinelearning

to kick off our big boss frontier research series we have the evals master florian joining us this friday from 12:00-14:00 to talk about LLM benchmarking

send your questions by comments, dm, fax, ping alexine, text or any other means and I'll weave your questions right in

6:29 AM · Jun 5, 2026 · 2.1K Views

Sentiment

Users are celebrating LLM benchmarking expert Florian Brand's live discussion on agent evals as a win.

Positive: 100.0%

Negative: 0.0%

1 comment with sentiment.

Posts from X

Most Activity

Views: 114 · Likes: 3

Yacine Mahdid@yacinelearning

whyyyyyy??????

1h ago
