22d ago

Researcher Teases Five Unexpected Projects in the Works

Original post

Being this excited about five rather unexpected research projects simultaneously is almost too painful. Assuming that we figure out how to sequence these releases, y’all are going to thoroughly love each of these.

5:57 PM · Apr 24, 2026

Reposted by

#171 @LATEINTERACTION

ORIGINAL POST

#171 Omar Khattab @LATEINTERACTION

Being this excited about five rather unexpected research projects simultaneously is almost too painful.

Assuming that we figure out how to sequence these releases, y’all are going to thoroughly love each of these.

12:57 AM · Apr 25, 2026 · 18.6K Views

QUOTE POST

#171 Omar Khattab @LATEINTERACTION

alright, two of the five are out since April 24 :D

funnily enough, i'm still somehow excited about five (3+2) again currently. y'all will find them really really nice.

Omar Khattab @lateinteraction · 12:57 AM · Apr 25, 2026 · 18.6K Views

1:40 AM · May 15, 2026 · 6.8K Views

QUOTE POST

#171 Omar Khattab @LATEINTERACTION

from the ones released recently, read #2 Pedagogical RL at:

Souradip Chakraborty @SOURADIPCHAKR18

🚨Typical RL algorithms and on-policy distillation methods are blind samplers: they use privileged info to score rollouts, but not to *find* them. We ask: can we use privileged info to *actively sample* the rollouts RL wishes it can stumble upon with compute? ⤵️ Pedagogical RL

10:46 PM · May 14, 2026 · 81.9K Views

1:43 AM · May 15, 2026 · 1.3K Views

QUOTE POST

#171 Omar Khattab @LATEINTERACTION

from the ones released recently, read #1 OBLIQ-Bench at:

Diane @dianetc_

We set out to build a better retriever, so we looked for the hardest IR benchmarks. For each, we asked how much headroom remained by running oracle reranking with a frontier LLM. Most had little room left! So we built OBLIQ-Bench to study much harder search queries than before.

3:52 PM · May 6, 2026 · 75.2K Views

1:43 AM · May 15, 2026 · 1.5K Views