/AI23h ago

Dimitris Papailiopoulos of Microsoft Research AI Frontiers proposes a companion writer model to document OpenAI Codex research sessions step-by-step

Pasquale Minervini suggested recovering session history from ~/.codex

43349158129.1K
Original post
Dimitris Papailiopoulos@DimitrisPapail#193inAI

Codex feature request:

I use Codex a lot for autoresearch. But in the end, I'd love to hear a story about how we arrived at the final solution. The model currently sucks at doing that.

Please add a companion writer model that is good at pulling everything together in a coherent story.

8:36 AM · Jun 7, 2026 · 23.8K Views
Sentiment

Users are enthusiastic about a storyteller companion model for OpenAI Codex because it can document reasoning journeys, turn session logs into narratives, and improve explainability in agentic workflows.

Pos
100.0%
Neg
0.0%
10 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS3.4KBOOKMARKS17LIKES68RETWEETS2REPLIES8

@DimitrisPapail Good idea, although I’m sure you can implement this with a companion-writer skill and a sub agent that after each step invokes the skill to document things as you want

Happy to vibe something up for you and see if that doesn’t solve your use case

Codex feature request:

I use Codex a lot for autoresearch. But in the end, I'd love to hear a story about how we arrived at the final solution. The model currently sucks at doing that.

Please add a companion writer model that is good at pulling everything together in a coherent story.

21hViews 3.4KLikes 68Bookmarks 17

@reach_vb maybe but the point is many people, including myself, are lazy, and this is reflected as bad user experience. Probably an easier fix from your side :)

Also please add timestamps on messages :D

@DimitrisPapail Good idea, although I’m sure you can implement this with a companion-writer skill and a sub agent that after each step invokes the skill to document things as you want

Happy to vibe something up for you and see if that doesn’t solve your use case

21hViews 1.1KLikes 15Bookmarks 0

@reach_vb and yes if you have a quikc fix would love to try it out.

21hViews 189Likes 3
Alok Bishoyi@alokbishoyi97

@DimitrisPapail hey dimitis, would you be open to trying a different autoresearch orchestrator ? it keeps a track of all experiemnts tried / hypothesis / specific diffs etc . is compatible with codex as well off the shelf.

would love your take and feedback! thanks http://github.com/evo-hq/evo

23hViews 162Likes 2Bookmarks 1

@DimitrisPapail codex is on it! if this works, we can ship this as an official plugin :)

20hViews 130Likes 6
Bukut@bukut909

@DimitrisPapail You could try developing a program using Codex that allows the model to progressively organize the story within a context window segment, while retaining a spare window for the model with accurate memory of the current segment to absorb information from the next segment

21hViews 121

also another thing that's been bothering me and may be how i've setup the app or something: a git subprocess is always eating 100% of cpu, even when i work with remote gpus. I may have done something idiotic, but it's a codex app only issue (eg i dont have it with claude code cli etc). Any ideas on what this would be?

20hViews 118Likes 3

@DimitrisPapail you can just recover the sessions from e.g. ~/.codex and reconstruct a story, no?

Codex feature request:

I use Codex a lot for autoresearch. But in the end, I'd love to hear a story about how we arrived at the final solution. The model currently sucks at doing that.

Please add a companion writer model that is good at pulling everything together in a coherent story.

22hViews 994Likes 1Bookmarks 0

@DimitrisPapail Curious if you can say more about your autoresearch harness. This got a lot of publicity when Karparthy suggested it, but haven’t seen the community move to a clear set of best practices around it (memory, logging, orchestration).

20hViews 329
Sreeram Kannan@sreeramkannan

@DimitrisPapail Yeah we have a great case study.

Open agentic science achieved more than a 13x improvement in a Quantum algorithm. But its really hard to pull through what the models actually did.

There is a lot of data to study here on frontier scientific research.

13hViews 60Likes 1
Jonathan Chang@ChangJonathanC

@DimitrisPapail Curious what prompts did you use? I recently found that if you ask it to narrate what happened it can do it quite well

21hViews 147

@DimitrisPapail "Commit everything with a detailed writeup of what you changed, what you expected, and what you actually found" and then "review the commits and tell me what moved the needle the most" works pretty well

22hViews 32Likes 1
آلیوشا@__Aliusha__

@DimitrisPapail For some sessions, I have Codex turn raw session logs into a narrative, then generate graphics, audio, or even a video from it (using local tts and simple graphs). It’s worked surprisingly well. And now with imagegen tool, I can see this becoming an even better pipeline.

23hViews 77
apple@iWatch_AAPL

@reach_vb @DimitrisPapail Why not show thinking summaries in codex app

20hViews 44
MrFridayy@MrFridayyCS

@reach_vb @DimitrisPapail Created a skill for exactly the case.

20hViews 40
Aradhye Agarwal@AradhyeAgarwal

@DimitrisPapail I mean you can just have a line in AGENTS.md to ask it to read the full trajectory on the disk and summarise, right?

22hViews 30

@DimitrisPapail If you enable the memory feature you will find a good summary of your session in the .codex directory

21hViews 28
Tessa Archer@scifi_tessa

@DimitrisPapail Hit this same wall. Model finds the answer but can't narrate how it got there. Companion writer makes sense - search and storytelling need different capabilities.

22hViews 23
Frosty40@FrostForger

@DimitrisPapail @reach_vb thats gonna be spendy on the tokens for the context, even tying the chain together, better to have a subagent not your best brain put that mess together, and tracking logic for a24 hours run is gonna be some context...

21hViews 21
Load more posts