2h ago

Hugging Face co-founder Julien Chaumond releases SynthTraces to generate synthetic coding agent session traces

3 top authors

It uses llama.cpp to simulate human-to-agent interactions.

Original post

Julien Chaumond@julien_c

Today I'm launching a new project called SynthTraces 🔥 It is a minimal codebase to generate synthetic coding agent session traces using Pi (from @badlogicgames) I wanted a large number of coding-agent traces, so I built a tiny harness where two models talk to each other: - an open model (served via HF Inference Providers) plays the coding agent. It gets read + bash access to a real open source codebase (the huggingface OSS projects) - a small local model (llama.cpp) plays the human user, asking simple questions like "how do I run this?" or "how is CI set up?" The result is more than 2,000 Pi session traces which can be used to train or fine-tune LLMs, and optimize them for Pi 🤯 And ofc everything is published on @huggingface ✅

6:18 AM · Jun 4, 2026

Sentiment

Pos100%

Neg0%

Users praise SynthTraces for generating over 2,000 synthetic coding agent traces because it supplies high-quality data that lets models learn real coding without hallucinations.

10 comments with sentiment.

2 more posts

Retweeted by clem 🤗·2hView on

Suzana Ilić@suzatweet·1hReply
@julien_c @huggingface @badlogicgames Very cool!
View on