So much alpha in tuning/building LLM verifiers and judges.
I use them on top of my harness, and it has unlocked agentic coding workflows that are beyond anything that exists in the market today.
Building verifiers and LLM judges is starting to become a skill in high demand.
Bridgewater used their unique financial knowledge and partnered with us on @tinkerapi to fine-tune a model that helps their analysts focus on what's important. Experts improving AI that empowers experts. https://thinkingmachines.ai/news/learning-to-replicate-expert-judgment-in-financial-tasks/


