I've been getting a TON done with Fable today and I'm not hitting rate limits. Wanted to share some tips on how I'm doing that:
1. I only use Fable on "high" effort for now. xhigh is token hungry. max/extra is a furnace with worse outputs than lower options imo
2. I taught Claude Code how to use Codex as a fallback for lots of implementation tasks. GPT-5.5 is incredibly steerable, and Fable can learn how to steer it
3. I wrote up a big section in my CLAUDE.md on how to prioritize different models for different work when orchestrating workflows and subagents
4. Things that are unnecessarily token hungry (computer use, codebase analysis, etc.) I do with other models, then report the results back to Fable
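
One way to picture the routing idea in tips 2-4, as a rough Python sketch. Everything named here is made up for illustration: the labels (`fable-high`, `codex`, `other-model`), the task categories, and the `pick_model` helper are placeholders, not real model IDs, CLI flags, or anything taken from the actual CLAUDE.md.

```python
# Hypothetical routing table: task type -> models in priority order.
# All labels are illustrative placeholders, not real model identifiers.
ROUTING = {
    "planning": ["fable-high"],
    "implementation": ["fable-high", "codex"],  # codex as the fallback
    "computer_use": ["other-model"],            # token-hungry work goes elsewhere
    "codebase_analysis": ["other-model"],
}

DEFAULT = ["fable-high"]


def pick_model(task_type, unavailable=frozenset()):
    """Return the first preferred model that is not unavailable, else None."""
    for model in ROUTING.get(task_type, DEFAULT):
        if model not in unavailable:
            return model
    return None


if __name__ == "__main__":
    # Normal case: implementation stays on the primary model.
    print(pick_model("implementation"))
    # Primary is rate limited: fall through to the next entry.
    print(pick_model("implementation", unavailable={"fable-high"}))
```

The point of keeping it as an ordered list per task type is that a fallback is just "the next entry", so changing priorities means editing one table instead of rewriting instructions.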











