What models do you use?
Codebuff uses different models for different tasks. The orchestrator coordinates; subagents handle specific jobs.
Orchestrator
The main agent ("Buffy") coordinates everything:
| Mode | Model |
|---|---|
| Default | Claude Opus 4.5 |
| Max | Claude Opus 4.5 |
| Lite | Grok 4.1 Fast |
Subagents
The orchestrator spawns these for specific jobs:
| Task | Models |
|---|---|
| Code editing | Claude Opus 4.5, GPT-5.1 |
| Thinking/reasoning | Claude Opus 4.5, GPT-5.1, Gemini 2.5 Pro |
| Code review | Claude Opus 4.5, Claude Sonnet 4.5, GPT-5.1 |
| File discovery | Gemini 2.0 Flash, Grok 4 Fast |
| Terminal commands | Grok 4 Fast, Claude Sonnet 4.5 |
| Context management | GPT-5 Mini |
| Web/docs research | Grok 4 Fast |
Max mode runs multiple implementations in parallel and picks the best one. Default mode runs a single implementation pass. Lite mode skips validation steps for speed.
File rewrites use speculative decoding from Relace AI.