Providers
| Provider | Notes |
|---|---|
| Anthropic | Claude family models |
| OpenAI | GPT and reasoning (o-series) models |
| Gemini family models | |
| OpenRouter | A single key for 100+ models across providers — see OpenRouter |
| Local | On-device models via LM Studio, Ollama, or any OpenAI-compatible server — see Local Models |
Because the catalog is dynamic, the exact model names you see in Chat Models reflect what’s currently offered for your plan. Use the picker in the app as the source of truth.
Reasoning Effort
For models that support it, you can set how much the model “thinks” before responding. Higher effort trades speed for depth.| Effort | Description |
|---|---|
| Minimal | Lightest reasoning, fastest response |
| Low | Light reasoning |
| Medium | Balanced default |
| High | More deliberate reasoning |
| Extra High | Deepest reasoning |
| Plan | Reasoning levels |
|---|---|
| Free | Low, Medium |
| Pro | Minimal, Low, Medium, High |
| Lite / Elite | All five (Minimal → Extra High) |
Choosing a Model
Quick edits and chat
Quick edits and chat
A fast model at Low or Medium effort — ideal for simple questions, small edits, and exploration.
Complex refactoring
Complex refactoring
A strong coding model at Medium or High effort — better at large codebases and multi-file changes.
Deep analysis
Deep analysis
A frontier model at High or Extra High effort — best for architecture decisions and intricate debugging.
Maximum privacy
Maximum privacy
A Local model — all inference runs on your Mac with no network calls. See Local Models.