Available models
Pass any of these ids to -m, --model, or pick them in-session with /model. Run cmd --list-models for the live, copy-pasteable list; that command is the source of truth if this table ever drifts. The default model currently is moonshotai/Kimi-K2.5.
--model matching is case-insensitive and accepts either the full id or just the name after the /, so moonshotai/Kimi-K2.5, moonshotai/kimi-k2.5, and kimi-k2.5 all resolve to the same model. An unknown id is rejected up front (pointing you at cmd --list-models) rather than failing later with a cryptic provider error. Each model id below links to its profile page; models that don't have a dedicated page yet link to the models overview.
| Model id | Name | Best for |
|---|---|---|
claude-sonnet-4-6 | Claude Sonnet 4.6 | best combo of speed & intelligence (recommended) |
claude-opus-4-7 | Claude Opus 4.7 | most intelligent for agents and coding |
claude-opus-4-6 | Claude Opus 4.6 | most capable for complex work |
claude-haiku-4-5-20251001 | Claude Haiku 4.5 | fastest & most compact, great for quick tasks |
| Model id | Name | Best for |
|---|---|---|
gpt-5.5 | GPT-5.5 | latest frontier model for general complex work |
gpt-5.4 | GPT-5.4 | frontier model for general complex work |
gpt-5.3-codex | GPT-5.3 Codex | frontier coding model |
gpt-5.4-mini | GPT-5.4 Mini | fast, cost-effective model for everyday tasks |
| Model id | Name | Best for |
|---|---|---|
google/gemini-3.5-flash | Gemini 3.5 Flash | Pro-level coding proficiency, parallel agentic execution |
google/gemini-3.1-flash-lite | Gemini 3.1 Flash Lite | high-volume workhorse model with implicit caching |
| Model id | Name | Best for |
|---|---|---|
moonshotai/Kimi-K2.6 | Kimi K2.6 | long-horizon coding with vision |
moonshotai/Kimi-K2.5 (default) | Kimi K2.5 | multimodal frontend coding |
zai-org/GLM-5.1 | GLM-5.1 | long-horizon autonomous coding agent |
zai-org/GLM-5 | GLM-5 | multi-mode thinking & long-range planning |
MiniMaxAI/MiniMax-M2.7 | MiniMax M2.7 | end-to-end software engineering agent |
MiniMaxAI/MiniMax-M2.5 | MiniMax M2.5 | cross-platform full-stack agentic dev |
deepseek/deepseek-v4-pro | DeepSeek V4 Pro | hybrid-attention long-context reasoning |
deepseek/deepseek-v4-flash | DeepSeek V4 Flash | fast hybrid-attention reasoning |
Qwen/Qwen3.6-Max-Preview | Qwen 3.6 Max Preview | vibe coding & efficient agent execution |
Qwen/Qwen3.6-Plus | Qwen 3.6 Plus | agentic coding & reasoning |
Qwen/Qwen3.7-Max | Qwen 3.7 Max | frontier coding & long-horizon agent execution |
stepfun/Step-3.5-Flash | Step 3.5 Flash | fast sparse-MoE agentic reasoning |
- CLI Reference — all flags, slash commands, and subcommands
- Interactive mode — switching models in-session with
/model