Models

DeepSeek V4 Flash

TL;DR

Open-source

Fast, cost-efficient open-source reasoning at scale.

Available on

GoProMaxUltraTeamsEnterprise

Routed across multiple upstream providers; mean per-provider price.

Switch with

/model

Pick DeepSeek V4 Flash from the selector.

Intelligence Index

V4 Flash vs siblings

47

DeepSeek V4 Pro

52

GLM-5

50

DeepSeek V4 Flash

47

Speed

~82

tokens / sec

Input

$0.14

per M tokens

Output

$0.28

per M tokens

DeepSeek V4 Flash in Command Code

DeepSeek V4 Flash is the fast, cost-efficient sibling of V4 Pro — the cheapest model in the Command Code lineup. The right pick for high-volume agent loops where per-task budget dominates.

V4 Flash vs the Command Code lineup

Quality, speed, and pricing for V4 Flash alongside the cheapest peers.

ModelIntelligenceSpeedInput $/MOutput $/M
DeepSeek V4 Flash47~82 tok/s$0.14$0.28
MiniMax M2.5Not yet scored$0.27$0.95
Kimi K2.537~35 tok/s$0.60$3.00
GPT-5.4 Mini49~164 tok/s$0.75$4.50
Claude Haiku 4.537~97 tok/s$1.00$5.00
DeepSeek V4 Pro52~35 tok/s$1.74$3.48

What V4 Flash is best for

High-volume agent runs, narrow reasoning at scale, lint/format sweeps, and any task where per-task cost is the dominant constraint.

When to switch away from V4 Flash

Switch to DeepSeek V4 Pro

For long-context hybrid-attention reasoning. Higher Intelligence (52) at ~12× the input price.

Switch to Kimi K2.6

For the top open-weights Intelligence Index (54).

Switch to GPT-5.4 Mini

For premium ergonomics on the OpenAI side. Higher Intelligence (49) and much faster (~164 tok/s).

In Command Code: caching and taste-1

Open-source models are routed across multiple upstream providers for high availability. The price you see is the mean per-provider rate; the Usage page reflects what was actually charged.

Where supported by the upstream, prompt caching is on by default — cache reads are billed at $0.01 per million tokens versus $0.14 for fresh input.

taste-1 sits between the model and the agent loop, rewriting and reranking candidate edits to match your codebase conventions. Each plan ships with a taste-1 usage allowance that scales by tier (Go $100 → Ultra $10,000).

Plan availability

Open-source model. Available on every plan, including Go ($1/mo). Routed across multiple upstream providers; listed price is the mean per-provider rate.

All Command Code models, ranked by quality and speed

Quality is the Intelligence Index — an aggregate score across reasoning, math, coding, and knowledge evaluations. Speed is reported output tokens per second. Models without a published score are noted.

ModelTierIntelligence IndexOutput speed
GPT-5.5Premium60~65 tok/s
Claude Opus 4.7Premium57~49 tok/s
GPT-5.4Premium57~84 tok/s
GPT-5.3 CodexPremium54~72 tok/s
Kimi K2.6Open-source54~40 tok/s
Claude Sonnet 4.6Premium52~62 tok/s
DeepSeek V4 ProOpen-source52~35 tok/s
GLM-5Open-source50~61 tok/s
GPT-5.4 MiniPremium49~164 tok/s
DeepSeek V4 FlashOpen-source47~82 tok/s
Claude Haiku 4.5Premium37~97 tok/s
Kimi K2.5Open-source37~35 tok/s
Claude Opus 4.6PremiumNot yet scored
MiniMax M2.5Open-sourceNot yet scored

Switching models with /model

In an interactive Command Code session, run /model to open the model selector. Pick the model you want and it applies to this session and to future sessions until you change it again. Premium models require Pro or higher; open-source models are available on every plan, including Go.

cmd               # start an interactive session
/model            # open the selector and pick a model

Plans and pricing

Command Code is a subscription with model usage at API rates. Each plan ships with monthly LLM credits and a separate taste-1 usage allowance that scales by tier. Credits roll over and never expire. Auto top-up keeps you running if you go over.

PlanPrice/moLLM creditstaste-1 usageModels
Go$1$10$100Open-source only
Pro$15$30$500Open-source + premium
Max$100$150$5,000Open-source + premium
Ultra$200$300$10,000Open-source + premium
Teams$40 / seatPooled$1,000Open-source + premium
EnterpriseCustomCustomCustomCustom pool, SSO, audit logs

Frequently asked questions

Is V4 Flash too cheap to be useful?

For fast, narrow reasoning at scale, no — Intelligence Index 47 with ~82 tok/s is competitive. For deep work that benefits from long-context hybrid attention, switch to V4 Pro.

V4 Flash or MiniMax M2.5?

V4 Flash has a published Intelligence Index (47) and is cheaper ($0.14 vs $0.27 input). M2.5 is tuned for cross-platform full-stack agentic development.

Which Command Code model should I use?

Claude Sonnet 4.6 is the recommended default. Switch to GPT-5.5 (Intelligence Index 60) for the absolute hardest reasoning, or Claude Opus 4.7 / GPT-5.4 (both 57) for top-tier work at lower cost. For fast lookups, Claude Haiku 4.5 or GPT-5.4 Mini. For open-source, Kimi K2.6 leads the open-weights tier (Intelligence Index 54).

Can I mix DeepSeek V4 Flash with other models in a workflow?

Yes. Switch per session using /model. Common pattern: keep Sonnet 4.6 as the default and switch up to Opus 4.7 or down to Haiku 4.5 as the task calls for it.

Are open-source model prices fixed?

Open-source models are routed across multiple upstream providers for high availability. The price listed for each is the mean per-provider rate. Actual cost on a given request may vary slightly. The Usage page reflects the price charged.

Is Command Code free to try?

The Go plan starts at $1/mo with $10 in LLM credits and $100 of taste-1 usage. It covers open-source models only. Pro at $15/mo unlocks premium models with $30 in LLM credits and $500 of taste-1 usage.

Does Command Code train on my code?

No. Command Code does not train on your code or store your code snippets. taste-1 data is stored locally in your project directory.

Where can I track my usage?

The Usage page in Studio shows per-request cost, token counts, and which model ran. Settings > Billing lets you change plans, buy credits, or enable auto top-up.

Does Command Code replace my editor?

No. Command Code is editor-agnostic — it runs as a CLI and works alongside any editor (Cursor, VS Code, Zed, JetBrains, Neovim, etc.).

Ship code that matches your taste

Command Code is the AI coding agent that continuously learns your taste. Start for $1.