Models

Kimi K2.7 Code HighSpeed

TL;DR

Open-source

High-throughput serving variant of Kimi K2.7 Code, with vision.

Available on

GoProMax 10×Max 20×TeamsEnterprise

Routed across multiple upstream providers; price matches the Command Code pricing page.

Switch with

/model

Pick Kimi K2.7 Code HighSpeed from the selector.

Input

$1.90

per M tokens

Output

$8

per M tokens

Cache read

$0.38

per M tokens

Kimi K2.7 Code HighSpeed in Command Code

Kimi K2.7 Code HighSpeed is the high-throughput serving variant of Kimi K2.7 Code — the same model and output quality, served on faster infrastructure at exactly twice the standard token rate. Vision input and a 262K-token context. Reach for it when throughput, not per-token cost, is the constraint.

K2.7 Code HighSpeed vs the Command Code lineup

Pricing for K2.7 Code HighSpeed alongside the most relevant peers. K2.7 Code HighSpeed is not yet on the Intelligence Index; benchmarked peers show their scores.

ModelIntelligenceSpeedInput $/MOutput $/M
Kimi K2.7 Code HighSpeedNot yet scored$1.90$8.00
Kimi K2.7 CodeNot yet scored$0.95$4.00
Kimi K2.654~40 tok/s$0.95$4.00
GPT-5.4 Mini49~164 tok/s$0.75$4.50
Claude Haiku 4.537~97 tok/s$1.00$5.00
DeepSeek V4 Flash47~82 tok/s$0.14$0.28

What K2.7 Code HighSpeed is best for

Latency-sensitive coding loops, interactive sessions where wait time dominates, and high-throughput batch runs.

When to switch away from K2.7 Code HighSpeed

Switch to Kimi K2.7 Code

The standard variant at half the token price ($0.95/$4.00). Pick it when cost matters more than speed.

Switch to Kimi K2.6

The benchmarked Kimi (Intelligence Index 54) when you want a scored baseline.

Switch to DeepSeek V4 Flash or GPT-5.4 Mini

For cheaper, faster work where per-task cost dominates.

In Command Code: caching and taste-1

Open-source models are routed across multiple upstream providers for high availability. The price you see is the mean per-provider rate; the Usage page reflects what was actually charged.

Where supported by the upstream, prompt caching is on by default — cache reads are billed at $0.38 per million tokens versus $1.90 for fresh input.

taste-1 sits between the model and the agent loop, rewriting and reranking candidate edits to match your codebase conventions.

Plan availability

Open-source model. Available on every plan, including Go ($1/mo). Routed across multiple upstream providers; listed price matches the Command Code pricing page.

All Command Code models, ranked by quality and speed

Quality is the Intelligence Index — an aggregate score across reasoning, math, coding, and knowledge evaluations. Speed is reported output tokens per second. Models without a published score are noted.

ModelTierIntelligence IndexOutput speed
GPT-5.5Premium60~65 tok/s
Claude Opus 4.7Premium57~49 tok/s
GPT-5.4Premium57~84 tok/s
GPT-5.3 CodexPremium54~72 tok/s
Kimi K2.6Open-source54~40 tok/s
Claude Sonnet 4.6Premium52~62 tok/s
DeepSeek V4 ProOpen-source52~35 tok/s
GLM-5Open-source50~61 tok/s
GPT-5.4 MiniPremium49~164 tok/s
DeepSeek V4 FlashOpen-source47~82 tok/s
Claude Haiku 4.5Premium37~97 tok/s
Kimi K2.5Open-source37~35 tok/s
Claude Sonnet 5PremiumNot yet scored
Claude Opus 4.8PremiumNot yet scored
Claude Opus 4.6PremiumNot yet scored
MiniMax M2.5Open-sourceNot yet scored

Switching models with /model

In an interactive Command Code session, run /model to open the model selector. Pick the model you want and it applies to this session and to future sessions until you change it again. Premium models require Pro or higher; open-source models are available on every plan, including Go.

cmd               # start an interactive session
/model            # open the selector and pick a model

Plans and pricing

Command Code is a subscription with model usage at API rates. Each plan ships with monthly LLM credits. Credits roll over and never expire. Auto top-up keeps you running if you go over.

PlanPrice/moLLM creditsModels
Go$1$10Open-source only
Pro$15$30Open-source + premium
Max 10×$100$150Open-source + premium
Max 20×$200$300Open-source + premium
Teams$40 / seatPooledOpen-source + premium
EnterpriseCustomCustomCustom pool, SSO, audit logs

Frequently asked questions

What is different about the HighSpeed variant?

Same Kimi K2.7 Code model and output quality, served on higher-throughput infrastructure. Token pricing is exactly 2× the standard variant.

When is HighSpeed worth 2× the price?

When latency is the bottleneck — interactive sessions or throughput-bound batch work. For cost-sensitive runs, use standard K2.7 Code.

Which Command Code model should I use?

For open models, Kimi, DeepSeek, Qwen, Mimo are all really good. For closed models, Claude Sonnet 5 is the recommended default — the best combination of speed and intelligence, and a drop-in upgrade from Sonnet 4.6. Switch to Claude Opus 4.8 (the newest Anthropic flagship) for the most capable long-horizon agentic coding, GPT-5.5 (Intelligence Index 60) for the absolute hardest reasoning, or Claude Opus 4.7 / GPT-5.4 (both 57) for top-tier work at lower cost. For fast lookups, Claude Haiku 4.5 or GPT-5.4 Mini. For open-source, Kimi K2.6 leads the open-weights tier (Intelligence Index 54).

Can I mix Kimi K2.7 Code HighSpeed with other models in a workflow?

Yes. Switch per session using /model. Common pattern: keep Sonnet 5 as the default and switch up to Opus 4.8 or down to Haiku 4.5 as the task calls for it.

Are open-source model prices fixed?

Open-source models are routed across multiple upstream providers for high availability. The price listed for each is the mean per-provider rate. Actual cost on a given request may vary slightly. The Usage page reflects the price charged.

Is Command Code free to try?

The Go plan starts at $1/mo with $10 in LLM credits. It covers open-source models only. Pro at $15/mo unlocks premium models with $30 in LLM credits.

Does Command Code train on my code?

No. Command Code does not train on your code or store your code snippets. taste-1 data is stored locally in your project directory.

Where can I track my usage?

The Usage page in Studio shows per-request cost, token counts, and which model ran. Settings > Billing lets you change plans, buy credits, or enable auto top-up.

Does Command Code replace my editor?

No. Command Code is editor-agnostic — it runs as a CLI and works alongside any editor (Cursor, VS Code, Zed, JetBrains, Neovim, etc.).

Ship code that matches your taste

Command Code is the AI coding agent that continuously learns your taste. Start for $1.