Gemini 3.1 Flash Lite

TL;DR

Premium

High-volume workhorse model with implicit caching.

Available on

ProMax 10×Max 20×TeamsEnterprise

Not on the Go plan ($1/mo, open-source models only).

Switch with

/model

Pick Gemini 3.1 Flash Lite from the selector.

Input

$0.25

per M tokens

Output

$1.50

per M tokens

Cache read

$0.03

per M tokens

Start now See pricing

Gemini 3.1 Flash Lite in Command Code

Gemini 3.1 Flash Lite is Google's high-volume workhorse — a fast, inexpensive model with implicit caching, vision input, and a 1M-token context. It routes through the gateway to Google Vertex and is gated as a premium model.

Gemini 3.1 Flash Lite vs the Command Code lineup

Pricing for Gemini 3.1 Flash Lite alongside the most relevant peers. Gemini 3.1 Flash Lite is not yet on the Intelligence Index; benchmarked peers show their scores.

Model	Intelligence	Speed	Input $/M	Output $/M
Gemini 3.1 Flash Lite	Not yet scored	—	$0.25	$1.50
Gemini 3.5 Flash	Not yet scored	—	$1.50	$9.00
GPT-5.4 Mini	49	~164 tok/s	$0.75	$4.50
Claude Haiku 4.5	37	~97 tok/s	$1.00	$5.00
DeepSeek V4 Flash	47	~82 tok/s	$0.14	$0.28
Kimi K2.5	37	~35 tok/s	$0.60	$3.00

What Gemini 3.1 Flash Lite is best for

High-volume lookups and edits, multimodal tasks (vision), long-context work, and anywhere fast, cheap Google-family inference fits.

When to switch away from Gemini 3.1 Flash Lite

Switch to Gemini 3.5 Flash

The stronger Google model with parallel agentic execution.

Switch to GPT-5.4 Mini

For a fast premium peer on the OpenAI side (Intelligence Index 49).

Switch to Claude Haiku 4.5

For the fast Claude option with automatic prompt caching.

In Command Code: caching and taste-1

Two things change the experience of using this model inside Command Code versus calling it directly through the upstream API.

First, prompt caching is on by default. In an agent loop the same context is read across many steps; cache reads are billed at $0.03 per million tokens versus $0.25 for fresh input.

Second, taste-1 sits between the model and the agent loop, rewriting and reranking candidate edits to match your codebase conventions.

Plan availability

Premium model. Available on Pro ($15/mo), Max 10× ($100/mo), Max 20× ($200/mo), Teams ($40/mo per seat), and Enterprise. Not on the Go plan.

All Command Code models, ranked by quality and speed

Quality is the Intelligence Index — an aggregate score across reasoning, math, coding, and knowledge evaluations. Speed is reported output tokens per second. Models without a published score are noted.

Model	Tier	Intelligence Index	Output speed
GPT-5.5	Premium	60	~65 tok/s
Claude Opus 4.7	Premium	57	~49 tok/s
GPT-5.4	Premium	57	~84 tok/s
GPT-5.3 Codex	Premium	54	~72 tok/s
Kimi K2.6	Open-source	54	~40 tok/s
Claude Sonnet 4.6	Premium	52	~62 tok/s
DeepSeek V4 Pro	Open-source	52	~35 tok/s
GLM-5	Open-source	50	~61 tok/s
GPT-5.4 Mini	Premium	49	~164 tok/s
DeepSeek V4 Flash	Open-source	47	~82 tok/s
Claude Haiku 4.5	Premium	37	~97 tok/s
Kimi K2.5	Open-source	37	~35 tok/s
Claude Sonnet 5	Premium	Not yet scored	—
Claude Opus 4.8	Premium	Not yet scored	—
Claude Opus 4.6	Premium	Not yet scored	—
MiniMax M2.5	Open-source	Not yet scored	—

Switching models with /model

In an interactive Command Code session, run /model to open the model selector. Pick the model you want and it applies to this session and to future sessions until you change it again. Premium models require Pro or higher; open-source models are available on every plan, including Go.

cmd               # start an interactive session
/model            # open the selector and pick a model

Plans and pricing

Command Code is a subscription with model usage at API rates. Each plan ships with monthly LLM credits. Credits roll over and never expire. Auto top-up keeps you running if you go over.

Plan	Price/mo	LLM credits	Models
Go	$1	$10	Open-source only
Pro	$15	$30	Open-source + premium
Max 10×	$100	$150	Open-source + premium
Max 20×	$200	$300	Open-source + premium
Teams	$40 / seat	Pooled	Open-source + premium
Enterprise	Custom	Custom	Custom pool, SSO, audit logs

Frequently asked questions

Gemini 3.1 Flash Lite or Gemini 3.5 Flash?

Flash Lite is the cheaper, high-volume option; 3.5 Flash is stronger, with parallel agentic execution. Both carry vision and a 1M context.

What is implicit caching?

The upstream automatically reuses repeated prompt prefixes across requests and bills those reads at the lower cache-read rate, with no manual cache configuration.

Which Command Code model should I use?

For open models, Kimi, DeepSeek, Qwen, Mimo are all really good. For closed models, Claude Sonnet 5 is the recommended default — the best combination of speed and intelligence, and a drop-in upgrade from Sonnet 4.6. Switch to Claude Opus 4.8 (the newest Anthropic flagship) for the most capable long-horizon agentic coding, GPT-5.5 (Intelligence Index 60) for the absolute hardest reasoning, or Claude Opus 4.7 / GPT-5.4 (both 57) for top-tier work at lower cost. For fast lookups, Claude Haiku 4.5 or GPT-5.4 Mini. For open-source, Kimi K2.6 leads the open-weights tier (Intelligence Index 54).

Can I mix Gemini 3.1 Flash Lite with other models in a workflow?

Yes. Switch per session using /model. Common pattern: keep Sonnet 5 as the default and switch up to Opus 4.8 or down to Haiku 4.5 as the task calls for it.

Are open-source model prices fixed?

Open-source models are routed across multiple upstream providers for high availability. The price listed for each is the mean per-provider rate. Actual cost on a given request may vary slightly. The Usage page reflects the price charged.

Is Command Code free to try?

The Go plan starts at $1/mo with $10 in LLM credits. It covers open-source models only. Pro at $15/mo unlocks premium models with $30 in LLM credits.

Does Command Code train on my code?

No. Command Code does not train on your code or store your code snippets. taste-1 data is stored locally in your project directory.

Where can I track my usage?

The Usage page in Studio shows per-request cost, token counts, and which model ran. Settings > Billing lets you change plans, buy credits, or enable auto top-up.

Does Command Code replace my editor?

No. Command Code is editor-agnostic — it runs as a CLI and works alongside any editor (Cursor, VS Code, Zed, JetBrains, Neovim, etc.).

Ship code that matches your taste

Command Code is the AI coding agent that continuously learns your taste. Start for $1.

Start now See pricing