How much does Z.ai: GLM 4.7 Flash cost?

$0.06 per million input tokens and $0.40 per million output tokens.

What is Z.ai: GLM 4.7 Flash's context window?

202,752 tokens, roughly 304 pages of text in a single request.

Does Z.ai: GLM 4.7 Flash support tool calling?

It supports tool calling, structured output, a reasoning mode.

Can Z.ai: GLM 4.7 Flash process images?

No, it is text-only on the input side.

z-ai

Z.ai: GLM 4.7 Flash

Z.ai: GLM 4.7 Flash is designed to process text inputs with a context length of up to 202,752 tokens and supports reasoning and tools for more complex tasks. It can handle single-modal input without structured output capabilities, making it suitable for scenarios requiring in-depth analysis or tool integration. This model stands out as a viable option for those seeking robust reasoning abilities and tool support within a budget-conscious framework. With its blended benchmark score of 46.3 across two independent benchmarks, it offers reliable performance. At a price point of $0.06 per million input tokens and $0.4 per million output tokens, Z.ai: GLM 4.7 Flash provides a cost-effective solution for users looking to balance quality with affordability.

Query via API → View on z-ai → Estimate cost

Quality Score

99/100

price + capability + benchmarks

Input Price

$0.06

per 1M tokens

Output Price

$0.40

per 1M tokens

Context Window

202,752

tokens

Benchmark results

Independent, published benchmarks. Blended score 46.3 across 2 benchmarks, last refreshed 2026-07-29. How scoring works →

Benchmark	Measures	Score
AI Index	broad capability composite	37.8
EQ-Bench	emotional understanding in dialogue	52.5

Model ID: z-ai/glm-4.7-flash
Vendor: z-ai
Released: January 2026
Tokenizer: Other
Input Modalities: text
Output Modalities: text
Max Output: 16,384 tokens
Tool Calling: ✓ supported
Structured Output: ✓ supported
Reasoning Mode: ✓ supported
Vision: text only
Audio: no
Moderated: no

What it costs in practice

Computed from the current $0.06/M input and $0.40/M output rates. Run your own numbers →

Job	Tokens	Cost
Summarize a 50-page report	30k in / 1.5k out	under $0.01
Classify 1,000 customer emails	500k in / 50k out	$0.05
A month of a busy support chatbot	5M in / 2M out	$1.10

Price & spec history

Tracked daily by PicksByModel since 2026-07-17.

Date	Input /M	Output /M	Context
2026-07-23	$0.06	$0.40	202,752
2026-07-22	$0.06	$0.40	202,752
2026-07-18	$0.06	$0.40	200,000
2026-07-17	$0.06	$0.40	202,752

Category rankings

Where Z.ai: GLM 4.7 Flash places across the 1 category it ranks in. How we rank →

#	Category	Score
#25	Cheap Bulk InferenceCost · of 25 ranked	137

Similar models

z-ai

Quick answers

How much does Z.ai: GLM 4.7 Flash cost?: $0.06 per million input tokens and $0.40 per million output tokens.
What is Z.ai: GLM 4.7 Flash's context window?: 202,752 tokens, roughly 304 pages of text in a single request.
Does Z.ai: GLM 4.7 Flash support tool calling?: It supports tool calling, structured output, a reasoning mode.
Can Z.ai: GLM 4.7 Flash process images?: No, it is text-only on the input side.

Z.ai: GLM 4.7 Flash

Benchmark results

What it costs in practice

Price & spec history

Category rankings

Similar models

Z.ai: GLM 5V Turbo

Z.ai: GLM 4.6V

Z.ai: GLM 4.7

Z.ai: GLM 4.6

Z.ai: GLM 5.2

Z.ai: GLM 5

Quick answers