glm-4.7-flashx

Common Name: GLM-4.7 FlashX

Released: Dec 29, 2025
Knowledge Cutoff: Apr 1, 2025
Tool Invocation: Supported
Reasoning: Supported

Low-cost, high-speed variant of GLM-4.7 optimized for high-throughput inference at a fraction of the flagship price.

Context: 200K

Maximum Output: 128K

Input Modality: text

Output Modality: text


Input Price: ¥0.55 per M tokens

Output Price: ¥3.30 per M tokens

Cached Input Price: ¥0.11 per M tokens
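
The per-million-token rates above translate into a per-request cost with simple arithmetic. The following is a minimal sketch, not a billing reference: the function name `estimate_cost` and the dictionary `PRICE_PER_MTOK` are hypothetical, and it assumes that cached input tokens are billed at the cached rate instead of the regular input rate, which this listing does not confirm.

```python
from decimal import Decimal

# Listed rates in CNY per million tokens, copied from the pricing above.
PRICE_PER_MTOK = {
    "input": Decimal("0.55"),
    "output": Decimal("3.30"),
    "cached_input": Decimal("0.11"),
}
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens, output_tokens, cached_input_tokens=0):
    """Return an estimated request cost in CNY.

    Assumption (not stated in the listing): `input_tokens` counts only
    uncached input, and cached tokens are billed separately at the
    cached-input rate.
    """
    counts = {
        "input": input_tokens,
        "output": output_tokens,
        "cached_input": cached_input_tokens,
    }
    if any(n < 0 for n in counts.values()):
        raise ValueError("token counts must be non-negative")
    total = sum(PRICE_PER_MTOK[kind] * n for kind, n in counts.items())
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # Example: 100K fresh input, 400K cached input, 20K output tokens.
    print(estimate_cost(100_000, 20_000, cached_input_tokens=400_000))
```

`Decimal` is used instead of `float` so that small per-token amounts add up without binary rounding error.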


Price (input/output): ¥0.55/¥0.55 per M tokens

Context: 128K, Max Output: 16K, Availability: n/a, TPS: n/a

Zhipu AI's lightweight GLM-4 variant for cost-effective tasks with 128K context.

Price (input/output): ¥1.10/¥1.10 per M tokens

Context: 1.0M, Max Output: 4K, Availability: n/a, TPS: n/a

GLM-4 variant with extended 1M token context window for processing very long documents.

Price (input/output): Free/Free

Context: 131K, Max Output: 98K, Availability: n/a, TPS: n/a

Fast, cost-efficient version of GLM-4.5. Optimized for high-throughput applications.

Price (input/output): ¥4.40/¥13.20 per M tokens

Context: 128K, Max Output: n/a, Availability: n/a, TPS: n/a

Zhipu AI's GLM-4.5 AirX variant optimized for high-speed inference.