glm-4.7-flashx

Common Name: GLM-4.7 FlashX

ChatGLM
Released on Dec 29, 2025 12:00 AMKnowledge Cutoff Apr 1, 2025 12:00 AMSupportedTool InvocationSupportedReasoning
CompareTry in Chat

Low-cost, high-speed variant of GLM-4.7 optimized for high-throughput inference at a fraction of the flagship price.

Specifications

Context
200K
Maximum Output
128K
Inputtext
Outputtext

Performance (7-day Average)

Collecting…
Collecting…
Collecting…

Pricing

Input¥0.55/MTokens
Output¥3.30/MTokens
Cached Input¥0.11/MTokens

Availability Trend (24h)

Performance Metrics (24h)

Similar Models

¥0.55/¥0.55/M
ctx128Kmax16Kavailtps
InOutCap

Zhipu AI's lightweight GLM-4 variant for cost-effective tasks with 128K context.

¥1.10/¥1.10/M
ctx1.0Mmax4Kavailtps
InOutCap

GLM-4 variant with extended 1M token context window for processing very long documents.

Free/Free
ctx131Kmax98Kavailtps
InOutCap

Fast, cost-efficient version of GLM-4.5. Optimized for high-throughput applications.

¥4.40/¥13.20/M
ctx128Kmaxavailtps
InOutCap

Zhipu AI's GLM-4.5 AirX variant optimized for high-speed inference.