glm-4.7-flashx
Common Name: GLM-4.7 FlashX
ChatGLM
Released on Dec 29, 2025 12:00 AMKnowledge Cutoff Apr 1, 2025 12:00 AMSupportedTool InvocationSupportedReasoningLow-cost, high-speed variant of GLM-4.7 optimized for high-throughput inference at a fraction of the flagship price.
Specifications
Context
200K
Maximum Output
128K
Inputtext
Outputtext
Performance (7-day Average)
Collecting…
Collecting…
Collecting…
Pricing
Input¥0.55/MTokens
Output¥3.30/MTokens
Cached Input¥0.11/MTokens
Availability Trend (24h)
Performance Metrics (24h)
Similar Models
¥0.55/¥0.55/M
ctx128Kmax16Kavail—tps—
InOutCap
Zhipu AI's lightweight GLM-4 variant for cost-effective tasks with 128K context.
¥1.10/¥1.10/M
ctx1.0Mmax4Kavail—tps—
InOutCap
GLM-4 variant with extended 1M token context window for processing very long documents.
Free/Free
ctx131Kmax98Kavail—tps—
InOutCap
Fast, cost-efficient version of GLM-4.5. Optimized for high-throughput applications.
¥4.40/¥13.20/M
ctx128Kmax—avail—tps—
InOutCap
Zhipu AI's GLM-4.5 AirX variant optimized for high-speed inference.