Zhipu AI's lightweight GLM-4 variant for cost-effective tasks with 128K context.
Specifications
Context
128K
Maximum Output
16.4K
Inputtext
Outputtext
Performance (7-day Average)
Collecting…
Collecting…
Collecting…
Pricing
Input¥0.55/MTokens
Output¥0.55/MTokens
Batch Input¥0.275/MTokens
Batch Output¥0.275/MTokens
Availability Trend (24h)
Performance Metrics (24h)
Similar Models
¥0.55/¥3.30/M
ctx200Kmax128Kavail—tps—
InOutCap
Low-cost, high-speed variant of GLM-4.7 optimized for high-throughput inference at a fraction of the flagship price.
¥1.10/¥1.10/M
ctx1.0Mmax4Kavail—tps—
InOutCap
GLM-4 variant with extended 1M token context window for processing very long documents.
¥2.20/¥6.60/M
ctx64Kmax16Kavail—tps—
InOutCap
Zhipu AI's multimodal model with vision capabilities. Processes text, images, video, and files for analysis tasks.
Free/Free
ctx131Kmax98Kavail—tps—
InOutCap
Fast, cost-efficient version of GLM-4.5. Optimized for high-throughput applications.