GLM 5
ChatGLM-5
GLM 5 on FlexAI: Zhipu AI LLM, MIT license, available as a dedicated endpoint on FlexAI or your own infrastructure.
Pricing
Input
$0.54 / M tokens
Output
$1.73 / M tokens
Context
195K tokens
API endpoint
/v1/chat/completions
Compatibility
OpenAI
Parameters
744B MoE (40B active)
License
MIT
Hardware
12× H100
Quantization
FP8
GLM 5 runs as a dedicated endpoint, provisioned per customer on FlexAI's infrastructure or your own, not served through the shared Token Factory API.