Skip to content

    GLM 5 on FlexAI: Zhipu AI LLM, MIT license, available as a dedicated endpoint on FlexAI or your own infrastructure.

    Pricing

    Input

    $0.54 / M tokens

    Output

    $1.73 / M tokens

    Context

    195K tokens

    API endpoint

    /v1/chat/completions

    Compatibility

    OpenAI

    Parameters

    744B MoE (40B active)

    License

    MIT

    Hardware

    12× H100

    Quantization

    FP8

    GLM 5 runs as a dedicated endpoint, provisioned per customer on FlexAI's infrastructure or your own, not served through the shared Token Factory API.