Skip to content

    GLM 4.7 Flash

    Chat

    GLM-4.7-Flash

    GLM 4.7 Flash on FlexAI: Zhipu AI LLM, ChatGLM License, available as a dedicated endpoint on FlexAI or your own infrastructure.

    Pricing

    Input

    $0.054 / M tokens

    Output

    $0.36 / M tokens

    Context

    198K tokens

    API endpoint

    /v1/chat/completions

    Compatibility

    OpenAI

    Parameters

    ~106B MoE (lite)

    License

    ChatGLM License

    Hardware

    2× H100

    Quantization

    BF16

    GLM 4.7 Flash runs as a dedicated endpoint, provisioned per customer on FlexAI's infrastructure or your own, not served through the shared Token Factory API.