Skip to content

    Qwen3-VL 8B Instruct

    Chat

    Qwen/Qwen3-VL-8B-Instruct

    Qwen3-VL 8B Instruct on FlexAI: Alibaba LLM (Multimodal), Apache 2.0 license, available as a dedicated endpoint on FlexAI or your own infrastructure.

    Pricing

    Input

    $0.076 / M tokens

    Output

    $0.296 / M tokens

    Context

    256K tokens

    API endpoint

    /v1/chat/completions

    Compatibility

    OpenAI

    Parameters

    8B

    License

    Apache 2.0

    Hardware

    H100

    Quantization

    BF16

    Qwen3-VL 8B Instruct runs as a dedicated endpoint, provisioned per customer on FlexAI's infrastructure or your own, not served through the shared Token Factory API.