Skip to content

    Model family

    Qwen on FlexAI.Every variant. One key

    Qwen is Alibaba's open model line on FlexAI. 3 variants run serverless on the OpenAI-compatible API, with 12 more available as dedicated endpoints, spanning chat, code, multimodal, vision. One API key serves every variant.

    Variants

    Every served variant in the family, with live serverless pricing.

    Serverless · pay per token

    ModelContextPriceStatus
    Qwen3.6-35B-A3B256K$0.09 / $0.135 per M Serving
    Qwen3 Coder 30B A3B256K$0.063 / $0.234 per M Serving
    Qwen3.5 9B250K$0.09 / $0.135 per M Serving

    Dedicated endpoints · reserved GPUs

    ModelContextPriceStatus
    Qwen3 Coder 480B A35B256KDedicatedDedicated
    Qwen3.5 397B A17B256KDedicatedDedicated
    Qwen3 235B A22B 2507256KDedicatedDedicated
    Qwen3-VL 235B A22B Instruct256KDedicatedDedicated
    Qwen3 Coder Next256KDedicatedDedicated
    Qwen3-Next 80B A3B Instruct256KDedicatedDedicated
    Qwen2.5 Coder 32B Instruct128KDedicatedDedicated
    QwQ 32B128KDedicatedDedicated
    Qwen2.5 32B Instruct128KDedicatedDedicated
    Qwen3 30B A3B Thinking 250733KDedicatedDedicated
    Qwen3 8B41KDedicatedDedicated
    Qwen3.5-4B256KDedicatedDedicated

    Which variant for what

    Pick by the role you're filling. Same key for all of them.

    Flagship

    Qwen3.6-35B-A3B

    Qwen3.6-35B-A3B is the largest served serverless variant. Reach for it first.

    Fast & economical

    Qwen3.5 9B

    Qwen3.5 9B is the smallest serverless variant. Lowest latency and cost.

    Qwen3 Coder 30B A3B runs generate in coding agents

    Call the flagship

    OpenAI-compatible. Swap the model id for any variant above.

    curl https://tokens.flex.ai/v1/chat/completions \
      -H "Authorization: Bearer $FLEXAI_API_KEY" \
      -H "Content-Type: application/json" \
      -d '{
        "model": "Qwen3.6-35B-A3B-FP8",
        "messages": [{"role": "user", "content": "Hello from FlexAI"}]
      }'

    Run Qwen on one API key

    Every Qwen variant, serverless and dedicated, behind one OpenAI-compatible key.

    $10/month in free credits for your first 3 months