Skip to content

    Model family

    DeepSeek on FlexAI.Every variant. One key

    DeepSeek is DeepSeek's open model line on FlexAI. 2 variants run serverless on the OpenAI-compatible API, with 7 more available as dedicated endpoints, spanning chat. One API key serves every variant.

    Variants

    Every served variant in the family, with live serverless pricing.

    Serverless · pay per token

    ModelContextPriceStatus
    DeepSeek V3.2160K$0.225 / $0.225 per M Serving
    DeepSeek V4 Flash1.0M$0.082 / $0.164 per M Serving

    Dedicated endpoints · reserved GPUs

    ModelContextPriceStatus
    DeepSeek V4 Pro1.0MDedicatedDedicated
    DeepSeek R1160KDedicatedDedicated
    DeepSeek V3 0324160KDedicatedDedicated
    DeepSeek R1 0528160KDedicatedDedicated
    DeepSeek V332KDedicatedDedicated
    DeepSeek R1 Distill Qwen 32B32KDedicatedDedicated
    DeepSeek R1 Distill Qwen 1.5B32KDedicatedDedicated

    Which variant for what

    Pick by the role you're filling. Same key for all of them.

    Flagship

    DeepSeek V3.2

    DeepSeek V3.2 is the largest served serverless variant. Reach for it first.

    Fast & economical

    DeepSeek V4 Flash

    DeepSeek V4 Flash is the smallest serverless variant. Lowest latency and cost.

    DeepSeek V3.2 runs reason in research agents · DeepSeek V4 Flash runs summarize in research agents

    Call the flagship

    OpenAI-compatible. Swap the model id for any variant above.

    curl https://tokens.flex.ai/v1/chat/completions \
      -H "Authorization: Bearer $FLEXAI_API_KEY" \
      -H "Content-Type: application/json" \
      -d '{
        "model": "DeepSeek-V3.2",
        "messages": [{"role": "user", "content": "Hello from FlexAI"}]
      }'

    Run DeepSeek on one API key

    Every DeepSeek variant, serverless and dedicated, behind one OpenAI-compatible key.

    $10/month in free credits for your first 3 months