Skip to content

    Model family

    Mistral on FlexAI.Every variant. One key

    Mistral is Mistral's open model line on FlexAI. One variant runs serverless on the OpenAI-compatible API, with 4 more available as dedicated endpoints, spanning chat, multimodal, vision, code, transcription. One API key serves every variant.

    Variants

    Every served variant in the family, with live serverless pricing.

    Serverless · pay per token

    ModelContextPriceStatus
    Mistral Nemo128K$0.018 / $0.027 per M Serving

    Dedicated endpoints · reserved GPUs

    ModelContextPriceStatus
    Mistral Medium 3.5256KDedicatedDedicated
    Mistral Small 3.1 24B128KDedicatedDedicated
    Mistral 7B Instruct v0.2128KDedicatedDedicated
    Voxtral Mini 4B Realtime128KDedicatedDedicated

    Which variant for what

    Pick by the role you're filling. Same key for all of them.

    Flagship

    Mistral Nemo

    Mistral Nemo is the largest served serverless variant. Reach for it first.

    Mistral Nemo runs execute steps in workflow automation

    Call the flagship

    OpenAI-compatible. Swap the model id for any variant above.

    curl https://tokens.flex.ai/v1/chat/completions \
      -H "Authorization: Bearer $FLEXAI_API_KEY" \
      -H "Content-Type: application/json" \
      -d '{
        "model": "Mistral-Nemo-Instruct-2407-FP8",
        "messages": [{"role": "user", "content": "Hello from FlexAI"}]
      }'

    Run Mistral on one API key

    Every Mistral variant, serverless and dedicated, behind one OpenAI-compatible key.

    $10/month in free credits for your first 3 months