Skip to content

    Capability

    Embeddings,one API call away.

    Turn text into vectors for search, RAG, clustering, and classification. FlexAI serves open embedding models on the OpenAI-compatible /v1/embeddings endpoint, billed per token.

    Models you can call

    Served on FlexAI today, on the OpenAI-compatible API.

    ModelContextPriceStatus
    BGE-M38K$0.009 per M Serving

    Call it

    OpenAI-compatible /v1/embeddings. Same key as every other model.

    curl https://tokens.flex.ai/v1/embeddings \
      -H "Authorization: Bearer $FLEXAI_API_KEY" \
      -H "Content-Type: application/json" \
      -d '{
        "model": "bge-m3",
        "input": "FlexAI runs open models behind one key."
      }'

    Start serverless and pay per use. When volume proves out, move to a dedicated endpoint on the same key.

    Every modality, one API key

    Text, vision, image, audio, and embeddings behind one OpenAI-compatible key.

    $10/month in free credits for your first 3 months