Capability
Embeddings, one API call away.
Turn text into vectors for search, RAG, clustering, and classification. FlexAI serves open embedding models on the OpenAI-compatible /v1/embeddings endpoint, billed per token.
Models you can call
Served on FlexAI today, on the OpenAI-compatible API.
| Model | Context (tokens) | Price (per 1M tokens) | Status |
|---|---|---|---|
| BGE-M3 | 8K | $0.009 | Serving |
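
Because billing is per token, a rough budget is simple arithmetic. The sketch below is illustrative only: it assumes the listed BGE-M3 rate of $0.009 is charged per one million input tokens, and the workload figures and the `estimate_cost` helper are hypothetical.

```python
from decimal import Decimal

# Assumption: the listed BGE-M3 rate is USD per one million input tokens.
PRICE_PER_MILLION_TOKENS = Decimal("0.009")


def estimate_cost(total_tokens, price_per_million=PRICE_PER_MILLION_TOKENS):
    """Return the estimated charge in USD for embedding total_tokens tokens."""
    return Decimal(total_tokens) * price_per_million / Decimal(1_000_000)


# Hypothetical workload: 100,000 documents averaging 500 tokens each.
corpus_tokens = 100_000 * 500
print(f"${estimate_cost(corpus_tokens):.2f}")  # prints $0.45
```

Actual charges depend on how the tokenizer splits your text, so treat the result as an estimate.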
Call it
OpenAI-compatible /v1/embeddings. Same key as every other model.
curl https://tokens.flex.ai/v1/embeddings \
  -H "Authorization: Bearer $FLEXAI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "bge-m3",
    "input": "FlexAI runs open models behind one key."
  }'
In the pipeline
Use cases that put this capability to work.
Start serverless and pay per use. When volume proves out, move to a dedicated endpoint on the same key.
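
As one illustration of the search use case, the sketch below embeds a query and a handful of documents, then ranks the documents by cosine similarity. It is a minimal sketch, not a reference client. It assumes the endpoint accepts a list of strings as `input` and returns the OpenAI-style response shape (`data[i].embedding` with an `index` field); neither is confirmed on this page. The helper names `embed`, `cosine`, `rank`, and `search` are hypothetical.

```python
import json
import math
import os
import urllib.request

API_URL = "https://tokens.flex.ai/v1/embeddings"  # same URL as the curl example


def embed(texts, model="bge-m3"):
    """POST a list of texts and return one embedding vector per input."""
    body = json.dumps({"model": model, "input": texts}).encode("utf-8")
    request = urllib.request.Request(
        API_URL,
        data=body,
        headers={
            "Authorization": f"Bearer {os.environ['FLEXAI_API_KEY']}",
            "Content-Type": "application/json",
        },
    )
    with urllib.request.urlopen(request) as response:
        payload = json.load(response)
    # Assumption: OpenAI-style body, {"data": [{"index": 0, "embedding": [...]}, ...]}.
    rows = sorted(payload["data"], key=lambda row: row["index"])
    return [row["embedding"] for row in rows]


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def rank(query_vec, doc_vecs):
    """Return document indices ordered from most to least similar to the query."""
    scores = [cosine(query_vec, vec) for vec in doc_vecs]
    return sorted(range(len(doc_vecs)), key=lambda i: scores[i], reverse=True)


def search(query, docs, embed_fn=embed):
    """Embed docs and query in one request, then return docs best match first."""
    vectors = embed_fn(docs + [query])
    doc_vecs, query_vec = vectors[:-1], vectors[-1]
    return [docs[i] for i in rank(query_vec, doc_vecs)]
```

Calling `search("How is usage billed?", docs)` with `FLEXAI_API_KEY` set would return `docs` reordered by relevance. For a larger corpus, you would typically embed the documents once and store the vectors in an index instead of re-embedding them on every query.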
Every modality, one API key
Text, vision, image, audio, and embeddings behind one OpenAI-compatible key.
$10/month in free credits for your first 3 months