Open. Portable. Observable. Fast.
Every model you and your agent need. One OpenAI-compatible key.
Keep your app loop, prompts, tools, structured outputs, evals, and approvals. Swap models, not your agent.
$10/month in free credits for your first 3 months
curl https://tokens.flex.ai/v1/chat/completions \
-H "Authorization: Bearer $FLEXAI_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "Meta-Llama-3.1-8B-Instruct-FP8",
"messages": [{"role": "user", "content": "Hello from FlexAI"}]
}'

The console, from builder to org
The same login, from your first key to org-wide governance.

- DeepSeek V3.2 (Chat)
- GPT-OSS 120B (Chat)
- GPT-OSS 20B (Chat)
- Llama 3.1 8B Instruct (Chat)
- Llama 3.3 70B Instruct (Chat)
- Mistral Nemo (Chat)
- Qwen3 Coder 30B A3B (Chat)
- Qwen3.6-35B-A3B (Chat)
- Gemma 4 31B IT (Chat)
- Gemma 4 26B A4B (Chat)
- Whisper Large V3 Turbo (Transcription)
- NVIDIA Parakeet TDT 0.6B v3 (Transcription)
- Kokoro-82M (Audio)
- BGE-M3 (Embeddings)
- FLUX.1 [schnell] (Image)
- PaddleOCR-VL 1.5 (Vision)
- Nemotron 3 Super 120B A12B (Chat)
- MiniMax M2.7 (Chat)
- Qwen3.5 9B (Chat)
- GLM 4.5 Air (Chat)
- DeepSeek V4 Flash (Chat)
Results in production
What teams get once they're building on Token Factory.
3 lines to integrate
Point the OpenAI SDK at FlexAI and swap the model id.
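As a rough illustration of that change, here is a minimal Python sketch. It assumes the official `openai` package is installed and that the SDK base URL is the curl example's URL with `/chat/completions` removed; `FLEXAI_BASE_URL`, `make_client`, `chat_request`, and `hello` are hypothetical names chosen for this example, not part of any FlexAI library.

```python
import os

# Assumption: the curl example's URL with "/chat/completions" dropped,
# since the OpenAI SDK appends that path itself.
FLEXAI_BASE_URL = "https://tokens.flex.ai/v1"


def make_client(api_key=None):
    """Return an OpenAI SDK client pointed at FlexAI (needs `pip install openai`)."""
    # Imported lazily so the helpers below can be used without the SDK installed.
    from openai import OpenAI

    return OpenAI(
        base_url=FLEXAI_BASE_URL,
        api_key=api_key or os.environ["FLEXAI_API_KEY"],
    )


def chat_request(model, prompt):
    """Build the same JSON body the curl example sends."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }


def hello(model="Meta-Llama-3.1-8B-Instruct-FP8"):
    """Send one chat completion and return the reply text."""
    client = make_client()
    response = client.chat.completions.create(
        **chat_request(model, "Hello from FlexAI")
    )
    return response.choices[0].message.content
```

With `FLEXAI_API_KEY` set in the environment, calling `hello()` should send the same request as the curl example. Passing a different `model` argument is the only change needed to try another model, provided that id is one the service accepts.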
From first call to production
One key the whole way.
- 1. Try it: An open, rate-limited demo in the browser. Feel the API before you commit.
- 2. Integrate: Swap base_url. Your prompts, tools, RAG, SDKs, and evals carry over unchanged.
- 3. Go to production: Spend, limits, and key health are first-class, with reliable error semantics.
- 4. Observe: Routing across models and request metadata, coming to every call.
Frequently Asked Questions
Past steady-state volume? Find your serverless-to-dedicated break-even.
Start with inference. Keep your agent loop.
Every open model behind one OpenAI-compatible key, with source-linked pricing.
