Model family

GPT-OSS on FlexAI. Every variant. One key.

GPT-OSS is OpenAI's open model line on FlexAI. One chat variant runs serverless on the OpenAI-compatible API, and two more are available as dedicated endpoints. One API key serves every variant.

Variants

Every served variant in the family, with live serverless pricing.

Serverless · pay per token

Model	Context	Price	Status
GPT-OSS 120B	128K	$0.039 / $0.10 per 1M tokens	Serving
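
For a rough feel of what the serverless price means for a workload, the arithmetic can be sketched as below. This assumes the two listed figures are the input and output rates per million tokens, which the table does not label explicitly. The function name `estimate_cost` and the sample token counts are illustrative only.

```python
from decimal import Decimal

# Assumption: $0.039 is the input rate and $0.10 is the output rate,
# both per 1M tokens. The pricing table does not label the two figures.
INPUT_PER_M = Decimal("0.039")
OUTPUT_PER_M = Decimal("0.10")


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost for the given token counts."""
    million = Decimal(1_000_000)
    return (input_tokens * INPUT_PER_M + output_tokens * OUTPUT_PER_M) / million


if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 500K output tokens.
    print(estimate_cost(2_000_000, 500_000))
```

Decimal is used instead of float so that small per-token rates do not pick up binary rounding noise when summed over many requests.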

Dedicated endpoints · reserved GPUs

Model	Context	Price	Status
GPT-OSS Safeguard 20B	128K	Dedicated	Dedicated
GPT-OSS 20B	128K	Dedicated	Dedicated

Which variant for what

Pick by the role you're filling. Same key for all of them.

Flagship

GPT-OSS 120B

GPT-OSS 120B is the largest variant in the family and the only one served serverless. Reach for it first.

GPT-OSS 120B runs the plan step in workflow automation.

Call the flagship

OpenAI-compatible. Swap the model id for any variant above.

curl https://tokens.flex.ai/v1/chat/completions \
  -H "Authorization: Bearer $FLEXAI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-oss-120b",
    "messages": [{"role": "user", "content": "Hello from FlexAI"}]
  }'
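
The same request can be sketched in Python using only the standard library. The URL, model id, and `FLEXAI_API_KEY` variable come from the curl call above. The helper names `build_request` and `chat` are hypothetical, and the response parsing assumes the endpoint returns the usual OpenAI chat completions shape (`choices[0].message.content`), which this page implies but does not show.

```python
import json
import os
import urllib.request

API_URL = "https://tokens.flex.ai/v1/chat/completions"


def build_request(prompt: str, model: str = "gpt-oss-120b",
                  api_key: str = "") -> urllib.request.Request:
    """Build the same POST request as the curl example, without sending it."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def chat(prompt: str, model: str = "gpt-oss-120b") -> str:
    """Send the prompt and return the reply text."""
    request = build_request(prompt, model, os.environ["FLEXAI_API_KEY"])
    with urllib.request.urlopen(request, timeout=30) as response:
        body = json.load(response)
    # Assumption: the response follows the OpenAI chat completions schema.
    return body["choices"][0]["message"]["content"]


if __name__ == "__main__":
    if os.environ.get("FLEXAI_API_KEY"):
        print(chat("Hello from FlexAI"))
    else:
        print("Set FLEXAI_API_KEY to send the request.")
```

Keeping request construction separate from sending makes it easy to inspect the payload or swap the model id before any tokens are billed.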

Where it runs

Use cases that put GPT-OSS to work in a pipeline.

Workflow automation

Agents that chain tools and approvals into reliable multi-step runs.

Run GPT-OSS on one API key

Every GPT-OSS variant, serverless and dedicated, behind one OpenAI-compatible key.

$10/month in free credits for your first 3 months
