Model family

DeepSeek on FlexAI.Every variant. One key

DeepSeek is DeepSeek's open model line on FlexAI. One variant runs serverless on the OpenAI-compatible API, with 10 more available as dedicated endpoints, spanning chat, vision. One API key serves every variant.

Get an API key All models

Variants

Every served variant in the family, with live serverless pricing.

Serverless · pay per token

Model	Context	Price	Status
DeepSeek V4 Flash	768K	$0.09 / $0.179 per M	Serving

Dedicated endpoints · reserved GPUs

Model	Context	Price	Status
DeepSeek V4 Pro	1.0M	Dedicated	Dedicated
DeepSeek V3.2	160K	Dedicated	Dedicated
DeepSeek R1	160K	Dedicated	Dedicated
DeepSeek V3 0324	160K	Dedicated	Dedicated
DeepSeek R1 0528	160K	Dedicated	Dedicated
DeepSeek V3	32K	Dedicated	Dedicated
DeepSeek V3.1	160K	Dedicated	Dedicated
DeepSeek R1 Distill Qwen 32B	32K	Dedicated	Dedicated
DeepSeek OCR	8K	Dedicated	Dedicated
DeepSeek R1 Distill Qwen 1.5B	32K	Dedicated	Dedicated

Which variant for what

Pick by the role you're filling. Same key for all of them.

Flagship

DeepSeek V4 Flash

DeepSeek V4 Flash is the largest served serverless variant. Reach for it first.

DeepSeek V4 Flash runs summarize in research agents

Call the flagship

OpenAI-compatible. Swap the model id for any variant above.

curl https://tokens.flex.ai/v1/chat/completions \
  -H "Authorization: Bearer $FLEXAI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "DeepSeek-V4-Flash",
    "messages": [{"role": "user", "content": "Hello from FlexAI"}]
  }'

Where it runs

Use cases that put DeepSeek to work in a pipeline.

Research agents

Agents that retrieve, reason over, and synthesize large source sets.

See the pipeline

Run DeepSeek on one API key

Every DeepSeek variant, serverless and dedicated, behind one OpenAI-compatible key.

Get an API key Talk to us

$10/month in free credits for your first 3 months

See how much you could save