DeepSeek R1 Distill Qwen 32B
ChatDeepSeek-R1-Distill-Qwen-32B-FP8-Dynamic
DeepSeek R1 Distill Qwen 32B on FlexAI: DeepSeek LLM (Reasoning), MIT license, available as a dedicated endpoint on FlexAI or your own infrastructure.
Pricing
Input
$0.243 / M tokens
Output
$0.243 / M tokens
Context
32K tokens
API endpoint
/v1/chat/completions
Compatibility
OpenAI
Parameters
32B
License
MIT
Hardware
H100
Quantization
FP8-Dynamic
DeepSeek R1 Distill Qwen 32B runs as a dedicated endpoint, provisioned per customer on FlexAI's infrastructure or your own, not served through the shared Token Factory API.