Qwen3 30B A3B Thinking 2507
ChatQwen3-30B-A3B-Thinking-2507-FP8
Qwen3 30B A3B Thinking 2507 on FlexAI: Alibaba LLM (Reasoning), Apache 2.0 license, available as a dedicated endpoint on FlexAI or your own infrastructure.
Pricing
Input
$0.072 / M tokens
Output
$0.252 / M tokens
Context
33K tokens
API endpoint
/v1/chat/completions
Compatibility
OpenAI
Parameters
30B MoE (3B active)
License
Apache 2.0
Hardware
H100
Quantization
FP8
Qwen3 30B A3B Thinking 2507 runs as a dedicated endpoint, provisioned per customer on FlexAI's infrastructure or your own, not served through the shared Token Factory API.