Qwen3.5-4B
ChatQwen/Qwen3.5-4B
Qwen3.5-4B on FlexAI: Alibaba LLM (Edge Multimodal), Apache 2.0 license, available as a dedicated endpoint on FlexAI or your own infrastructure.
Pricing
Input
$0.027 / M tokens
Output
$0.135 / M tokens
Context
256K tokens
API endpoint
/v1/chat/completions
Compatibility
OpenAI
Parameters
4B
License
Apache 2.0
Hardware
H100
Quantization
BF16
Qwen3.5-4B runs as a dedicated endpoint, provisioned per customer on FlexAI's infrastructure or your own, not served through the shared Token Factory API.