NVIDIA Nemotron 3 Nano 30B A3B FP8
ChatNVIDIA-Nemotron-3-Nano-30B-A3B-FP8
NVIDIA Nemotron 3 Nano 30B A3B FP8 on FlexAI: NVIDIA LLM, NVIDIA Open Model License, available as a dedicated endpoint on FlexAI or your own infrastructure.
Pricing
Input
$0.045 / M tokens
Output
$0.18 / M tokens
Context
256K tokens
API endpoint
/v1/chat/completions
Compatibility
OpenAI
Parameters
30B MoE (3B active)
License
NVIDIA Open Model License
Hardware
H100
Quantization
FP8
NVIDIA Nemotron 3 Nano 30B A3B FP8 runs as a dedicated endpoint, provisioned per customer on FlexAI's infrastructure or your own, not served through the shared Token Factory API.