Skip to content

    NVIDIA Nemotron 3 Nano 30B A3B FP8

    Chat

    NVIDIA-Nemotron-3-Nano-30B-A3B-FP8

    NVIDIA Nemotron 3 Nano 30B A3B FP8 on FlexAI: NVIDIA LLM, NVIDIA Open Model License, available as a dedicated endpoint on FlexAI or your own infrastructure.

    Pricing

    Input

    $0.045 / M tokens

    Output

    $0.18 / M tokens

    Context

    256K tokens

    API endpoint

    /v1/chat/completions

    Compatibility

    OpenAI

    Parameters

    30B MoE (3B active)

    License

    NVIDIA Open Model License

    Hardware

    H100

    Quantization

    FP8

    NVIDIA Nemotron 3 Nano 30B A3B FP8 runs as a dedicated endpoint, provisioned per customer on FlexAI's infrastructure or your own, not served through the shared Token Factory API.