nemotron-3-nano-omni-30b-a3b-reasoning Model by NVIDIA

import os from openai import OpenAI client = OpenAI( base_url = "https://integrate.api.nvidia.com/v1", api_key = os.getenv("NVIDIA_API_KEY", "$NVIDIA_API_KEY") ) completion = client.chat.completions.create( model="nvidia/nemotron-3-nano-omni-30b-a3b-reasoning", messages=[{"role":"user","content":""}], temperature=0.6, top_p=0.95, max_tokens=65536, extra_body={"chat_template_kwargs":{"enable_thinking":True},"reasoning_budget":16384}, stream=False ) reasoning = getattr(completion.choices[0].message, "reasoning_content", None) if reasoning: print(reasoning) print(completion.choices[0].message.content)

Follow the steps below to download and run the NVIDIA NIM inference microservice for this model on your infrastructure of choice.

Step 1
Generate API Key

Step 2
Pull and Run the NIM

$ docker login nvcr.io
Username: $oauthtoken
Password: <PASTE_API_KEY_HERE>

Pull and run the NVIDIA NIM with the command below. This will download the optimized model for your infrastructure.

export NGC_API_KEY=<PASTE_API_KEY_HERE>
export LOCAL_NIM_CACHE=~/.cache/nim
mkdir -p "$LOCAL_NIM_CACHE"
chmod -R a+w "$LOCAL_NIM_CACHE"
docker run -it --rm \
    --gpus all \
    --ipc host \
    --shm-size=32GB \
    -e NGC_API_KEY \
    -v "$LOCAL_NIM_CACHE:/opt/nim/.cache" \
    -p 8000:8000 \
    nvcr.io/nim/nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:latest

Step 3
Test the NIM

You can now make a local API call using this curl command:

curl -X 'POST' \
'http://0.0.0.0:8000/v1/chat/completions' \
    -H 'Accept: application/json' \
    -H 'Content-Type: application/json' \
    -d '{
        "model": "nvidia/nemotron-3-nano-omni-30b-a3b-reasoning",
        "messages": [
            {
                "role": "user",
                "content": [
                    {
                        "type": "text",
                        "text": "What is in this image?"
                    },
                    {
                        "type": "image_url",
                        "image_url":
                            {
                                "url": "https://assets.ngc.nvidia.com/products/api-catalog/phi-3-5-vision/example1b.jpg"
                            }
                    }
                ]
            }
        ],
        "max_tokens": 1024
    }'

For more details on getting started with this NIM, visit the NVIDIA NIM Docs.

nvidia/nemotron-3-nano-omni-30b-a3b-reasoning

Prototype

Deploy

Step 1
Generate API Key

Step 2
Pull and Run the NIM

Step 3
Test the NIM

Step 1Generate API Key

Step 2Pull and Run the NIM

Step 3Test the NIM

Step 1
Generate API Key

Step 2
Pull and Run the NIM

Step 3
Test the NIM