Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA | Launch from Hugging Face (Beta)

Filters (1)

  • Free Endpoint (4)
  • Partner Endpoint (1)
  • Download Available (2)
  • Synthetic Data Generation (4)
  • Image-to-Text (2)
  • Drug Discovery (0)
  • Code Generation (0)
  • Retrieval Augmented Generation (0)
  • Deep Infra (1)
  • Bitdeer AI (1)
  • Lightning AI (1)
  • Vultr (1)
  • Together AI (0)
  • NVIDIA (5)
  • Google (1)
  • Meta (0)
  • Mistral AI (0)
  • Qwen (0)
  • A100 SXM4 80GB (0)
  • B200 (0)
  • GB200 (0)
  • GH200 144G HBM3e (0)
  • H100 80GB HBM3 (0)

Video (6 models)

    NVIDIA
    Downloadable

    nemotron-3-nano-omni-30b-a3b-reasoning

    Nemotron 3 Nano Omni is an omni-modal reasoning model that understands images, video, speech, and text.
    Image-to-Text
    8.08M
    2w
    NVIDIA
    Free Endpoint

    cosmos-transfer2.5-2b

    Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.
    Synthetic Data Generation
    2mo
    NVIDIA
    Downloadable

    cosmos-reason2-8b

    Vision language model that excels in understanding the physical world using structured reasoning on videos or images.
    video understanding
    602K
    4mo
    NVIDIA
    Free Endpoint

    cosmos-transfer1-7b

    Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.
    Synthetic Data Generation
    266
    10mo
    NVIDIA
    Free Endpoint

    cosmos-predict1-5b

    Generates future frames of a physics-aware world state from just an image or short video prompt for physical AI development.
    Synthetic Data Generation
    750
    1y
    Google
    Free Endpoint

    paligemma

    Vision language model adept at comprehending text and visual inputs to produce informative responses.
    image
    15.21K
    1y