NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

16 results for

Filters (1)

  • Download Available
    9
  • API Endpoint
    7
  • Launchable
    0
  • Enterprise
    0
  • Synthetic Data Generation
    1
  • Digital Twin
    1
  • Code Generation
    1
  • Image-to-Text
    1
  • Retrieval Augmented Generation
    0
  • NVIDIA
    14
  • Igenius
    1
  • Mistral AI
    1
  • OpenAI
    0
  • Pipecat
    0
  • NVIDIA AI
    0
  • NVIDIA BioNemo
    0
  • chat
  • NVIDIA

    nvidia-nemotron-nano-9b-v2

    High‑efficiency LLM with hybrid Transformer‑Mamba design, excelling in reasoning and agentic tasks.
    Model
    thinking budget
    675K
    6mo
    Mistral AI

    mistral-7b-instruct-v0.2

    This LLM follows instructions, completes requests, and generates creative text.
    Model
    chat
    487K
    9mo
    NVIDIA

    llama-3.1-nemotron-nano-4b-v1.1

    State-of-the-art open model for reasoning, code, math, and tool calling - suitable for edge agents
    Model
    edge
    98.5K
    8mo
    NVIDIA

    llama-3.1-nemotron-nano-8b-v1

    Leading reasoning and agentic AI accuracy model for PC and edge.
    Model
    chat
    574K
    8mo
    NVIDIA

    llama-3.1-nemotron-nano-vl-8b-v1

    Multi-modal vision-language model that understands text/img and creates informative responses
    Model
    doc intelligence
    7.09M
    8mo
    NVIDIA

    llama-3.1-nemotron-ultra-253b-v1

    Superior inference efficiency with highest accuracy for scientific and complex math reasoning, coding, tool calling, and instruction following.
    Model
    chat
    6.97M
    8mo
    NVIDIA

    llama-3.3-nemotron-super-49b-v1

    High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.
    Model
    chat
    1.07M
    7mo
    NVIDIA

    llama-3.3-nemotron-super-49b-v1.5

    High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.
    Model
    chat
    4.42M
    7mo
    NVIDIA

    nemotron-3-nano-30b-a3b

    Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more
    Model
    MoE
    11.66M
    2mo
    NVIDIA

    nemotron-4-mini-hindi-4b-instruct

    A bilingual Hindi-English SLM for on-device inference, tailored specifically for Hindi Language.
    Model
    Indic
    454K
    9mo
    NVIDIA

    nemotron-mini-4b-instruct

    Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling
    Model
    chat
    488K
    1y
    NVIDIA

    usdcode

    State-of-the-art LLM that answers OpenUSD knowledge queries and generates USD-Python code.
    Model
    OpenUSD
    335K
    8mo
    NVIDIA

    llama3-chatqa-1.5-8b

    Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.
    Model
    text-to-text
    482K
    9mo
    NVIDIA

    mistral-nemo-minitron-8b-base

    State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.
    Model
    language generation
    4.83K
    1y
    NVIDIA

    nemotron-nano-12b-v2-vl

    Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.
    Model
    language generation
    1.54M
    4mo
    Igenius

    colosseum_355b_instruct_16k

    NVIDIA DGX Cloud trained multilingual LLM designed for mission critical use cases in regulated industries including financial services, government, heavy industry
    Model
    Heavy industry
    85.17K
    9mo
    Items per page
    of 1 pages