Copyright © 2026 NVIDIA Corporation

13 results for

Filters (1)

  • Chat
  • NVIDIA
    Free Endpoint

    nemotron-voicechat

    Nemotron 3 Voicechat
    Model
    English
    1.77K
    3mo
    Mistral AI
    Free Endpoint

    mistral-nemotron

    Built for agentic workflows, this model excels in coding, instruction following, and function calling
    Model
    language generation
    1.49M
    1y
    NVIDIA
    Free Endpoint

    nemotron-mini-4b-instruct

    Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling
    Model
    Chat
    1.53M
    1y
    NVIDIA
    Downloadable, Free Endpoint

    nemotron-nano-12b-v2-vl

    Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.
    Model
    language generation
    2.47M
    7mo
    NVIDIA
    Downloadable, Free Endpoint

    nemotron-3-nano-30b-a3b

    Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more
    Model
    MoE
    11.91M
    6mo
    NVIDIA
    Downloadable, Free Endpoint

    nemotron-3-super-120b-a12b

    Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more
    Model
    MoE
    60.41M
    3mo
    NVIDIA
    Downloadable, Free Endpoint

    nemotron-3-ultra-550b-a55b

    Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more
    Model
    Agent
    7.73M
    15d
    NVIDIA
    Downloadable, Free Endpoint

    nvidia-nemotron-nano-9b-v2

    High‑efficiency LLM with hybrid Transformer‑Mamba design, excelling in reasoning and agentic tasks.
    Model
    thinking budget
    988K
    10mo
    NVIDIA
    Downloadable, Free Endpoint

    nemotron-3-nano-omni-30b-a3b-reasoning

    Nemotron 3 Nano Omni is an omni-modal reasoning model that understands images, video, speech, and text.
    Model
    Image-to-Text
    7.54M
    1mo
    NVIDIA
    Downloadable, Free Endpoint

    llama-3.1-nemotron-nano-8b-v1

    Model with leading reasoning and agentic AI accuracy for PC and edge.
    Model
    advanced reasoning
    1.47M
    11mo
    NVIDIA
    Downloadable, Free Endpoint

    llama-3.1-nemotron-nano-vl-8b-v1

    Multi-modal vision-language model that understands text and images and creates informative responses
    Model
    doc intelligence
    10.15M
    11mo
    NVIDIA
    Downloadable, Free Endpoint

    llama-3.3-nemotron-super-49b-v1

    High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.
    Model
    advanced reasoning
    4.93M
    11mo
    NVIDIA
    Downloadable, Free Endpoint

    llama-3.3-nemotron-super-49b-v1.5

    High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.
    Model
    advanced reasoning
    3.17M
    10mo