NVIDIA

Copyright © 2026 NVIDIA Corporation

65 results for

  • DGX Spark
    30 MIN

    Text to Knowledge Graph

    Transform unstructured text into interactive knowledge graphs with LLM inference and graph visualization
    Playbook
    GraphRAG
    7mo
    LangChain
    Deprecation in 20d
    Launchable

    Structured Report Generation

    Generate detailed, structured reports on any topic using LangGraph and Llama 3.3 70B NIM.
    Blueprint
    LangGraph
    2mo
    NVIDIA
    Enterprise

    Synthetic Manipulation Motion Generation for Robotics

    Generate exponentially large amounts of synthetic motion trajectories for robot manipulation from just a few human demonstrations.
    Blueprint
    synthetic data
    2mo
    DGX Station
    30 MIN

    Text to Knowledge Graph on DGX Station

    Transform unstructured text into interactive knowledge graphs with LLM inference and graph visualization
    Playbook
    GraphRAG
    2mo
    Qwen
    Downloadable

    qwen3-next-80b-a3b-instruct

    Qwen3-Next Instruct blends hybrid attention, sparse MoE, and stability boosts for ultra-long context AI.
    Model
    text-generation
    24.38M
    7mo
    ByteDance
    Free Endpoint

    seed-oss-36b-instruct

    ByteDance open-source LLM with long-context, reasoning, and agentic intelligence.
    Model
    thinking budget
    1.24M
    8mo
    Abacus.AI
    Free Endpoint

    dracarys-llama-3.1-70b-instruct

    Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.
    Model
    Code Generation
    671K
    12mo
    Google
    Free Endpoint

    gemma-2-2b-it

    Advanced small language generative AI model for edge applications
    Model
    Chat
    620K
    12mo
    NVIDIA
    Free Endpoint

    nemotron-mini-4b-instruct

    Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling
    Model
    Chat
    1.43M
    1y
    Qwen
    Deprecation in 4d
    Downloadable

    qwen3-next-80b-a3b-thinking

    80B-parameter AI model with hybrid reasoning, MoE architecture, and support for 119 languages.
    Model
    Reasoning
    2.49M
    8mo
    Qwen
    Downloadable

    qwen3.5-122b-a10b

    122B MoE LLM (10B active) for coding, reasoning, multimodal chat. Agent-ready.
    Model
    tool calling
    12.59M
    2mo
    Mistral AI
    Downloadable

    mistral-medium-3.5-128b

    A high performing model for text generation, coding and agentic use cases
    Model
    coding
    1.85M
    2w
    Meta
    Downloadable

    llama-3.1-70b-instruct

    Powers complex conversations with superior contextual understanding, reasoning and text generation.
    Model
    Chat
    3.75M
    11mo
    Meta
    Downloadable

    llama-3.2-1b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    chat
    31.92K
    445K
    12mo
    Meta
    Downloadable

    llama-3.2-3b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    Chat
    20.51K
    1.32M
    12mo
    Minimaxai
    Deprecated
    Downloadable

    minimax-m2.5

    MiniMax M2.5 is a 230B-parameter text-to-text AI model excelling in coding, reasoning, and office tasks.
    Model
    coding
    6.59M
    2mo
    Minimaxai
    Free Endpoint

    minimax-m2.7

    MiniMax M2.7 is a 230B-parameter text-to-text AI model excelling in coding, reasoning, and office tasks.
    Model
    coding
    13.26M
    1mo
    Mistral AI
    Downloadable

    mistral-7b-instruct-v0.3

    This LLM follows instructions, completes requests, and generates creative text.
    Model
    Chat
    188K
    11mo
    OpenAI
    Downloadable

    gpt-oss-120b

    Mixture of Experts (MoE) reasoning LLM (text-only) designed to fit within an 80GB GPU.
    Model
    reasoning
    38.66M
    9mo
    OpenAI
    Downloadable

    gpt-oss-20b

    Smaller Mixture of Experts (MoE) text-only LLM for efficient AI reasoning and math
    Model
    reasoning
    19.27M
    9mo
    Mistral AI
    Deprecation in 4d
    Downloadable

    mixtral-8x22b-instruct-v0.1

    An MoE LLM that follows instructions, completes requests, and generates creative text.
    Model
    Advanced Reasoning
    2.45M
    10mo
    Mistral AI
    Downloadable

    mixtral-8x7b-instruct-v0.1

    An MoE LLM that follows instructions, completes requests, and generates creative text.
    Model
    Advanced Reasoning
    767K
    10mo
    Google
    Downloadable

    gemma-4-31b-it

    Dense 31B model delivering frontier reasoning for coding, agentic workflows, and fine-tuning.
    Model
    coding
    6.73M
    1mo
    Microsoft
    Downloadable

    phi-4-mini-instruct

    Lightweight multilingual LLM powering AI applications in latency-bound, memory- and compute-constrained environments
    Model
    Chat
    568K
    12mo