NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIALaunch from Hugging FaceBeta

Filters

  • Free Endpoint
    5
  • Partner Endpoint
    5
  • Download Available
    4
  • Code Generation
    5
  • Retrieval Augmented Generation
    0
  • Drug Discovery
    0
  • Image-to-Text
    0
  • Object Detection
    0
  • Fireworks AI
    4
  • Deep Infra
    3
  • Together AI
    2
  • Vultr
    1
  • GMI Cloud
    0
  • Google
    3
  • Mistral AI
    2
  • Qwen
    2
  • Abacus.AI
    1
  • BigCode
    1
  • 9 models
    Mistral AI
    Downloadable

    mistral-small-4-119b-2603

    Hybrid MoE model unifying instruct, reasoning, and coding with multimodal input and 256k context
    chat
    26
    Today
    Qwen
    Downloadable

    qwen2.5-coder-32b-instruct

    Advanced LLM for code generation, reasoning, and fixing across popular programming languages.
    chat
    6.03M
    8mo
    Qwen
    Free Endpoint

    qwen2.5-coder-7b-instruct

    Powerful mid-size code model with a 32K context length, excelling in coding in multiple languages.
    chat
    577K
    9mo
    Abacus.AI
    Free Endpoint

    dracarys-llama-3.1-70b-instruct

    Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.
    chat
    630K
    9mo
    Mistral AI
    Free Endpoint

    mamba-codestral-7b-v0.1

    Model for writing and interacting with code across a wide range of programming languages and tasks.
    chat
    553K
    9mo
    BigCode
    Downloadable

    starcoder2-7b

    Advanced programming model for code completion, summarization, and generation
    code completion
    11.96K
    1y
    Google
    Free Endpoint

    gemma-2-27b-it

    Cutting-edge text generation model text understanding, transformation, and code generation.
    chat
    749K
    9mo
    Google
    Downloadable

    gemma-2-9b-it

    Cutting-edge text generation model text understanding, transformation, and code generation.
    chat
    4.63M
    9mo
    Google
    Free Endpoint

    gemma-7b

    Cutting-edge text generation model text understanding, transformation, and code generation.
    chat
    569K
    10mo
    Items per page
    of 1 pages