NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIALaunch from Hugging FaceBeta
Sorting by

qwenqwen2.5-coder-32b-instruct

Advanced LLM for code generation, reasoning, and fixing across popular programming languages.

code completioncode generationchattext-to-code

qwenqwen2.5-coder-7b-instruct

Powerful mid-size code model with a 32K context length, excelling in coding in multiple languages.

code completioncode generationchattext-to-code

abacusaidracarys-llama-3.1-70b-instruct

Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.

chatCode GenerationText-to-Text

mistralaimamba-codestral-7b-v0.1

Model for writing and interacting with code across a wide range of programming languages and tasks.

code completioncode generationchat

bigcodestarcoder2-7b

Advanced programming model for code completion, summarization, and generation

code completioncode generation

googlegemma-2-27b-it

Cutting-edge text generation model text understanding, transformation, and code generation.

chatCode GenerationText-to-TextLanguage Generation

googlegemma-2-9b-it

Cutting-edge text generation model text understanding, transformation, and code generation.

chatCode GenerationText-to-TextLanguage Generation

googlegemma-7b

Cutting-edge text generation model text understanding, transformation, and code generation.

chatCode GenerationText-to-TextLanguage Generation