Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Optimized by NVIDIA
Launch from Hugging Face
Beta
Filters (2)
6 models
Sort By
dateCreated:DESC
Most Recent
Mistral AI
Downloadable
mistral-small-4-119b-2603
Hybrid MoE model unifying instruct, reasoning, and coding with multimodal input and 256k context
code generation
+2
7.15M
1mo
Qwen
Downloadable
qwen2.5-coder-32b-instruct
Advanced LLM for code generation, reasoning, and fixing across popular programming languages.
code completion
+2
2.64M
9mo
Qwen
Deprecated
Free Endpoint
qwen2.5-coder-7b-instruct
Powerful mid-size code model with a 32K context length, excelling in coding in multiple languages.
code completion
+2
250K
10mo
Abacus.AI
Free Endpoint
dracarys-llama-3.1-70b-instruct
Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.
Code Generation
+1
252K
10mo
Mistral AI
Deprecated
Free Endpoint
mamba-codestral-7b-v0.1
Model for writing and interacting with code across a wide range of programming languages and tasks.
code completion
+2
188K
10mo
BigCode
Deprecated
Downloadable
starcoder2-7b
Advanced programming model for code completion, summarization, and generation
code completion
+2
6.78K
1y
Items per page
24
1
1
of 1 pages