Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Optimized by NVIDIA
Launch from Hugging Face
Beta
Filters (1)
8 models
Sort By
dateCreated:DESC
Most Recent
Mistral AI
Downloadable
mistral-small-4-119b-2603
Hybrid MoE model unifying instruct, reasoning, and coding with multimodal input and 256k context
code generation
+2
7.15M
1mo
Mistral AI
Free Endpoint
devstral-2-123b-instruct-2512
State-of-the-art open code model with deep reasoning, 256k context, and unmatched efficiency.
coding
+3
3.62M
4mo
Mistral AI
Deprecated
Downloadable
mistral-small-24b-instruct
Latency-optimized language model excelling in code, math, general knowledge, and instruction-following.
code
+3
198K
9mo
Qwen
Downloadable
qwen2.5-coder-32b-instruct
Advanced LLM for code generation, reasoning, and fixing across popular programming languages.
code completion
+2
2.64M
9mo
Qwen
Deprecated
Free Endpoint
qwen2.5-coder-7b-instruct
Powerful mid-size code model with a 32K context length, excelling in coding in multiple languages.
code completion
+2
250K
10mo
Abacus.AI
Free Endpoint
dracarys-llama-3.1-70b-instruct
Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.
Code Generation
+1
252K
10mo
Mistral AI
Deprecated
Free Endpoint
mamba-codestral-7b-v0.1
Model for writing and interacting with code across a wide range of programming languages and tasks.
code completion
+2
188K
10mo
BigCode
Deprecated
Downloadable
starcoder2-7b
Advanced programming model for code completion, summarization, and generation
code completion
+2
6.78K
1y
Items per page
24
1
1
of 1 pages