Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
9 models
Mistral AI
Downloadable
mistral-small-4-119b-2603
Hybrid MoE model unifying instruct, reasoning, and coding with multimodal input and 256k context
chat
+4
26
Today
Qwen
Downloadable
qwen2.5-coder-32b-instruct
Advanced LLM for code generation, reasoning, and fixing across popular programming languages.
chat
+3
6.03M
8mo
Qwen
Free Endpoint
qwen2.5-coder-7b-instruct
Powerful mid-size code model with a 32K context length that excels at coding in multiple languages.
chat
+3
577K
9mo
Abacus.AI
Free Endpoint
dracarys-llama-3.1-70b-instruct
Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.
chat
+2
630K
9mo
Mistral AI
Free Endpoint
mamba-codestral-7b-v0.1
Model for writing and interacting with code across a wide range of programming languages and tasks.
chat
+3
553K
9mo
BigCode
Downloadable
starcoder2-7b
Advanced programming model for code completion, summarization, and generation
code completion
+2
11.96K
1y
Google
Free Endpoint
gemma-2-27b-it
Cutting-edge text generation model for text understanding, transformation, and code generation.
chat
+4
749K
9mo
Google
Downloadable
gemma-2-9b-it
Cutting-edge text generation model for text understanding, transformation, and code generation.
chat
+4
4.63M
9mo
Google
Free Endpoint
gemma-7b
Cutting-edge text generation model for text understanding, transformation, and code generation.
chat
+4
569K
10mo