Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Filters (1)

Free Endpoint

Partner Endpoint

Download Available

Use Case

Code Generation

Drug Discovery

Image-to-Text

Retrieval Augmented Generation

Speech-to-Text

Inference Providers

Deepinfra

OpenRouter

Together AI

GMI Cloud

Bitdeer

Publisher

NVIDIA

Mistral AI

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

Items per page

of 1 pages

11mo

DownloadableFree Endpoint

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

11mo

DownloadableFree Endpoint

Leading reasoning and agentic AI accuracy model for PC and edge.

DownloadableFree Endpoint

An MOE LLM that follows instructions, completes requests, and generates creative text.

996K

11mo