Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Labels (2): reasoning, Advanced Reasoning

7 models

DeepSeek AI

Free Endpoint

deepseek-v3.1-terminus

DeepSeek-V3.1: hybrid inference LLM with Think/Non-Think modes, stronger agents, 128K context, strict function calling.

tool calling

12.29M

6mo

NVIDIA

Downloadable

llama-3.3-nemotron-super-49b-v1.5

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

math

2.54M

8mo

Moonshotai

Free Endpoint

kimi-k2-instruct

State-of-the-art open mixture-of-experts model with strong reasoning, coding, and agentic capabilities.

coding

21.04M

9mo

Qwen

Deprecated, Free Endpoint

qwq-32b

Powerful reasoning model that can achieve significantly enhanced performance on downstream tasks, especially hard problems.

coding

1.13M

9mo

NVIDIA

Downloadable

llama-3.3-nemotron-super-49b-v1

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

math

1.39M

8mo

Mistral AI

Downloadable

mixtral-8x22b-instruct-v0.1

An MoE (mixture-of-experts) LLM that follows instructions, completes requests, and generates creative text.

Advanced Reasoning

2.12M

9mo

Mistral AI

Downloadable

mixtral-8x7b-instruct-v0.1

An MoE (mixture-of-experts) LLM that follows instructions, completes requests, and generates creative text.

Advanced Reasoning

353K

9mo