Explore

Models

Skills

Blueprints

GPUs

Docs

Your Privacy Choices

Contact

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA Launch from Hugging FaceBeta

Filters (3)

Free Endpoint

Partner Endpoint

Download Available

Use Case

Drug Discovery

Image-to-Text

Retrieval Augmented Generation

Speech-to-Text

Code Generation

Inference Providers

Deepinfra

OpenRouter

Together AI

GMI Cloud

Bitdeer

Publisher

Google

Mistral AI

DeepSeek AI

Stepfun ai

Z.ai

NIM Container GPUs

B200

H200

L40S

H100 80GB HBM3

A100 SXM4 80GB

Labels (3)

coding

reasoning

Agentic

5 models

Sort By

Z.ai

DownloadableFree Endpoint

glm-5.2

GLM-5.2 is a flagship LLM for agentic workflows, coding, and long-horizon reasoning tasks.

Agentic AI

Items per page

of 1 pages

Mistral AI

DownloadableFree Endpoint

mistral-medium-3.5-128b

A high performing model for text generation, coding and agentic use cases

coding

2mo

DeepSeek AI

DownloadableFree Endpoint

deepseek-v4-pro

DeepSeek V4 scales to 1M-token context windows with efficient MoE architecture for coding tasks.

Moe

2mo

Google

DownloadableFree Endpoint

gemma-4-31b-it

Dense 31B model delivering frontier reasoning for coding, agentic workflows, and fine-tuning.

reasoning

3mo

Stepfun-ai

Free Endpoint

step-3.5-flash

200B open-source reasoning engine with sparse MoE powering frontier agentic AI.

Agentic

10M

5mo