Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices.

10 models, filtered by label: coding

Minimaxai

Free Endpoint

minimax-m3

MiniMax M3 Preview is a multimodal MoE vision-language model with strong reasoning, coding, and tool-calling capabilities.

coding

3.29M

10d

Stepfun-ai

Downloadable, Free Endpoint

step-3.7-flash

A sparse MoE multimodal reasoning model suited to enterprise, agentic, and coding tasks.

coding

4.34M

25d

Mistral AI

Downloadable, Free Endpoint

mistral-medium-3.5-128b

A high-performing model for text generation, coding, and agentic use cases.

coding

3.76M

1mo

DeepSeek AI

Downloadable, Free Endpoint

deepseek-v4-flash

DeepSeek V4 Flash is a 284B MoE model with 1M-token context optimized for fast coding and agents.

MoE

15.16M

1mo

DeepSeek AI

Downloadable, Free Endpoint

deepseek-v4-pro

DeepSeek V4 scales to 1M-token context windows with an efficient MoE architecture for coding tasks.

MoE

7.5M

1mo

Z.ai

Downloadable, Free Endpoint

glm-5.1

GLM-5.1 is a flagship LLM for agentic workflows, coding, and long-horizon reasoning tasks.

Agentic AI

27.59M

2mo

Minimaxai

Downloadable, Free Endpoint

minimax-m2.7

MiniMax M2.7 is a 230B-parameter text-to-text AI model excelling in coding, reasoning, and office tasks.

reasoning

14.36M

2mo

Google

Downloadable, Free Endpoint

gemma-4-31b-it

Dense 31B model delivering frontier reasoning for coding, agentic workflows, and fine-tuning.

reasoning

5.49M

2mo

Stepfun-ai

Free Endpoint

step-3.5-flash

200B open-source reasoning engine with sparse MoE powering frontier agentic AI.

Agentic

11.71M

4mo

Sarvamai

Downloadable, Free Endpoint

sarvam-m

Multilingual, hybrid-reasoning model optimized for Indian-language tasks, programming, and mathematical reasoning.

coding

236K

11mo