Try NVIDIA NIM APIs

6 results

Google

Downloadable, Free Endpoint

diffusiongemma-26b-a4b-it

Diffusion-based 26B-parameter LLM enabling parallel token generation for real-time text apps

Model

diffusion-llm

97.31K

Google

Free Endpoint

gemma-2-2b-it

Advanced small generative language model for edge applications

Model

Chat

1.42M

Google

Free Endpoint

gemma-3n-e2b-it

An edge-computing AI model that accepts text, audio, and image input, ideal for resource-constrained environments

Model

language generation

33.75M

11mo

Google

Free Endpoint

gemma-3n-e4b-it

An edge-computing AI model that accepts text, audio, and image input, ideal for resource-constrained environments

Model

language generation

3.79M

11mo

Google

Downloadable, Free Endpoint

gemma-4-31b-it

Dense 31B model delivering frontier reasoning for coding, agentic workflows, and fine-tuning.

Model

reasoning

5.49M

2mo

OpenAI

Downloadable, Free Endpoint

gpt-oss-120b

Mixture of Experts (MoE) reasoning LLM (text-only) designed to fit within an 80GB GPU.

Model

reasoning

57.12M

10mo