Try NVIDIA NIM APIs

Copyright © 2026 NVIDIA Corporation

27 results for

Filters

Free Endpoint

1

Partner Endpoint

5

Download Available

11

Use Case

Image Generation

4

Text-to-Image

4

Code Generation

1

Object Detection

1

Retrieval Augmented Generation

1

Inference Providers

Deepinfra

4

OpenRouter

2

CoreWeave

1

GMI Cloud

1

Together AI

1

Publisher

NVIDIA

20

Black Forest Labs

4

Baidu

1

Meta

1

Microsoft

1

Audience

AI Engineer

6

ML Engineer

6

DevOps Engineer

5

Platform Engineer

5

Developer

4

Domain

Infrastructure

4

AI And Machine Learning

2

NIM Container GPUs

A100 PG509 200

1

A100 SXM4 80GB

1

A10G

1

B200

1

H100 80GB HBM3

1

Library

TAO

2

Brev

1

DGX Cloud

1

Megatron Core

1

TAO Toolkit

1

Brev managed GPU instances with Docker support. Use when running TAO training, evaluation, or inference on Brev GPU instances, managing Brev deployments, or dispatching TAO jobs through the Brev CLI. Trigger phrases include "run on Brev", "Brev GPU instan

874

22d

Remote SLURM GPU cluster execution over SSH with sbatch/srun, Pyxis/Enroot containers, and Lustre-backed results. Use when running TAO training/eval/inference jobs on an on-prem or DGX SLURM cluster. Trigger phrases include "run on SLURM", "submit sbatch"

874

22d

How to launch distributed Megatron-LM training jobs on a SLURM cluster. Covers a minimal sbatch skeleton, environment-variable setup for torch.distributed.run, CUDA_DEVICE_MAX_CONNECTIONS rules across hardware and parallelism modes, container conventions,

1K

1mo

Kubernetes execution platform — submits TAO container jobs as single-pod k8s Jobs with NVIDIA GPU scheduling. Use when running on EKS / GKE / AKS / on-prem clusters with the NVIDIA GPU Operator installed, or when integrating TAO into an existing k8s-nativ

879

22d

DGX Cloud Lepton managed GPU compute platform with run/status/cancel interface. Use when submitting TAO jobs to DGX Cloud, dispatching training/eval/inference to Lepton GPU resources, or managing Lepton workspace deployments. Trigger phrases include "run

868

20d

Local or remote Docker execution for TAO SDK job containers using a Docker daemon with NVIDIA GPU runtime. Use when running TAO jobs on the current machine, a directly attached Docker host, or a remote GPU box exposed through DOCKER_HOST. Trigger phrases

886

20d

RTX Workstation

5 MIN

How to Get Started With Large Language Models on NVIDIA RTX PCs

Learn about using LLMs locally on PCs and workstations with Ollama, AnythingLLM, and LM Studio.

1mo

RTX Workstation

13 MIN

How to Get Started With Visual Generative AI on NVIDIA RTX PCs

Learn how to run advanced image and video generation locally with ComfyUI and LTX-2 on RTX PCs.

1mo

RTX Workstation

16 MIN

Run OpenClaw For Free On NVIDIA RTX GPUs & DGX Spark

Learn how to set up and host the popular AI agent using local inference apps optimized for RTX.

1mo

Downloadable

nv-embedqa-e5-v5

English text embedding model for question-answering retrieval.

16M

11mo

Black-forest-labs

Downloadable

FLUX.1-dev

FLUX.1 is a state-of-the-art suite of image generation models

246K

1y

Black-forest-labs

Downloadable

FLUX.1-Kontext-dev

FLUX.1 Kontext is a multimodal model that enables in-context image generation and editing.

4K

10mo

Black-forest-labs

Downloadable

FLUX.1-schnell

FLUX.1-schnell is a distilled image generation model that produces high-quality images at fast speeds.

253K

1y

Black-forest-labs

Downloadable

flux.2-klein-4b

FLUX.2-klein-4B is a distilled image generation and editing model that produces outputs at lightning speed.

271K

3mo

Downloadable

nemoretriever-page-elements-v2

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Object Detection

243K

1y

Downloadable

nv-yolox-page-elements-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Object Detection

191

1y

Downloadable

parakeet-ctc-0.6b-asr

State-of-the-art accuracy and speed for English transcriptions.

1K

1y

Downloadable

TRELLIS

MSFT TRELLIS is a 3D AI model that generates high-quality 3D assets from text or image inputs.

4K

10mo

Downloadable

Free Endpoint

llama-3.1-8b-instruct

Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.

22M

1y

Downloadable

paddleocr

Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.

Optical Character Recognition

201K

1y

RTX Workstation

13 MIN

How to Build a Multi-GPU AI PC - A Practical Guide

Many people explore local generative AI for privacy and to avoid token limits, but newer models require significant memory and compute—leading some to adopt multi-GPU setups.

1mo

RTX Workstation

8 MIN

How to Fine-Tune an LLM on NVIDIA GPUs With Unsloth

Fine-tune popular AI models faster in Unsloth with NVIDIA RTX AI PCs, RTX PRO workstations, and DGX Spark—plus explore the new Nemotron Nano 3 family of open models.

1mo

RTX Workstation

18 MIN

NVIDIA Video Generation Guide

Learn how to create videos using LTX-2 in ComfyUI, accelerated on RTX. Learn how to take control of visual generative AI, creating high resolution video on RTX.

1mo

RTX Workstation

30 MIN

vLLM for Inference

Install and use vLLM on NVIDIA RTX PRO 6000.

24d