Skip to main content
NVIDIA
Explore
Models
Skills
Blueprints
GPUs
Docs
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

27 results for

Filters

  • Free Endpoint
    1
  • Partner Endpoint
    5
  • Download Available
    11
  • Image Generation
    4
  • Text-to-Image
    4
  • Code Generation
    1
  • Object Detection
    1
  • Retrieval Augmented Generation
    1
  • Deepinfra
    4
  • Together AI
    2
  • CoreWeave
    1
  • GMI Cloud
    1
  • OpenRouter
    1
  • NVIDIA
    20
  • Black forest labs
    4
  • Baidu
    1
  • Meta
    1
  • Microsoft
    1
  • AI Engineer
    6
  • Ml Engineer
    6
  • DevOps Engineer
    5
  • Platform Engineer
    5
  • Developer
    4
  • Infrastructure
    4
  • AI And Machine Learning
    2
  • A100 PG509 200
    2
  • A100 SXM4 80GB
    2
  • A10G
    2
  • B200
    2
  • H100 80GB HBM3
    2
  • TAO
    2
  • Brev
    1
  • DGX Cloud
    1
  • Megatron Core
    1
  • TAO Toolkit
    1
  • Brev managed GPU instances with Docker support. Use when running TAO training, evaluation, or inference on Brev GPU instances, managing Brev deployments, or dispatching TAO jobs through the Brev CLI. Trigger phrases include "run on Brev", "Brev GPU instan
    Skill
    Developer
    407
    9d

    Remote SLURM GPU cluster execution over SSH with sbatch/srun, Pyxis/Enroot containers, and Lustre-backed results. Use when running TAO training/eval/inference jobs on an on-prem or DGX SLURM cluster. Trigger phrases include "run on SLURM", "submit sbatch"
    Skill
    TAO
    404
    9d

    How to launch distributed Megatron-LM training jobs on a SLURM cluster. Covers a minimal sbatch skeleton, environment-variable setup for torch.distributed.run, CUDA_DEVICE_MAX_CONNECTIONS rules across hardware and parallelism modes, container conventions,
    Skill
    Developer
    627
    21d

    Kubernetes execution platform — submits TAO container jobs as single-pod k8s Jobs with NVIDIA GPU scheduling. Use when running on EKS / GKE / AKS / on-prem clusters with the NVIDIA GPU Operator installed, or when integrating TAO into an existing k8s-nativ
    Skill
    Developer
    407
    9d
    Items per page
    of 2 pages

    DGX Cloud Lepton managed GPU compute platform with run/status/cancel interface. Use when submitting TAO jobs to DGX Cloud, dispatching training/eval/inference to Lepton GPU resources, or managing Lepton workspace deployments. Trigger phrases include "run
    Skill
    AI Engineer
    401
    7d

    Local Docker execution for TAO SDK job containers using the host Docker daemon and NVIDIA GPU runtime. Use when running TAO jobs on the current machine or a directly attached Docker host. Trigger phrases include "run locally", "local Docker", "use my GPU"
    Skill
    Developer
    411
    7d
    RTX Workstation
    5 MIN

    How to Get Started With Large Language Models on NVIDIA RTX PCs

    Learn about using LLMs locally on PCs and workstations with Ollama, AnythingLLM, and LM Studio.
    Playbook
    LLMs
    24d
    RTX Workstation
    13 MIN

    How to Get Started With Visual Generative AI on NVIDIA RTX PCs

    Learn how to run advanced image and video generation locally with ComfyUI and LTX-2 on RTX PCs.
    Playbook
    Gen AI
    24d
    RTX Workstation
    16 MIN

    Run OpenClaw For Free On NVIDIA RTX GPUs & DGX Spark

    Learn how to set up and host the popular AI agent using local inference apps optimized for RTX.
    Playbook
    DGX Spark
    21d
    NVIDIA
    Downloadable

    nv-embedqa-e5-v5

    English text embedding model for question-answering retrieval.
    Model
    Embedding
    16.12M
    11mo
    Black-forest-labs
    Downloadable

    FLUX.1-dev

    FLUX.1 is a state-of-the-art suite of image generation models
    Model
    Text-to-Image
    246K
    1y
    Black-forest-labs
    Downloadable

    FLUX.1-Kontext-dev

    FLUX.1 Kontext is a multimodal model that enables in-context image generation and editing.
    Model
    Text-to-Image
    3.54K
    10mo
    Black-forest-labs
    Downloadable

    FLUX.1-schnell

    FLUX.1-schnell is a distilled image generation model, producing high quality images at fast speeds
    Model
    Text-to-Image
    253K
    1y
    Black-forest-labs
    Downloadable

    flux.2-klein-4b

    FLUX.2-klein-4B is a distilled image generation and editing model, producing outputs at lighting speed
    Model
    image editing
    271K
    3mo
    NVIDIA
    Downloadable

    nemoretriever-page-elements-v2

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Model
    Object Detection
    243K
    1y
    NVIDIA
    Downloadable

    nv-yolox-page-elements-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Model
    Object Detection
    191
    11mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-asr

    State-of-the-art accuracy and speed for English transcriptions.
    Model
    ASR
    1.13K
    1y
    Microsoft
    Downloadable

    TRELLIS

    MSFT TRELLIS is a 3D AI model that generates high-quality 3D assets from text or image inputs.
    Model
    text-to-3d
    3.65K
    9mo
    Baidu
    Downloadable

    paddleocr

    Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.
    Model
    Optical Character Recognition
    201K
    11mo
    Meta
    DownloadableFree Endpoint

    llama-3.1-8b-instruct

    Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.
    Model
    Chat
    25.09M
    11mo
    RTX Workstation
    13 MIN

    How to Build a Multi-GPU AI PC - A Practical Guide

    Many people explore local generative AI for privacy and to avoid token limits, but newer models require significant memory and compute—leading some to adopt multi-GPU setups.
    Playbook
    ComfyUI
    21d
    RTX Workstation
    8 MIN

    How to Fine-Tune an LLM on NVIDIA GPUs With Unsloth

    Fine-tune popular AI models faster in Unsloth with NVIDIA RTX AI PCs, RTX PRO workstations, and DGX Spark—plus explore the new Nemotron Nano 3 family of open models.
    Playbook
    Fine-Tuning
    21d
    RTX Workstation
    18 MIN

    NVIDIA Video Generation Guide

    Learn how to create videos using LTX-2 in ComfyUI, accelerated on RTX. Learn how to take control of visual generative AI, creating high resolution video on RTX.
    Playbook
    ComfyUI
    21d
    RTX Workstation
    30 MIN

    vLLM for Inference

    Install and use vLLM on NVIDIA RTX Pro 6000
    Playbook
    vLLM
    11d