Skip to main content
NVIDIA
Explore
Models
Skills
Blueprints
GPUs
Docs
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

15 results for

Filters

  • Free Endpoint
    5
  • Partner Endpoint
    5
  • Download Available
    6
  • Enterprise Blueprint
    1
  • Image Generation
    2
  • Text-to-Image
    2
  • Image-to-Text
    1
  • Synthetic Data Generation
    1
  • Deepinfra
    3
  • Bitdeer
    2
  • GMI Cloud
    2
  • Together AI
    2
  • Eigen AI
    1
  • NVIDIA
    8
  • Qwen
    3
  • Google
    1
  • Microsoft
    1
  • Mistral AI
    1
  • AI Engineer
    5
  • Data Scientist
    4
  • Ml Engineer
    4
  • Application Developer
    3
  • Developer
    3
  • NVIDIA Isaac GR00T
    1
  • NVIDIA Omniverse
    1
  • AI And Machine Learning
    5
  • B200
    1
  • GB200
    1
  • TAO Toolkit
    4
  • NeMo Retriever
    1
  • Qwen
    Downloadable

    qwen-image

    Qwen-Image is a text-to-image foundation model with advanced multilingual text rendering.
    Model
    Text-to-Image
    1mo

    Runs the DEFT embed-then-mine workflow for VCN AOI iterations — embeds the gap-analysis target parquet, embeds a source pool, and mines nearest-neighbour source images for downstream augmentation. Use as the immediate next step after `tao-route-visual-cha
    Skill
    Developer
    263
    4d
    Items per page
    of 1 pages
    Qwen
    DownloadableFree Endpoint

    qwen3.5-122b-a10b

    122B MoE LLM (10B active) for coding, reasoning, multimodal chat. Agent-ready.
    Model
    tool calling
    10.33M
    3mo
    DGX Spark
    1 HR

    FLUX.1 Dreambooth LoRA Fine-tuning

    Fine-tune FLUX.1-dev 12B model using Dreambooth LoRA for custom image generation
    Playbook
    Image Generation
    8mo
    NVIDIA
    Free Endpoint

    cosmos3-nano

    Generates physics-aware videos from text prompts or an image prompt for physical AI development.
    Model
    autonomous vehicles
    1.79K
    16d
    Microsoft
    Downloadable

    TRELLIS

    MSFT TRELLIS is a 3D AI model that generates high-quality 3D assets from text or image inputs.
    Model
    text-to-3d
    3.65K
    9mo
    Robotics
    Enterprise

    Synthetic Manipulation Motion Generation for Robotics

    Generate exponentially large amounts of synthetic motion trajectories for robot manipulation from just a few human demonstrations.
    Blueprint
    synthetic data
    3mo
    Mistral AI
    DownloadableFree Endpoint

    mistral-small-4-119b-2603

    Hybrid MoE model unifying instruct, reasoning, and coding with multimodal input and 256k context
    Model
    code generation
    12.52M
    3mo
    Qwen
    DownloadableFree Endpoint

    qwen3.5-397b-a17b

    Next-gen Qwen 3.5 VLM (400B MoE) brings advanced vision, chat, RAG, and agentic capabilities.
    Model
    MoE
    13.15M
    4mo
    Google
    Free Endpoint

    paligemma

    Vision language model adept at comprehending text and visual inputs to produce informative responses
    Model
    image
    10.22K
    1y

    CLIP vision-language model for image-text retrieval, zero-shot classification, embedding extraction, ONNX export, and TensorRT deployment. Use when fine-tuning or training CLIP, running zero-shot classification, computing image embeddings, or deploying CL
    Skill
    AI Engineer
    263
    4d

    OCRNet for scene text recognition. Recognizes text content from cropped text-region images and supports CTC and attention-based decoders. Use when training, evaluating, exporting, pruning, quantizing, retraining, or running inference for a TAO OCRNet mode
    Skill
    Developer
    264
    4d

    OCDNet for scene text detection. Detects arbitrary-oriented text regions in natural images using a differentiable binarization approach. Use when training, evaluating, exporting, pruning, quantizing, retraining, or running inference for a TAO OCDNet model
    Skill
    AI Engineer
    260
    4d
    Stability AI
    Downloadable

    stable-diffusion-3.5-large

    Stable Diffusion 3.5 is a popular text-to-image generation model
    Model
    Text-to-Image
    10mo

    Use when the user wants to search, query, extract, transcribe, describe, quote, filter, or aggregate across documents — PDFs, scanned forms / images (`.jpg` `.png` `.tiff`), Office (`.docx` `.pptx`), text (`.html` `.txt`), audio (`.mp3` `.wav` `.m4a`), or
    Skill
    Developer
    501
    16d