Skip to main content
NVIDIA
Explore
Models
Skills
Blueprints
GPUs
Docs
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

15 results for

Filters

  • Free Endpoint
    2
  • Partner Endpoint
    1
  • Download Available
    1
  • Launchable
    5
  • Developer Example
    2
  • Enterprise Blueprint
    1
  • Bitdeer
    1
  • Deepinfra
    1
  • Eigen AI
    1
  • GMI Cloud
    1
  • Together AI
    1
  • NVIDIA
    12
  • Cyborg
    1
  • Qwen
    1
  • Viavi
    1
  • AI Engineer
    6
  • Developer
    6
  • Ml Engineer
    5
  • Application Developer
    2
  • Data Scientist
    2
  • NVIDIA AI
    4
  • AI And Machine Learning
    6
  • Accelerated Computing
    1
  • RAG
    3
  • MONAI
    2
  • TAO
    1
  • cuPyNumeric
    1
  • NVIDIA RAG Blueprint — deploy, configure, troubleshoot, and manage. Handles any RAG action: deploy, install, start, enable, disable, toggle, change, configure, troubleshoot, debug, fix, shutdown, stop, or tear down any RAG feature or service (Agentic RAG,
    Skill
    Developer
    622
    17d

    Performance benchmarking for a deployed NVIDIA RAG Blueprint server: profiling pass + aiperf load test driven by a single YAML config. Not for accuracy / RAGAS scoring (use rag-eval) or for deploying / repairing services (use rag-blueprint).
    Skill
    Developer
    462
    17d

    Filesystem RAG benchmarks: corpus/, train.json, evaluate_rag.py (RAGAS quality). Not for prod monitoring, latency/throughput benchmarking (use rag-perf), or evals outside this repo layout.
    Skill
    Developer
    476
    17d
    Media
    LaunchableDeveloper Example

    Streaming Data to RAG

    Sensor-captured radio enables real-time awareness, AI-driven analytics for actionable, searchable insights.
    Blueprint
    NVIDIA AI
    3mo
    Items per page
    of 1 pages
    Cyborg
    Deprecation in 30dLaunchable

    Cyborg Enterprise RAG

    Securely extract, embed, and index multimodal data with encryption in-use for fast, accurate semantic search.
    Blueprint
    NIM
    3mo
    DGX Spark
    30 MIN

    RAG Application in AI Workbench

    Install and use AI Workbench to clone and run a reproducible RAG application
    Playbook
    DGX
    8mo
    General
    LaunchableEnterprise

    Build an Enterprise RAG Pipeline Blueprint

    Power fast, accurate semantic search across multimodal enterprise data with NVIDIA’s RAG Blueprint—built on NeMo Retriever and Nemotron models—to connect your agents to trusted, authoritative sources of knowledge.
    Blueprint
    NVIDIA AI
    3mo
    Telecom
    Deprecation in 5dLaunchable

    Intent-Driven RAN Energy Efficiency Blueprint

    Build a closed-loop agentic workflow for energy optimization.
    Blueprint
    NVIDIA AI
    3mo
    NVIDIA
    Free Endpoint

    nemotron-mini-4b-instruct

    Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling
    Model
    Chat
    1.53M
    1y
    Qwen
    DownloadableFree Endpoint

    qwen3.5-397b-a17b

    Next-gen Qwen 3.5 VLM (400B MoE) brings advanced vision, chat, RAG, and agentic capabilities.
    Model
    MoE
    13.15M
    4mo
    Telecom
    LaunchableDeveloper Example

    AI Agent for Telecom Network Configuration Planning

    Automate and optimize the configuration of radio access network (RAN) parameters using agentic AI and a large language model (LLM)-driven framework.
    Blueprint
    NVIDIA AI
    3mo

    Load a sharded, on-disk dataset (sharded .npy, Parquet/Arrow, raw binary, sharded HDF5, custom layouts) into a distributed cuPyNumeric ndarray via a manual partition + leaf @task launch with CPU/OMP/GPU variants. Use when no single-call loader fits, inclu
    Skill
    Developer
    431
    15d

    Used for extracting selected metadata from one DICOM file and flagging standard-tag PHI presence. Not for anonymization or clinical use.
    Skill
    Developer
    406
    15d

    Used for command-shape or live NV-Reason-CXR chest X-ray reasoning smoke tests. Not for diagnosis or clinical reporting.
    Skill
    Developer
    393
    15d

    Multi-step video annotation pipeline that turns raw videos into Chain-of-Thought training data — multi-level captions, structured descriptions, and QA pairs (MCQ, binary, open-ended) with reasoning traces, via VLM/LLM distillation. Use when the user wants
    Skill
    TAO
    178
    3d