Try NVIDIA NIM APIs

⌘KCtrl+K

Your Privacy Choices

Contact

Explore

⌘KCtrl+K

Search Results

Searching for: run-on-rtx

Sort By

Publisher

Use Case

NIM Type

Blueprint Type

GPU Types

Launchable

Sorting by Last Updated

microsoft TRELLIS

MSFT TRELLIS is a 3D AI model that generates high-quality 3D assets from text or image inputs.

text-to-3d Run-on-RTX image-to-3d

black-forest-labs FLUX.1-Kontext-dev

FLUX.1 Kontext is a multimodal model that enables in-context image generation and editing.

Image Generation Text-to-Image Run-on-RTX

nvidia nv-embedqa-e5-v5

English text embedding model for question-answering retrieval.

Embedding run-on-rtx Retrieval Augmented Generation Nemo retriever Text-to-Embedding

nvidia nv-yolox-page-elements-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Object Detection Data ingestion Chart Detection nemo retriever Table Detection run-on-rtx extraction

baidu paddleocr

Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.

Optical Character Recognition Table Extraction Optical Character Detection nemo retriever data ingestion run-on-rtx extraction

meta llama-3.1-8b-instruct

Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.

chat Code Generation Text-to-Text Language Generation Run-on-RTX

deepseek-ai deepseek-r1-distill-llama-8b

Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.

Distillation coding chat reasoning run-on-rtx math

nvidia nvclip

NV-CLIP is a multimodal embeddings model for image and text.

Computer vision multimodal embeddings text and image Run-on-rtx

nvidia parakeet-ctc-0.6b-asr

State-of-the-art accuracy and speed for English transcriptions.

ASR Streaming English Batch Speech-to-Text Fast NVIDIA NIM Run-on-RTX

nvidia studiovoice

Enhance speech by correcting common audio degradations to create studio quality speech output.

Nvidia Maxine Speech-to-speech Digital Human Run-on-RTX Speech Enhancement

black-forest-labs FLUX.1-dev

FLUX.1 is a state-of-the-art suite of image generation models

Image Generation Text-to-Image Run-on-RTX

black-forest-labs FLUX.1-schnell

FLUX.1-schnell is a distilled image generation model, producing high quality images at fast speeds

Image Generation Text-to-Image Run-on-RTX

nvidia nemoretriever-page-elements-v2

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Object Detection Chart Detection nemo retriever Table Detection data ingestion run-on-rtx