Try NVIDIA NIM APIs

142 results

NVIDIA

baichuan2-13b-chat

Supports Chinese and English chat, coding, math, instruction following, and quiz solving.

Model

Chinese Language Generation

473K

9mo

BAAI

bge-m3

Embedding model for text retrieval tasks, excelling in dense, multi-vector, and sparse retrieval.

Model

Embeddings

2.07M

10mo

NVIDIA

Launchable

Biomedical AI-Q Research Agent Blueprint

Build advanced AI agents within the biomedical domain using the AI-Q Blueprint and the BioNeMo Virtual Screening Blueprint

Blueprint

Launchable

MediaTek

breeze-7b-instruct

LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.

Model

chat

475K

9mo

NVIDIA

Enterprise

Build A Generative Protein Binder Design Pipeline

This blueprint shows how generative AI and accelerated NIM microservices can design protein binders smarter and faster.

Blueprint

NVIDIA BioNeMo

NVIDIA

Launchable

Enterprise

Build a Video Search and Summarization (VSS) Agent

Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A

Blueprint

vision

NVIDIA

Launchable

Enterprise

Build an AI Agent for Enterprise Research

Build a custom enterprise research assistant powered by state-of-the-art models that process and synthesize multimodal data, enabling reasoning, planning, and refinement to generate comprehensive reports.

Blueprint

NIM

NVIDIA

Launchable

Build an AI Virtual Assistant

Create intelligent virtual assistants for customer service across every industry

Blueprint

Customer Service

NVIDIA

Launchable

Enterprise

Build an Enterprise RAG Pipeline Blueprint

Power fast, accurate semantic search across multimodal enterprise data with NVIDIA’s RAG Blueprint—built on NeMo Retriever and Nemotron models—to connect your agents to trusted, authoritative sources of knowledge.

Blueprint

NIM

University at Buffalo

cached

Context-aware chart extraction that detects 18 classes of basic chart elements, excluding plot elements.

Model

nemo retriever

738

NVIDIA

canary-1b-asr

Multi-lingual model supporting speech-to-text recognition and translation.

Model

Automatic Speech Recognition

1.58K

11mo

THUDM

chatglm3-6b

Supports Chinese and English languages to handle tasks including chatbot, content generation, coding, and translation.

Model

Text Translation

511K

7mo

NVIDIA

cosmos-nemotron-34b

Multi-modal vision-language model that understands text, images, and video and generates informative responses.

Model

VLM

NVIDIA

cosmos-reason1-7b

Reasoning vision language model (VLM) for physical AI and robotics.

Model

video understanding

15.93K

6mo

NVIDIA

cosmos-reason2-8b

Vision language model that excels in understanding the physical world using structured reasoning on videos or images.

Model

video understanding

194K

2mo

Cyborg

Launchable

Cyborg Enterprise RAG

Securely extract, embed, and index multimodal data with encryption in-use for fast, accurate semantic search.

Blueprint

NIM

DeepSeek AI

deepseek-r1-distill-llama-8b

Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.

Model

Distillation

4.14M

8mo

DeepSeek AI

deepseek-r1-distill-qwen-14b

Distilled version of Qwen 2.5 14B using reasoning data generated by DeepSeek R1 for enhanced performance.

Model

coding

2.07K

3.78M

9mo

DeepSeek AI

deepseek-r1-distill-qwen-32b

Distilled version of Qwen 2.5 32B using reasoning data generated by DeepSeek R1 for enhanced performance.

Model

coding

2.44K

4.17M

9mo

DeepSeek AI

deepseek-r1-distill-qwen-7b

Distilled version of Qwen 2.5 7B using reasoning data generated by DeepSeek R1 for enhanced performance.

Model

coding

2.21K

4.11M

9mo

DeepSeek AI

deepseek-v3.1

DeepSeek V3.1 Instruct is a hybrid AI model with fast reasoning, 128K context, and strong tool use.

Model

Reasoning

14.26M

6mo

DeepSeek AI

deepseek-v3.2

State-of-the-art 685B reasoning LLM with sparse attention, long context, and integrated agentic tools.

Model

long context

14.82M

2mo

Mistral AI

devstral-2-123b-instruct-2512

State-of-the-art open code model with deep reasoning, 256k context, and unmatched efficiency.

Model

coding

5.09M

2mo

Abacus.AI

dracarys-llama-3.1-70b-instruct

Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.

Model

chat

534K

9mo
