Try NVIDIA NIM APIs

98 results

DGX Spark

30 MIN

Text to Knowledge Graph

Transform unstructured text into interactive knowledge graphs with LLM inference and graph visualization

Playbook

GraphRAG

4mo

Qwen

qwen3-next-80b-a3b-instruct

Qwen3-Next Instruct blends hybrid attention, sparse MoE, and stability boosts for ultra-long context AI.

Model

chat

11.15M

5mo

Microsoft

phi-4-mini-flash-reasoning

Lightweight reasoning model for applications in latency-bound, memory- or compute-constrained environments

Model

edge

468K

7mo

ByteDance

seed-oss-36b-instruct

ByteDance open-source LLM with long-context, reasoning, and agentic intelligence.

Model

thinking budget

3.46M

6mo

IBM

granite-guardian-3.0-8b

Detects jailbreaking, bias, violence, profanity, sexual content, and unethical behavior

Model

Guardrail

442K

Qwen

qwen3-next-80b-a3b-thinking

80B-parameter AI model with hybrid reasoning, MoE architecture, and support for 119 languages.

Model

Reasoning

3.89M

5mo

Google

shieldgemma-9b

Guardrail model to ensure that responses from LLMs are appropriate and safe

Model

Guardrail

553K

Google

gemma-2-27b-it

Cutting-edge text generation model for text understanding, transformation, and code generation.

Model

chat

652K

9mo

Google

gemma-2-9b-it

Cutting-edge text generation model for text understanding, transformation, and code generation.

Model

chat

4.35M

9mo

Google

gemma-7b

Cutting-edge text generation model for text understanding, transformation, and code generation.

Model

chat

502K

9mo

llama-3.1-70b-instruct

Powers complex conversations with superior contextual understanding, reasoning and text generation.

Model

chat

7.2M

9mo

llama-3.2-1b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

Model

chat

15.61K

392K

9mo

llama-3.2-3b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

Model

chat

13.04K

640K

9mo

MiniMaxAI

minimax-m2.5

MiniMax M2.5 is a 230B-parameter text-to-text AI model excelling in coding, reasoning, and office tasks.

Model

coding

Mistral AI

mistral-7b-instruct-v0.2

This LLM follows instructions, completes requests, and generates creative text.

Model

chat

516K

9mo

Mistral AI

mistral-7b-instruct-v0.3

This LLM follows instructions, completes requests, and generates creative text.

Model

chat

819K

9mo

Google

gemma-3-1b-it

A lightweight, multilingual, advanced SLM text model for edge computing and resource-constrained applications

Model

Translation

4.57K

512K

9mo

OpenAI

gpt-oss-120b

Mixture of Experts (MoE) reasoning LLM (text-only) designed to fit within an 80GB GPU.

Model

text-to-text

36.1M

7mo

OpenAI

gpt-oss-20b

Smaller Mixture of Experts (MoE) text-only LLM for efficient AI reasoning and math

Model

text-to-text

7.97M

7mo

Qwen

qwen3.5-122b-a10b

122B MoE LLM (10B active) for coding, reasoning, multimodal chat. Agent-ready.

Model

tool calling

878K

MediaTek

breeze-7b-instruct

LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.

Model

chat

526K

9mo

DeepSeek AI

deepseek-v3.1

DeepSeek V3.1 Instruct is a hybrid AI model with fast reasoning, 128K context, and strong tool use.

Model

Reasoning

14.58M

6mo

DeepSeek AI

deepseek-v3.2

State-of-the-art 685B reasoning LLM with sparse attention, long context, and integrated agentic tools.

Model

long context

15.64M

2mo

Abacus.AI

dracarys-llama-3.1-70b-instruct

Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.

Model

chat

586K

9mo
