NVIDIA
Explore
Models
Blueprints
GPUs
Docs

Deploy Models Now with NVIDIA NIM

Optimized inference for the world’s leading models
Free serverless APIs for developmentAccelerated by DGX Cloud
Self-Host on your GPU infrastructure
Continuous vulnerability fixes
ExploreModelsBlueprintsGPUsDocsForums

workstations

  • Run on RTX
  • Run on Spark

models

  • Reasoning
  • Vision
  • Visual Design
  • Retrieval
  • Speech
  • Biology
  • Simulation
  • Climate & Weather
  • Safety & Moderation

industries

  • Automotive
  • Gaming
  • Healthcare
  • Industrial
  • Robotics

Gaming

Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2025 NVIDIA Corporation

Digital Humans

Build dynamic game characters capable of natural language interactions

Run Anywhere

nvidiaparakeet-ctc-1.1b-asr

Record-setting accuracy and performance for English transcription.

Preview

nvidianemotron-mini-4b-instruct

Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling

The Latest Language Models

Build natural language understanding into development workflows

Run Anywhere

metallama-3.1-70b-instruct

Powers complex conversations with superior contextual understanding, reasoning and text generation.

Run Anywhere

mistralaimixtral-8x22b-instruct-v0.1

An MOE LLM that follows instructions, completes requests, and generates creative text.

Preview

googlegemma-2-27b-it

Cutting-edge text generation model text understanding, transformation, and code generation.