Try NVIDIA NIM APIs

nvidia riva-translate-4b-instruct

Translation model in 12 languages with few-shots example prompts capability.

text translation chat nvidia

nvidia riva-translate-1_6b

Enable smooth global interactions in 36 languages.

text translation neural machine translation nvidia nim nvidia

google gemma-3n-e4b-it

An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments

language generation speech recognition visual qa chat google

google gemma-3n-e2b-it

An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments

language generation speech recognition visual qa chat google

mistralai mistral-nemotron

Built for agentic workflows, this model excels in coding, instruction following, and function calling

language generation chat instruction following function calling mistralai

nvidia llama-3.1-nemotron-nano-vl-8b-v1

Multi-modal vision-language model that understands text/img and creates informative responses

doc intelligence multiple image understanding ocr nvidia

speakleash bielik-11b-v2.3-instruct

State-of-the-art model for Polish language processing tasks such as text generation, Q&A, and chatbots.

polish sovereign ai chat chatbots summarization speakleash

ibm granite-3.3-8b-instruct

Small language model fine-tuned for improved reasoning, coding, and instruction-following

coding reasoning instruction following ibm

black-forest-labs FLUX.1-schnell

FLUX.1-schnell is a distilled image generation model, producing high quality images at fast speeds

image generation text-to-image run-on-rtx black-forest-labs

utter-project eurollm-9b-instruct

State-of-the-art, multilingual model tailored to all 24 official European Union languages.

sovereign ai chat chat text-to-text multilingual european regional language generation utter-project

gotocompany gemma-2-9b-cpt-sahabatai-instruct

SOTA LLM pre-trained for instruction following and proficiency in Indonesian language and its dialects.

sovereign ai chat indonesian chat text-to-text regional language generation gotocompany

mistralai mistral-small-3.1-24b-instruct-2503

Efficient multimodal model excelling at multilingual tasks, image understanding, and fast-responses

language generation multimodal image understanding mistralai

mistralai mistral-medium-3-instruct

Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.

language generation image-to-text multimodal visual question answering mistralai

nvidia parakeet-1.1b-rnnt-multilingual-asr

High accuracy and optimized performance for transcription in 25 languages

asr streaming speech-to-text multilingual nvidia nim nvidia

black-forest-labs FLUX.1-dev

FLUX.1 is a state-of-the-art suite of image generation models

image generation text-to-image run-on-rtx black-forest-labs

meta llama-4-maverick-17b-128e-instruct

A general purpose multimodal, multilingual 128 MoE model with 17B parameters.

language generation image-to-text vision assistant visual question answering meta

meta llama-4-scout-17b-16e-instruct

A multimodal, multilingual 16 MoE model with 17B parameters.

language generation image-to-text vision assistant visual question answering meta

nvidia magpie-tts-multilingual

Natural and expressive voices in multiple languages. For voice agents and brand ambassadors.

tts text-to-speech nvidia nim nvidia riva multilingual nvidia

google gemma-3-27b-it

Cutting-edge open multimodal model exceling in high-quality reasoning from images.

vision assistant visual question answering language generation image-to-text google

google gemma-3-1b-it

A lightweight, multilingual, advanced SLM text model for edge computing, resource constraint applications

translation chat chat text-to-text language generation google

nvidia nemoretriever-parse

Cutting-edge vision-language model exceling in retrieving text and metadata from images.

optical character recognition nemo retriever data ingestion table extraction supported language - english nvidia

microsoft phi-4-mini-instruct

Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments

code generation chat text-to-text language generation microsoft

microsoft phi-4-multimodal-instruct

Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.

speech recognition visual qa language generation image-to-text chart and table understanding microsoft

arc evo2-40b

Evo 2 is a biological foundation model that is able to integrate information over long genomic sequences while retaining sensitivity to single-nucleotide changes.

dna generation biology nim bionemo drug discovery arc

mistralai mistral-small-24b-instruct

Latency-optimized language model excelling in code, math, general knowledge, and instruction-following.

code chat reasoning agent-centric multilingual mistralai

tiiuae falcon3-7b-instruct

Instruction tuned LLM achieving SoTA performance on reasoning, math and general knowledge capabilities

coding chat code generation language generation improved reasoning math scientific knowledge tiiuae

igenius italia_10b_instruct_16k

Multilingual LLM with emphasis on European languages supporting regulated use cases including financial services, government, heavy industry

heavy industry government chat highly regulated use case support financial services igenius

qwen qwen2.5-7b-instruct

Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.

chinese language generation chat text-to-text large language models qwen

nvidia genmol

Fragment-Based Molecular Generation by Discrete Diffusion.

chemistry nim bionemo molecule generation drug discovery nvidia

nvidia cosmos-nemotron-34b

Multi-modal vision-language model that understands text/img/video and creates informative responses

vlm vision language model image caption image to text nvidia

qwen qwen2.5-coder-32b-instruct

Advanced LLM for code generation, reasoning, and fixing across popular programming languages.

code completion code generation chat text-to-code qwen

qwen qwen2.5-coder-7b-instruct

Powerful mid-size code model with a 32K context length, excelling in coding in multiple languages.

code completion code generation chat text-to-code qwen

writer palmyra-creative-122b

Powerful LLM designed for creative thinking and writing.

content generation chat chat text-to-text writer

nvidia nemotron-4-mini-hindi-4b-instruct

A bilingual Hindi-English SLM for on-device inference, tailored specifically for Hindi Language.

indic chat chat text-to-text language generation nvidia

nvidia llama-3.1-nemotron-70b-instruct

Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA in order to improve the helpfulness of LLM generated responses.

chat code generation chat text-to-text language generation nvidia

zyphra zamba2-7b-instruct

Efficient hybrid state-space model designed for conversational and reasoning tasks.

chat chat language generation text-to-text zyphra

institute-of-science-tokyo llama-3.1-swallow-70b-instruct-v0.1

Sovereign AI model trained on Japanese language that understands regional nuances.

sovereign ai large language model chat regional language generation institute-of-science-tokyo

institute-of-science-tokyo llama-3.1-swallow-8b-instruct-v0.1

Sovereign AI model trained on Japanese language that understands regional nuances.

sovereign ai large language model chat chat regional language generation institute-of-science-tokyo

nvidia mistral-nemo-minitron-8b-8k-instruct

State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.

small language model chat code generation chat text-to-text language generation nvidia

meta llama-3.2-3b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

chat code generation chat text-to-text language generation meta

meta llama-3.2-11b-vision-instruct

Cutting-edge vision-language model exceling in high-quality reasoning from images.

image-text retrieval visual qa image-to-text image captioning visual grounding meta

meta llama-3.2-90b-vision-instruct

Cutting-edge vision-Language model exceling in high-quality reasoning from images.

image-text retrieval visual qa image captioning image-to-text visual grounding meta

meta llama-3.2-1b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

chat code generation text-to-text language generation meta

nvidia llama-3.1-nemotron-51b-instruct

Unique language model that delivers an unmatched accuracy-efficiency performance.

chat language generation chat text-to-text nvidia

qwen qwen2-7b-instruct

Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.

chinese language generation chat chat text-to-text large language models qwen

abacusai dracarys-llama-3.1-70b-instruct

Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.

chat code generation text-to-text abacusai

nvidia vila

Multi-modal vision-language model that understands text/img/video and creates informative responses

vlm vision language model image caption image to text nvidia

yentinglin llama-3-taiwan-70b-instruct

Sovereign AI model finetuned on Traditional Mandarin and English data using the Llama-3 architecture.

regional language generation chat code generation large language models yentinglin

tokyotech-llm llama-3-swallow-70b-instruct-v0.1

Sovereign AI model trained on Japanese language that understands regional nuances.

large language model chat regional language generation tokyotech-llm

microsoft phi-3.5-vision-instruct

Cutting-edge open multimodal model exceling in high-quality reasoning from images.

vision assistant visual question answering language generation image-to-text microsoft

ai21labs jamba-1.5-mini-instruct

Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.

chat chat language generation text-to-text ai21labs

ai21labs jamba-1.5-large-instruct

Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.

chat chat language generation text-to-text ai21labs

nvidia nemotron-mini-4b-instruct

Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling

chat text-to-text language generation nvidia

nvidia mistral-nemo-minitron-8b-base

State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.

language generation text-to-text chat small language model nvidia

microsoft phi-3.5-moe-instruct

Advanced LLM based on Mixture of Experts architecure to deliver compute efficient content generation

moe chat code generation chat text-to-text language generation microsoft

microsoft phi-3.5-mini-instruct

Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments

code generation chat text-to-text language generation large language models microsoft

rakuten rakutenai-7b-instruct

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

chat chat text-to-text language generation large language models rakuten

rakuten rakutenai-7b-chat

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

chat chat text-to-text language generation large language models rakuten

nvidia megatron-1b-nmt

Enable smooth global interactions in 36 languages.

text translation neural machine translation nvidia nim nvidia

ipd proteinmpnn

ProteinMPNN is a deep learning model for predicting amino acid sequences for protein backbones.

biology nim bionemo drug discovery protein generation ipd

microsoft florence-2

Vision foundation model capable of performing diverse computer vision and vision language tasks.

image classification image object detection cv multimodal vision assistant vlm visual question answering computer vision language generation image-to-text text-to-image microsoft

google gemma-2-2b-it

Advanced small language generative AI model for edge applications

chat code generation chat text-to-text language generation google

thudm chatglm3-6b

Supports Chinese and English languages to handle tasks including chatbot, content generation, coding, and translation.

text translation chat code generation chat text-to-text regional language generation thudm

mistralai mamba-codestral-7b-v0.1

Model for writing and interacting with code across a wide range of programming languages and tasks.

code completion code generation chat code generation mistralai

baichuan-inc baichuan2-13b-chat

Support Chinese and English chat, coding, math, instruction following, solving quizzes

chinese language generation text translation chat chat text-to-text baichuan-inc

meta llama-3.1-405b-instruct

Advanced LLM for synthetic data generation, distillation, and inference for chatbots, coding, and domain-specific tasks.

synthetic data generation chat code generation meta

meta llama-3.1-70b-instruct

Powers complex conversations with superior contextual understanding, reasoning and text generation.

code generation chat text-to-text language generation meta

meta llama-3.1-8b-instruct

Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.

code generation chat text-to-text language generation run-on-rtx meta

nv-mistralai mistral-nemo-12b-instruct

Most advanced language model for reasoning, code, multilingual tasks; runs on a single GPU.

code generation chat language generation text-to-text run-on-rtx nv-mistralai

nvidia nv-embedqa-mistral-7b-v2

Multilingual text question-answering retrieval, transforming textual information into dense vector representations.

nemo retriever embedding retrieval augmented generation nvidia

nvidia maisi

MAISI is a pre-trained volumetric (3D) CT Latent Diffusion Generative Model.

image generation medical imaging nvidia nim nvidia

microsoft phi-3-medium-128k-instruct

Cutting-edge lightweight open language model exceling in high-quality reasoning.

chat code generation chat text-to-text language generation large language models microsoft

bigcode starcoder2-7b

Advanced programming model for code completion, summarization, and generation

code completion code generation code generation bigcode

bigcode starcoder2-15b

Advanced programming model for code completion, summarization, and generation

code completion code generation code generation bigcode

google gemma-2-27b-it

Cutting-edge text generation model text understanding, transformation, and code generation.

chat code generation chat text-to-text language generation google

google gemma-2-9b-it

Cutting-edge text generation model text understanding, transformation, and code generation.

chat code generation text-to-text language generation google

01-ai yi-large

Powerful model trained on English and Chinese for diverse tasks including chatbot and creative writing.

chat code generation chat text-to-text multilingual 01-ai

mistralai mistral-7b-instruct-v0.3

This LLM follows instructions, completes requests, and generates creative text.

chat text-to-text language generation mistralai

upstage solar-10.7b-instruct

Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.

non-commercial use only chat text-to-text language generation large language models upstage

baai bge-m3

Embedding model for text retrieval tasks, excelling in dense, multi-vector, and sparse retrieval.

embeddings retrieval augmented generation text-to-embedding baai

mediatek breeze-7b-instruct

LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.

chat chat text-to-text regional language generation mediatek

google codegemma-1.1-7b

Advanced programming model for code generation, completion, reasoning, and instruction following.

chat code generation code completion google

ipd rfdiffusion

A generative model of protein backbones for protein binder design.

biology nim bionemo drug discovery protein generation ipd

microsoft phi-3-small-8k-instruct

Cutting-edge lightweight open language model exceling in high-quality reasoning.

chat code generation chat text-to-text language generation large language models microsoft

microsoft phi-3-small-128k-instruct

Long context cutting-edge lightweight open language model exceling in high-quality reasoning.

chat code generation chat text-to-text language generation large language models microsoft

microsoft phi-3-medium-4k-instruct

Cutting-edge lightweight open language model exceling in high-quality reasoning.

chat code generation chat text-to-text language generation large language models microsoft

microsoft phi-3-vision-128k-instruct

Cutting-edge open multimodal model exceling in high-quality reasoning from images.

image cv vision assistant vlm visual question answering computer vision language generation image-to-text video microsoft

google paligemma

Vision language model adept at comprehending text and visual inputs to produce informative responses

image cv vision assistant vlm visual question answering computer vision language generation image-to-text video google

aisingapore sea-lion-7b-instruct

LLM to represent and serve the linguistic and cultural diversity of Southeast Asia

chat text-to-text regional language generation large language models aisingapore

microsoft phi-3-mini-4k-instruct

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

chat code generation chat text-to-text language generation large language models microsoft

databricks dbrx-instruct

A general-purpose LLM with state-of-the-art performance in language understanding, coding, and RAG.

chat chat text-to-text language generation large language models databricks

microsoft phi-3-mini-128k-instruct

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

chat code generation chat text-to-text language generation large language models microsoft

mistralai mixtral-8x22b-instruct-v0.1

An MOE LLM that follows instructions, completes requests, and generates creative text.

advanced reasoning chat code generation chat text-to-text large language models mistralai

meta llama3-70b-instruct

Powers complex conversations with superior contextual understanding, reasoning and text generation.

chat large language models code generation chat text-to-text language generation meta

meta llama3-8b-instruct

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

chat code generation chat text-to-text language generation large language models meta

google recurrentgemma-2b

Novel recurrent architecture based language model for faster inference when generating long sequences.

chat code generation chat text-to-text language generation google

google codegemma-7b

Cutting-edge model built on Google's Gemma-7B specialized for code generation and code completion.

chat code generation chat language generation text-to-code google

google gemma-2b

Lightweight language model deployable on laptop, desktop or the cloud for summarization and reasoning.

chat code generation chat text-to-text language generation google

nvidia embed-qa-4

GPU-accelerated generation of text embeddings used for question-answering retrieval.

embeddings retrieval augmented generation text-to-embedding nvidia

nvidia rerank-qa-mistral-4b

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

ranking retrieval augmented generation nvidia

google deplot

Translate images of plots into tables with one-shot visual language understanding.

nemo retriever multimodal data ingestion image-to-text google

nvidia neva-22b

Multi-modal vision-language model that understands text/images and generates informative responses

image cv vision assistant non-commercial use only vlm visual question answering computer vision image-to-text video nvidia

adept fuyu-8b

Multi-modal model for a wide range of tasks, including image understanding and language generation.

image cv multimodal vlm computer vision image understanding language generation image-to-text video adept

google gemma-7b

Cutting-edge text generation model text understanding, transformation, and code generation.

chat code generation chat text-to-text language generation google

mistralai mistral-7b-instruct-v0.2

This LLM follows instructions, completes requests, and generates creative text.

chat text-to-text language generation nvidia nim mistralai

nvidia molmim

MolMIM performs controlled generation, finding molecules with the right properties.

chemistry nim bionemo molecule generation drug discovery nvidia

mistralai mixtral-8x7b-instruct-v0.1

An MOE LLM that follows instructions, completes requests, and generates creative text.

advanced reasoning chat code generation chat text-to-text large language models mistralai