Try NVIDIA NIM APIs

Explore Models Blueprints GPUs Docs

Manage My Privacy

Contact

Search Results

Searching for: Llama Nemotron

Sorting by Most Recent

meta llama-guard-4-12b

Multi-modal model to classify safety for input prompts as well output responses.

llm multimodal safety content safety guardrail content moderator meta

nvidia llama-3.2-nemoretriever-500m-rerank-v2

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

nemo retriever retrieval augmented generation reranking nvidia

nvidia llama-3.2-nemoretriever-1b-vlm-embed-v1

Multimodal question-answer retrieval representing user queries as text and documents as images.

nemo retriever embedding retrieval augmented generation text-to-embedding nvidia

mistralai mistral-nemotron

Built for agentic workflows, this model excels in coding, instruction following, and function calling

language generation chat instruction following function calling mistralai

nvidia llama-3.1-nemotron-nano-vl-8b-v1

Multi-modal vision-language model that understands text/img and creates informative responses

doc intelligence multiple image understanding ocr nvidia

nvidia llama-3.1-nemotron-nano-4b-v1.1

State-of-the-art open model for reasoning, code, math, and tool calling - suitable for edge agents

edge tool calling reasoning math nvidia

nvidia llama-3.1-nemotron-ultra-253b-v1

Superior inference efficiency with highest accuracy for scientific and complex math reasoning, coding, tool calling, and instruction following.

chat math advanced reasoning instruction following function calling nvidia

meta llama-4-maverick-17b-128e-instruct

A general purpose multimodal, multilingual 128 MoE model with 17B parameters.

language generation image-to-text vision assistant visual question answering meta

meta llama-4-scout-17b-16e-instruct

A multimodal, multilingual 16 MoE model with 17B parameters.

language generation image-to-text vision assistant visual question answering meta

nvidia Build an AI Agent for Enterprise Research

Build artificial general agents (AGA) powered by AGI models that continuously process and synthesize multimodal enterprise data, enabling reasoning, planning, and refinement to generate comprehensive reports.

nim launchable llama nemotron reasoning blueprint enterprise retrieval-augmented generation nvidia ai nemo retriever nvidia

nvidia llama-3.3-nemotron-super-49b-v1

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

math advanced reasoning instruction following function calling nvidia

nvidia llama-3.1-nemotron-nano-8b-v1

Leading reasoning and agentic AI accuracy model for PC and edge.

math advanced reasoning instruction following function calling nvidia

deepseek-ai deepseek-r1-distill-llama-8b

Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.

distillation coding chat reasoning run-on-rtx math deepseek-ai

nvidia llama-3.1-nemoguard-8b-topic-control

Topic control model to keep conversations focused on approved topics, avoiding inappropriate content.

dialogue safety llm safety guard model content safety nvidia

nvidia llama-3.1-nemoguard-8b-content-safety

Leading content safety model for enhancing the safety and moderation capabilities of LLMs

llm safety content moderation guard model content safety nvidia

llamaindex Document Research Assistant for Blog Creation

Automate research, and generate blogs with AI Agents using LlamaIndex and Llama3.3-70B NIM LLM.

blog creation launchable ai agents blueprint partner llamaindex nvidia ai llamaindex

langchain Structured Report Generation

Generate detailed, structured reports on any topic using LangGraph and Llama3.3 70B NIM

langgraph report generation launchable ai agents blueprint partner nvidia ai langchain

crewai Code Documentation for Software Development

Document your github repositories with AI Agents using CrewAI and Llama3.3 70B NIM.

code documentation crewai launchable ai agents blueprint partner nvidia ai crewai

nvidia cosmos-nemotron-34b

Multi-modal vision-language model that understands text/img/video and creates informative responses

vlm vision language model image caption image to text nvidia

nvidia llama-3.2-nv-embedqa-1b-v2

Multilingual and cross-lingual text question-answering retrieval with long context support and optimized data storage efficiency.

nemo retriever run-on-rtx embedding retrieval augmented generation text-to-embedding nvidia

nvidia llama-3.2-nv-rerankqa-1b-v2

Fine-tuned reranking model for multilingual, cross-lingual text question-answering retrieval, with long context support.

nemo retriever retrieval augmented generation reranking nvidia

meta llama-3.3-70b-instruct

Advanced LLM for reasoning, math, general knowledge, and function calling

reasoning code generation text-to-text instruction following math meta

nvidia nemotron-4-mini-hindi-4b-instruct

A bilingual Hindi-English SLM for on-device inference, tailored specifically for Hindi Language.

indic chat chat text-to-text language generation nvidia

nvidia llama-3.1-nemotron-70b-instruct

Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA in order to improve the helpfulness of LLM generated responses.

chat code generation chat text-to-text language generation nvidia

nvidia Vulnerability Analysis for Container Security

Rapidly identify and mitigate container security vulnerabilities with generative AI.

generative ai launchable nv-embedqa-e5-v5 blueprint llama-3_1-70b-instruct cybersecurity nvidia ai nvidia

institute-of-science-tokyo llama-3.1-swallow-70b-instruct-v0.1

Sovereign AI model trained on Japanese language that understands regional nuances.

sovereign ai large language model chat regional language generation institute-of-science-tokyo

institute-of-science-tokyo llama-3.1-swallow-8b-instruct-v0.1

Sovereign AI model trained on Japanese language that understands regional nuances.

sovereign ai large language model chat chat regional language generation institute-of-science-tokyo

nvidia llama-3.1-nemotron-70b-reward

Leaderboard topping reward model supporting RLHF for better alignment with human preferences.

text-to-text reward model rlhf nvidia

meta llama-3.2-3b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

chat code generation chat text-to-text language generation meta

meta llama-3.2-11b-vision-instruct

Cutting-edge vision-language model exceling in high-quality reasoning from images.

image-text retrieval visual qa image-to-text image captioning visual grounding meta

meta llama-3.2-90b-vision-instruct

Cutting-edge vision-Language model exceling in high-quality reasoning from images.

image-text retrieval visual qa image captioning image-to-text visual grounding meta

meta llama-3.2-1b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

chat code generation text-to-text language generation meta

nvidia llama-3.1-nemotron-51b-instruct

Unique language model that delivers an unmatched accuracy-efficiency performance.

chat language generation chat text-to-text nvidia

abacusai dracarys-llama-3.1-70b-instruct

Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.

chat code generation text-to-text abacusai

yentinglin llama-3-taiwan-70b-instruct

Sovereign AI model finetuned on Traditional Mandarin and English data using the Llama-3 architecture.

regional language generation chat code generation large language models yentinglin

tokyotech-llm llama-3-swallow-70b-instruct-v0.1

Sovereign AI model trained on Japanese language that understands regional nuances.

large language model chat regional language generation tokyotech-llm

nvidia nemotron-mini-4b-instruct

Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling

chat text-to-text language generation nvidia

meta llama-3.1-405b-instruct

Advanced LLM for synthetic data generation, distillation, and inference for chatbots, coding, and domain-specific tasks.

synthetic data generation chat code generation meta

meta llama-3.1-70b-instruct

Powers complex conversations with superior contextual understanding, reasoning and text generation.

code generation chat text-to-text language generation meta

meta llama-3.1-8b-instruct

Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.

code generation chat text-to-text language generation run-on-rtx meta

nvidia llama3-chatqa-1.5-70b

Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.

text-to-text chat non-commercial use only chat nvidia

nvidia llama3-chatqa-1.5-8b

Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.

text-to-text chat non-commercial use only nvidia

meta llama3-70b-instruct

Powers complex conversations with superior contextual understanding, reasoning and text generation.

chat large language models code generation chat text-to-text language generation meta

meta llama3-8b-instruct

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

chat code generation chat text-to-text language generation large language models meta