Search Results
Searching for: Guard Model
mistralaimagistral-small-2506
High performance reasoning model optimized for efficiency and edge deployment

nvidiariva-translate-4b-instruct
Translation model in 12 languages with few-shots example prompts capability.

metallama-guard-4-12b
Multi-modal model to classify safety for input prompts as well output responses.

googlegemma-3n-e4b-it
An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments

googlegemma-3n-e2b-it
An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments

nvidiallama-3.2-nemoretriever-500m-rerank-v2
GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

mistralaimistral-nemotron
Built for agentic workflows, this model excels in coding, instruction following, and function calling

nvidiaRefine AI Agents through Continuous Model Distillation with Data Flywheels
Build a data flywheel, with NVIDIA NeMo microservices, that continuously optimizes AI agents for latency and cost — while maintaining accuracy targets.

nvidiaSafety for Agentic AI
Improve safety, security, and privacy of AI systems at build, deploy and run stages.

nvidiaAI Agent for Telecom Network Configuration Planning
Automate and optimize the configuration of radio access network (RAN) parameters using agentic AI and a large language model (LLM)-driven framework.

nvidiallama-3.1-nemotron-nano-vl-8b-v1
Multi-modal vision-language model that understands text/img and creates informative responses

speakleashbielik-11b-v2.3-instruct
State-of-the-art model for Polish language processing tasks such as text generation, Q&A, and chatbots.

nvidiallama-3.1-nemotron-nano-4b-v1.1
State-of-the-art open model for reasoning, code, math, and tool calling - suitable for edge agents

marinmarin-8b-instruct
State-of-the-art open model trained on open datasets, excelling in reasoning, math, and science.

ibmgranite-3.3-8b-instruct
Small language model fine-tuned for improved reasoning, coding, and instruction-following

qwenqwen3-235b-a22b
Advanced reasoing MOE mode excelling at reasoning, multilingual tasks, and instruction following

black-forest-labsFLUX.1-schnell
FLUX.1-schnell is a distilled image generation model, producing high quality images at fast speeds

utter-projecteurollm-9b-instruct
State-of-the-art, multilingual model tailored to all 24 official European Union languages.

mistralaimistral-small-3.1-24b-instruct-2503
Efficient multimodal model excelling at multilingual tasks, image understanding, and fast-responses

mistralaimistral-medium-3-instruct
Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.

black-forest-labsFLUX.1-dev
FLUX.1 is a state-of-the-art suite of image generation models

metallama-4-maverick-17b-128e-instruct
A general purpose multimodal, multilingual 128 MoE model with 17B parameters.

metallama-4-scout-17b-16e-instruct
A multimodal, multilingual 16 MoE model with 17B parameters.

nvidiaBuild an AI Agent for Enterprise Research
Build artificial general agents (AGA) powered by AGI models that continuously process and synthesize multimodal enterprise data, enabling reasoning, planning, and refinement to generate comprehensive reports.

nvidiacosmos-predict1-7b
Generalist model to generate future world state as videos from text and image prompts to create synthetic training data for robots and autonomous vehicles.

nvidiallama-3.3-nemotron-super-49b-v1
High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

nvidiallama-3.1-nemotron-nano-8b-v1
Leading reasoning and agentic AI accuracy model for PC and edge.

nvidianv-embedcode-7b-v1
The NV-EmbedCode model is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.

nvidianemoretriever-table-structure-v1
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

nvidianemoretriever-graphic-elements-v1
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

nvidianemoretriever-page-elements-v2
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

googlegemma-3-27b-it
Cutting-edge open multimodal model exceling in high-quality reasoning from images.

googlegemma-3-1b-it
A lightweight, multilingual, advanced SLM text model for edge computing, resource constraint applications

nvidianemoretriever-parse
Cutting-edge vision-language model exceling in retrieving text and metadata from images.

nvidiaLLM Router
Route LLM requests to the best model for the task at hand.

microsoftphi-4-multimodal-instruct
Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.

nvidiacanary-1b-asr
Multi-lingual model supporting speech-to-text recognition and translation.

nvidiacanary-0.6b-turbo-asr
Multi-lingual model supporting speech-to-text recognition and translation.

mistralaimistral-small-24b-instruct
Latency-optimized language model excelling in code, math, general knowledge, and instruction-following.

nvidiaBuild an Enterprise RAG pipeline
Continuously extract, embed, and index multimodal data for fast, accurate semantic search. Built on world-class NeMo Retriever models, the RAG blueprint connects AI applications to multimodal enterprise data wherever it resides.

nvidiallama-3.1-nemoguard-8b-topic-control
Topic control model to keep conversations focused on approved topics, avoiding inappropriate content.

nvidianemoguard-jailbreak-detect
Industry leading jailbreak classification model for protection from adversarial attempts

nvidiallama-3.1-nemoguard-8b-content-safety
Leading content safety model for enhancing the safety and moderation capabilities of LLMs

qwenqwen2.5-7b-instruct
Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.

nvidiaPDF to Podcast
Transform PDFs into AI podcasts for engaging on-the-go audio content.

nvidiacosmos-nemotron-34b
Multi-modal vision-language model that understands text/img/video and creates informative responses

qwenqwen2.5-coder-7b-instruct
Powerful mid-size code model with a 32K context length, excelling in coding in multiple languages.

nvidiallama-3.2-nv-rerankqa-1b-v2
Fine-tuned reranking model for multilingual, cross-lingual text question-answering retrieval, with long context support.

nvidianv-yolox-page-elements-v1
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

nvidiaconformer-ctc-asr
Automatic speech recognition model that transcribes speech in lower case English with record-setting accuracy and performance

hivedeepfake-image-detection
Advanced AI model detects faces and identifies deep fake images.

ibmgranite-guardian-3.0-8b
Detects jailbreaking, bias, violence, profanity, sexual content, and unethical behavior

nvidiallama-3.1-nemotron-70b-instruct
Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA in order to improve the helpfulness of LLM generated responses.

zyphrazamba2-7b-instruct
Efficient hybrid state-space model designed for conversational and reasoning tasks.

institute-of-science-tokyollama-3.1-swallow-70b-instruct-v0.1
Sovereign AI model trained on Japanese language that understands regional nuances.

institute-of-science-tokyollama-3.1-swallow-8b-instruct-v0.1
Sovereign AI model trained on Japanese language that understands regional nuances.

nvidiamistral-nemo-minitron-8b-8k-instruct
State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.

nvidiallama-3.1-nemotron-70b-reward
Leaderboard topping reward model supporting RLHF for better alignment with human preferences.

metallama-3.2-3b-instruct
Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

metallama-3.2-11b-vision-instruct
Cutting-edge vision-language model exceling in high-quality reasoning from images.

metallama-3.2-90b-vision-instruct
Cutting-edge vision-Language model exceling in high-quality reasoning from images.

metallama-3.2-1b-instruct
Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

nvidiallama-3.1-nemotron-51b-instruct
Unique language model that delivers an unmatched accuracy-efficiency performance.

qwenqwen2-7b-instruct
Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.

abacusaidracarys-llama-3.1-70b-instruct
Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.

hiveai-generated-image-detection
Robust image classification model for detecting and managing AI-generated content.

yentinglinllama-3-taiwan-70b-instruct
Sovereign AI model finetuned on Traditional Mandarin and English data using the Llama-3 architecture.

tokyotech-llmllama-3-swallow-70b-instruct-v0.1
Sovereign AI model trained on Japanese language that understands regional nuances.

microsoftphi-3.5-vision-instruct
Cutting-edge open multimodal model exceling in high-quality reasoning from images.

nvidiamistral-nemo-minitron-8b-base
State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.

microsoftphi-3.5-mini-instruct
Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments

rakutenrakutenai-7b-instruct
Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

rakutenrakutenai-7b-chat
Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

nvidianv-grounding-dino
Grounding dino is an open vocabulary zero-shot object detection model.

ipdproteinmpnn
ProteinMPNN is a deep learning model for predicting amino acid sequences for protein backbones.

microsoftflorence-2
Vision foundation model capable of performing diverse computer vision and vision language tasks.

googleshieldgemma-9b
Guardrail model to ensure that responses from LLMs are appropriate and safe

googlegemma-2-2b-it
Advanced small language generative AI model for edge applications

mistralaimamba-codestral-7b-v0.1
Model for writing and interacting with code across a wide range of programming languages and tasks.

metallama-3.1-8b-instruct
Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.

nv-mistralaimistral-nemo-12b-instruct
Most advanced language model for reasoning, code, multilingual tasks; runs on a single GPU.

nvidianv-rerankqa-mistral-4b-v3
Multilingual text reranking model.

nvidianv-embedqa-e5-v5
English text embedding model for question-answering retrieval.


microsoftphi-3-medium-128k-instruct
Cutting-edge lightweight open language model exceling in high-quality reasoning.

bigcodestarcoder2-7b
Advanced programming model for code completion, summarization, and generation

bigcodestarcoder2-15b
Advanced programming model for code completion, summarization, and generation

googlegemma-2-27b-it
Cutting-edge text generation model text understanding, transformation, and code generation.

googlegemma-2-9b-it
Cutting-edge text generation model text understanding, transformation, and code generation.


stabilityaistable-diffusion-3-medium
Advanced text-to-image model for generating high quality images

upstagesolar-10.7b-instruct
Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.

googlecodegemma-1.1-7b
Advanced programming model for code generation, completion, reasoning, and instruction following.

ipdrfdiffusion
A generative model of protein backbones for protein binder design.

microsoftphi-3-small-8k-instruct
Cutting-edge lightweight open language model exceling in high-quality reasoning.

microsoftphi-3-small-128k-instruct
Long context cutting-edge lightweight open language model exceling in high-quality reasoning.

microsoftphi-3-medium-4k-instruct
Cutting-edge lightweight open language model exceling in high-quality reasoning.

microsoftphi-3-vision-128k-instruct
Cutting-edge open multimodal model exceling in high-quality reasoning from images.

aisingaporesea-lion-7b-instruct
LLM to represent and serve the linguistic and cultural diversity of Southeast Asia

microsoftphi-3-mini-4k-instruct
Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

databricksdbrx-instruct
A general-purpose LLM with state-of-the-art performance in language understanding, coding, and RAG.

snowflakearctic-embed-l
Optimized community model for text embedding.

microsoftphi-3-mini-128k-instruct
Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

mistralaimixtral-8x22b-instruct-v0.1
An MOE LLM that follows instructions, completes requests, and generates creative text.

metallama3-70b-instruct
Powers complex conversations with superior contextual understanding, reasoning and text generation.

metallama3-8b-instruct
Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

googlerecurrentgemma-2b
Novel recurrent architecture based language model for faster inference when generating long sequences.

googlecodegemma-7b
Cutting-edge model built on Google's Gemma-7B specialized for code generation and code completion.

nvidiarerank-qa-mistral-4b
GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

nvidiadeepvariant
Run Google's DeepVariant optimized for GPU. Switch models for high accuracy on all major sequencers.

stabilityaistable-video-diffusion
Stable Video Diffusion (SVD) is a generative diffusion model that leverages a single image as a conditioning frame to synthesize video sequences.

stabilityaisdxl-turbo
A fast generative text-to-image model that can synthesize photorealistic images from a text prompt in a single network evaluation

mistralaimixtral-8x7b-instruct-v0.1
An MOE LLM that follows instructions, completes requests, and generates creative text.