NVIDIA
Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

11 models

    Mistral AI
    Downloadable

    mistral-small-4-119b-2603

    Hybrid MoE model unifying instruct, reasoning, and coding with multimodal input and 256k context
    chat
    938
    1d
    Mistral AI
    Free Endpoint

    mistral-large-3-675b-instruct-2512

    A state-of-the-art general-purpose MoE VLM ideal for chat, agentic, and instruction-based use cases.
    chat
    6.55M
    3mo
    Mistral AI
    Free Endpoint

    mistral-nemotron

    Built for agentic workflows, this model excels in coding, instruction following, and function calling.
    chat
    801K
    9mo
    Mistral AI
    Free Endpoint

    mistral-small-3.1-24b-instruct-2503

    Efficient multimodal model excelling at multilingual tasks, image understanding, and fast responses.
    chat
    2.17M
    10mo
    Mistral AI
    Free Endpoint

    mistral-medium-3-instruct

    Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.
    chat
    5.33M
    8mo
    NVIDIA
    Free Endpoint

    nv-embedcode-7b-v1

    The NV-EmbedCode model is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.
    nemo retriever
    283K
    9mo
    Mistral AI
    Downloadable

    mistral-small-24b-instruct

    Latency-optimized language model excelling in code, math, general knowledge, and instruction-following.
    chat
    614K
    8mo
    NVIDIA
    Free Endpoint

    mistral-nemo-minitron-8b-base

    State-of-the-art small language model delivering superior accuracy for chatbots, virtual assistants, and content generation.
    language generation
    3.86K
    1y
    Mistral AI
    Downloadable

    mistral-7b-instruct-v0.3

    This LLM follows instructions, completes requests, and generates creative text.
    chat
    867K
    9mo
    NVIDIA
    Free Endpoint

    rerank-qa-mistral-4b

    GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
    Ranking
    172K
    1y
    Mistral AI
    Free Endpoint

    mistral-7b-instruct-v0.2

    This LLM follows instructions, completes requests, and generates creative text.
    chat
    560K
    10mo