---
title: "mistral-7b-instruct-v0.2"
publisher: "mistralai"
type: "endpoint"
updated: "2025-05-22T00:17:47.031Z"
description: "This LLM follows instructions, completes requests, and generates creative text."
canonical: "https://build.nvidia.com/mistralai/mistral-7b-instruct-v2"
---

# Model Overview

## Description:

Mistral-7B-Instruct is a language model that can follow instructions, complete requests, and generate creative text formats.
It is an instruct version of the Mistral-7B-v0.2 generative text model fine-tuned using a variety of publicly available conversation datasets.

## Third-Party Community Consideration:

This model is not owned or developed by NVIDIA. This model has been developed and built to a third-party’s requirements for this application and use case; see [Mistral's 7B Instruct Hugging Face Model Card](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2).

## Terms of use

By using this software or model, you are agreeing to the [terms and conditions](https://mistral.ai/terms-of-service/) of the license, acceptable use policy and Mistral's privacy policy. Mistral-7B is released under the Apache 2.0 license

## References(s):

Mistral 7B Instruct [Model Card](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on Hugging Face <br>
Mistral 7B [paper](https://arxiv.org/abs/2310.06825) <br>
Mistral 7B [blogpost](https://mistral.ai/news/announcing-mistral-7b/) <br>

## Model Architecture:

**Architecture Type:** Transformer <br>
**Network Architecture:** Mistral-7B <br>
**Model Version:** 0.1 <br>

## Input:
**Input Format:** Text <br>
**Input Parameters:** Max Tokens, Temperature, Top P <br>

## Output:
**Output Format:** Text <br>
**Output Parameters:** None <br>

## Software Integration:

**Supported Hardware Platform(s):** Hopper, Ampere, Turing <br>
**Supported Operating System(s):** Linux <br>

# Inference:

**Engine:** [Triton](https://developer.nvidia.com/triton-inference-server) <br>
**Test Hardware:** Other <br>

## Prototype

```python
from openai import OpenAI

client = OpenAI(
base_url = "https://integrate.api.nvidia.com/v1",
api_key = "$NVIDIA_API_KEY"
)

completion = client.chat.completions.create(
model="",
messages=[{"role":"user","content":""}],
temperature=,
top_p=,
max_tokens=,
stream=NaN
)

print(completion.choices[0].message)
```

```python
from langchain_nvidia_ai_endpoints import ChatNVIDIA

client = ChatNVIDIA(
model="",
api_key="$NVIDIA_API_KEY", 
temperature=,
top_p=,
max_tokens=,
)

response = client.invoke([{"role":"user","content":""}])
print(response.content)
```

```javascript
import OpenAI from 'openai';

const openai = new OpenAI({
apiKey: '$NVIDIA_API_KEY',
baseURL: 'https://integrate.api.nvidia.com/v1',
})

async function main() {
const completion = await openai.chat.completions.create({
model: "",
messages: [{"role":"user","content":""}],
temperature: ,
top_p: ,
max_tokens: ,
stream: ,
})

process.stdout.write(completion.choices[0]?.message?.content);

}

main();
```

```bash
curl https://integrate.api.nvidia.com/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $NVIDIA_API_KEY" \
-d '{
"model": "mistralai/mistral-7b-instruct-v0.2",
"messages": [{"role":"user","content":""}],
"temperature": ,   
"top_p": ,
"max_tokens": ,
"stream":                 
}'
```