---
title: "deepseek-coder-6.7b-instruct"
publisher: "deepseek-ai"
type: "endpoint"
updated: "2025-01-30T18:17:27.044Z"
description: "Powerful coding model offering advanced capabilities in code generation, completion, and infilling"
canonical: "https://build.nvidia.com/deepseek-ai/deepseek-coder-6_7b-instruct"
---

# Model Overview

## Description:

DeepSeek Coder is a series of code language models trained from scratch on 2T tokens, comprising 87% code and 13% natural language in English and Chinese. The models are available in sizes ranging from 1B to 33B parameters and are designed to support project-level code completion and infilling. The 6.7B parameter model, deepseek-coder-6.7b-instruct, is fine-tuned on 2B tokens of instruction data and offers state-of-the-art performance on multiple programming languages and benchmarks.

## Third-Party Community Consideration
This model is not owned or developed by NVIDIA. This model has been developed and built to a third-party’s requirements for this application and use case; see the [deepseek-coder-6.7b-instruct](https://huggingface.co/deepseek-ai/deepseek-coder-6.7b-instruct) model card on Hugging Face.

## Terms of Use
GOVERNING TERMS: The use of this model is subject to the MIT License and [DeepSeek AI Model Agreement](https://huggingface.co/deepseek-ai/deepseek-coder-6.7b-instruct/blob/main/LICENSE).

## Model Architecture:
**Architecture Type:** Generative Pre-Trained Transformer (GPT)-based <br>
**Network Architecture:** Pre-trained on a project-level code corpus with a window size of 16K and a fill-in-the-blank task <br>
**Model Version:** 1.0 <br>

## Input:
**Input Type:** Text <br>
**Input Format:** String <br>
**Input Parameters:** Temperature, Top K, Top P, Max Output Tokens <br>
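The input parameters listed above correspond to fields of an OpenAI-style chat completion request. The following is a minimal sketch of assembling and sanity-checking such a request body. The helper name `build_payload`, the default values, and the validation ranges are assumptions made for illustration and are not documented limits of this endpoint. Top K is left out because the request examples in this card do not show a field name for it.

```python
from typing import Any, Dict

# Model identifier as used in the request examples of this card.
MODEL_ID = "deepseek-ai/deepseek-coder-6.7b-instruct"


def build_payload(
    prompt: str,
    temperature: float = 0.2,   # illustrative default, not a recommendation
    top_p: float = 0.7,         # illustrative default, not a recommendation
    max_tokens: int = 1024,     # illustrative default, not a recommendation
    stream: bool = False,
) -> Dict[str, Any]:
    """Build a chat completion request body (hypothetical helper)."""
    # The checks below are assumed sanity bounds, not endpoint-enforced limits.
    if not prompt:
        raise ValueError("prompt must be a non-empty string")
    if temperature < 0:
        raise ValueError("temperature must be non-negative")
    if not 0 < top_p <= 1:
        raise ValueError("top_p must be in the interval (0, 1]")
    if max_tokens < 1:
        raise ValueError("max_tokens must be at least 1")

    return {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": temperature,
        "top_p": top_p,
        "max_tokens": max_tokens,
        "stream": stream,
    }
```

The returned dictionary can be serialized with `json.dumps` and sent as the request body, or unpacked as keyword arguments into an OpenAI-compatible client call.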

## Output:
**Output Type:** Text <br>
**Output Format:** String <br>

## Software Integration:
**Supported Hardware Platform(s):** NVIDIA L4 GPUs <br>
**Supported Operating System(s):** Linux <br>

## Inference:

**Engine:** [Triton](https://developer.nvidia.com/triton-inference-server) <br>
**Test Hardware:** Other <br>

## Prototype

```python
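import os

from openai import OpenAI

# Read the API key from the environment instead of hard-coding it.
client = OpenAI(
    base_url="https://integrate.api.nvidia.com/v1",
    api_key=os.environ["NVIDIA_API_KEY"],
)

completion = client.chat.completions.create(
    model="deepseek-ai/deepseek-coder-6.7b-instruct",
    # Example prompt; replace with your own.
    messages=[{"role": "user", "content": "Write a Python function that reverses a string."}],
    # Sampling values are illustrative placeholders, not recommended settings.
    temperature=0.2,
    top_p=0.7,
    max_tokens=1024,
    stream=False,  # non-streaming, so the full message is available at once
)

print(completion.choices[0].message.content)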
```

```python
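import os

from langchain_nvidia_ai_endpoints import ChatNVIDIA

client = ChatNVIDIA(
    model="deepseek-ai/deepseek-coder-6.7b-instruct",
    # Read the API key from the environment instead of hard-coding it.
    api_key=os.environ["NVIDIA_API_KEY"],
    # Sampling values are illustrative placeholders, not recommended settings.
    temperature=0.2,
    top_p=0.7,
    max_tokens=1024,
)

# Example prompt; replace with your own.
response = client.invoke(
    [{"role": "user", "content": "Write a Python function that reverses a string."}]
)
print(response.content)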
```

```javascript
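import OpenAI from 'openai';

// Read the API key from the environment instead of hard-coding it.
const openai = new OpenAI({
  apiKey: process.env.NVIDIA_API_KEY,
  baseURL: 'https://integrate.api.nvidia.com/v1',
});

async function main() {
  const completion = await openai.chat.completions.create({
    model: 'deepseek-ai/deepseek-coder-6.7b-instruct',
    // Example prompt; replace with your own.
    messages: [{ role: 'user', content: 'Write a Python function that reverses a string.' }],
    // Sampling values are illustrative placeholders, not recommended settings.
    temperature: 0.2,
    top_p: 0.7,
    max_tokens: 1024,
    stream: false, // non-streaming, so the full message is available at once
  });

  // Fall back to an empty string if no content was returned.
  process.stdout.write(completion.choices[0]?.message?.content ?? '');
}

main();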
```

```bash
# The prompt and sampling values are illustrative placeholders; adjust as needed.
curl https://integrate.api.nvidia.com/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $NVIDIA_API_KEY" \
  -d '{
    "model": "deepseek-ai/deepseek-coder-6.7b-instruct",
    "messages": [{"role":"user","content":"Write a Python function that reverses a string."}],
    "temperature": 0.2,
    "top_p": 0.7,
    "max_tokens": 1024,
    "stream": false
  }'
```