---
title: "baichuan2-13b-chat"
publisher: "baichuan-inc"
type: "endpoint"
updated: "2025-05-22T19:13:59.035Z"
description: "Support Chinese and English chat, coding, math, instruction following, solving quizzes"
canonical: "https://build.nvidia.com/baichuan-inc/baichuan2-13b-chat"
---

## Model Overview

### Description:
Baichuan 2 is the new generation of large-scale open-source language models launched by Baichuan Intelligence Inc. It is trained on a high-quality corpus of 2.6 trillion tokens and has achieved the best performance among models of the same size on authoritative Chinese and English benchmarks. This 13B chat model is fully open for academic research. Developers can also use it free of charge in commercial applications after obtaining an official commercial license.

### Evaluation
Baichuan2-13B-Chat is tested on authoritative Chinese and English datasets across six domains: General, Legal, Medical, Mathematics, Code, and Multilingual Translation. For more detailed evaluation results of the original models, refer to [GitHub](https://github.com/baichuan-inc/Baichuan2).

### Terms and Conditions
We hereby declare that our team has not developed any applications based on Baichuan 2 models, whether on iOS, Android, the web, or any other platform. We strongly call on all users not to use Baichuan 2 models for any activities that harm national or social security or violate the law. We also ask users not to use Baichuan 2 models for Internet services that have not undergone appropriate security reviews and filings. We hope that all users will abide by this principle and ensure that the development of technology proceeds in a regulated and legal environment.

If any problems arise due to the use of Baichuan 2 open-source models, including but not limited to data security issues, public opinion risks, or any risks and problems brought about by the model being misled, abused, spread or improperly exploited, we will not assume any responsibility.

### Third-Party Community Consideration
This model is not owned or developed by NVIDIA. This model has been developed and built to a third party's requirements for this application and use case; see Baichuan-Inc's [Model Card](https://huggingface.co/baichuan-inc/Baichuan2-13B-Chat).

### Reference(s):
[GitHub](https://github.com/baichuan-inc/Baichuan2) <br>
[HuggingFace](https://huggingface.co/baichuan-inc/Baichuan2-13B-Chat/tree/main) <br>
[Technical Report](https://arxiv.org/abs/2309.10305) <br>

### License, Acceptable Use, and Research Privacy Policy
By using this model, you agree to the terms and conditions of the [Apache 2.0](https://github.com/baichuan-inc/Baichuan2/blob/main/LICENSE) license and the [Community License for Baichuan2 Model](https://huggingface.co/baichuan-inc/Baichuan2-7B-Base/resolve/main/Baichuan%202%E6%A8%A1%E5%9E%8B%E7%A4%BE%E5%8C%BA%E8%AE%B8%E5%8F%AF%E5%8D%8F%E8%AE%AE.pdf).

### Model Architecture:
**Architecture Type:** Transformer <br>
**Fine-tuned from model:** Baichuan2 <br>

### Input:
**Input Type:** Text <br>
**Input Format:** String <br>
**Input Parameters:** Temperature, Top P, Max Output Tokens<br>
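
The three input parameters correspond to fields in an OpenAI-style chat completion request body. The following is a minimal sketch of how such a body might be assembled and sanity-checked before sending. The helper name `build_chat_request`, the default values, and the range checks are assumptions made for illustration; they are not documented limits of this endpoint.

```python
from typing import Any

# Model identifier as it appears in this page's canonical URL and curl example.
MODEL_ID = "baichuan-inc/baichuan2-13b-chat"


def build_chat_request(
    prompt: str,
    temperature: float = 0.5,  # illustrative default, not a documented value
    top_p: float = 1.0,        # illustrative default, not a documented value
    max_tokens: int = 1024,    # illustrative default, not a documented value
) -> dict[str, Any]:
    """Build a chat completion request body (hypothetical helper)."""
    if not prompt:
        raise ValueError("prompt must be a non-empty string")
    # Assumed sanity ranges; the endpoint may enforce different limits.
    if temperature < 0:
        raise ValueError("temperature must be non-negative")
    if not 0 < top_p <= 1:
        raise ValueError("top_p must be in the interval (0, 1]")
    if max_tokens < 1:
        raise ValueError("max_tokens must be a positive integer")
    return {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": temperature,
        "top_p": top_p,
        "max_tokens": max_tokens,
    }
```

The returned dictionary can be serialized with `json.dumps` and sent as the request body, or unpacked as keyword arguments into an OpenAI-compatible client call.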

### Output:
**Output Type:** Text <br>
**Output Format:** String <br>

## Inference:
**Engine:** Triton TensorRT-LLM <br>
**Test Hardware:** L40 <br>

## Prototype

```python
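import os

from openai import OpenAI

# Read the API key from the environment instead of hard-coding it.
client = OpenAI(
    base_url="https://integrate.api.nvidia.com/v1",
    api_key=os.environ["NVIDIA_API_KEY"],
)

completion = client.chat.completions.create(
    model="baichuan-inc/baichuan2-13b-chat",
    # Example prompt; replace with your own.
    messages=[{"role": "user", "content": "Write a limerick about GPU computing."}],
    # Sampling values are illustrative placeholders, not recommended settings.
    temperature=0.5,
    top_p=1,
    max_tokens=1024,
    stream=False,
)

print(completion.choices[0].message.content)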
```

```javascript
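import OpenAI from 'openai';

// Read the API key from the environment instead of hard-coding it.
const openai = new OpenAI({
  apiKey: process.env.NVIDIA_API_KEY,
  baseURL: 'https://integrate.api.nvidia.com/v1',
});

async function main() {
  const completion = await openai.chat.completions.create({
    model: "baichuan-inc/baichuan2-13b-chat",
    // Example prompt; replace with your own.
    messages: [{"role":"user","content":"Write a limerick about GPU computing."}],
    // Sampling values are illustrative placeholders, not recommended settings.
    temperature: 0.5,
    top_p: 1,
    max_tokens: 1024,
    stream: false,
  });

  process.stdout.write(completion.choices[0]?.message?.content ?? '');
}

main();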
```

```bash
# The prompt and sampling values are illustrative placeholders; adjust as needed.
curl https://integrate.api.nvidia.com/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $NVIDIA_API_KEY" \
  -d '{
    "model": "baichuan-inc/baichuan2-13b-chat",
    "messages": [{"role":"user","content":"Write a limerick about GPU computing."}],
    "temperature": 0.5,
    "top_p": 1,
    "max_tokens": 1024,
    "stream": false
  }'
```
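
The `stream` field in the requests above takes a boolean. When it is set to true, the OpenAI Python client returns an iterator of chunks instead of a single completion object. The following is a minimal sketch of assembling the streamed text, assuming each chunk follows the OpenAI chat completion chunk shape (`choices[0].delta.content`). The helper name `collect_stream` is hypothetical.

```python
from typing import Any, Iterable


def collect_stream(chunks: Iterable[Any]) -> str:
    """Concatenate the text deltas of a streamed chat completion (hypothetical helper)."""
    parts = []
    for chunk in chunks:
        # Some chunks may carry no choices (for example, usage-only chunks); skip them.
        if not chunk.choices:
            continue
        delta = chunk.choices[0].delta.content
        # The content of a delta can be None, e.g. on the final chunk.
        if delta is not None:
            parts.append(delta)
    return "".join(parts)


# Usage, assuming `client` is an OpenAI client configured for this endpoint:
# stream = client.chat.completions.create(
#     model="baichuan-inc/baichuan2-13b-chat",
#     messages=[{"role": "user", "content": "Hello"}],
#     stream=True,
# )
# print(collect_stream(stream))
```

Collecting the whole response is the simplest option; for interactive output, print each delta inside the loop as it arrives instead.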