---
title: "rakutenai-7b-chat"
publisher: "rakuten"
type: "endpoint"
updated: "2025-05-22T18:02:46.364Z"
description: "Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation."
canonical: "https://build.nvidia.com/rakuten/rakutenai-7b-chat"
---

## Model Description

Rakuten AI 7b chat model is a conversational AI system designed for seamless, natural interactions. With 7 billion parameters, it excels in understanding and generating human-like text in both English and Japanese. Its extensive training dataset includes diverse sources, making it suitable for customer service, virtual assistants, and other applications. Optimized for performance, the Rakutan AI 7b enhances user experience with advanced language processing capabilities and robust multilingual support, particularly excelling in Japanese-language use cases. 

This model is ready for commercial use.

**Model Developer** Rakuten Group, Inc.

**License** This model is licensed under [Apache License, Version 2.0.](https://www.apache.org/licenses/LICENSE-2.0)

**Language(s)** Japanese, English

## Third-Party Community Consideration: 
This model is not owned or developed by NVIDIA. This model has been developed and built to a third-party’s requirements for this application and use case; see link to Non-NVIDIA [7B Chat Model Card](https://huggingface.co/Rakuten/RakutenAI-7B-chat).

## References:

RakutenAI 7B chat [Model Card](https://huggingface.co/Rakuten/RakutenAI-7B-chat) on Hugging Face <br>
RakutenAI 7B chat [paper](https://arxiv.org/abs/2403.15484) <br>

## Model Architecture:

**Architecture Type:** Transformer <br>
**Network Architecture:** Mistral-7B <br>
**Model Version:** 0.1 <br>

## Input:
**Input Type(s):** Text <br>
**Input Format:** String <br>
**Input Parameters:** Max Tokens, Temperature, Top P <br>

### Limitations & Bias:

The suite of RakutenAI-7B models is capable of generating human-like text on a wide range of topics. However, like all LLMs, they have limitations and can produce biased, inaccurate, or unsafe outputs. Please exercise caution and judgement while interacting with them.

## Output:
**Output Type(s):** Text <br>
**Output Format:** String <br>

## Software Integration:
**Supported Hardware Platform(s):** NVIDIA Lovelace <br>
**Supported Operating System(s):** Linux <br>

## Training Datasets:

- [JSNLI](https://nlp.ist.i.kyoto-u.ac.jp/?%E6%97%A5%E6%9C%AC%E8%AA%9ESNLI%28JSNLI%29%E3%83%87%E3%83%BC%E3%82%BF%E3%82%BB%E3%83%83%E3%83%88)
- [RTE](https://nlp.ist.i.kyoto-u.ac.jp/?Textual+Entailment+%E8%A9%95%E4%BE%A1%E3%83%87%E3%83%BC%E3%82%BF)
- [KUCI](https://nlp.ist.i.kyoto-u.ac.jp/?KUCI)
- [BELEBELE](https://huggingface.co/datasets/facebook/belebele)
- [JCS](https://aclanthology.org/2022.lrec-1.317/)
- [JNLI](https://aclanthology.org/2022.lrec-1.317/)
- [Dolly-15K](https://huggingface.co/datasets/databricks/databricks-dolly-15k)
- [OpenAssistant1](https://huggingface.co/datasets/OpenAssistant/oasst1)

# Inference:

**Engine:** Triton Inference Server <br>
**Test Hardware:** NVIDIA L40 Systems <br>

## Ethical Considerations:
NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications.  When downloaded or used in accordance with our terms of service, developers should work with their internal model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse. 

Please report security vulnerabilities or NVIDIA AI Concerns [here](https://www.nvidia.com/en-us/support/submit-security-vulnerability/).

## Prototype

```python
from openai import OpenAI

client = OpenAI(
base_url = "https://integrate.api.nvidia.com/v1",
api_key = "$NVIDIA_API_KEY"
)

completion = client.chat.completions.create(
model="",
messages=[{"role":"user","content":""}],
temperature=,
top_p=,
max_tokens=,
stream=NaN
)

print(completion.choices[0].message)
```

```javascript
import OpenAI from 'openai';

const openai = new OpenAI({
apiKey: '$NVIDIA_API_KEY',
baseURL: 'https://integrate.api.nvidia.com/v1',
})

async function main() {
const completion = await openai.chat.completions.create({
model: "",
messages: [{"role":"user","content":""}],
temperature: ,
top_p: ,
max_tokens: ,
stream: ,
})

process.stdout.write(completion.choices[0]?.message?.content);

}

main();
```

```bash
curl https://integrate.api.nvidia.com/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $NVIDIA_API_KEY" \
-d '{
"model": "rakuten/rakutenai-7b-chat",
"messages": [{"role":"user","content":""}],
"temperature": ,   
"top_p": ,
"max_tokens": ,
"stream":                 
}'
```