nvidia/llama-3.1-nemotron-70b-reward

PREVIEW

Leaderboard topping reward model supporting RLHF for better alignment with human preferences.

Input

Enter a conversation between a user & assistant:

Output