nvidia/llama-3.1-nemotron-70b-reward
PREVIEWLeaderboard topping reward model supporting RLHF for better alignment with human preferences.
Enter a conversation between a user & assistant:
Leaderboard topping reward model supporting RLHF for better alignment with human preferences.