Publisher
Use Case
NIM Type
Sorting by Most Recent
nvidia /
llama-3.1-nemotron-70b-rewardLeaderboard topping reward model supporting RLHF for better alignment with human preferences.
Leaderboard topping reward model supporting RLHF for better alignment with human preferences.