Explore
NIM
Docs
Forums
Login
text: RLHF
Clear Filters
Filters (1)
Sort
Clear filters
Sorting by Most Recent
Models ( - )
Agent Blueprints ( - )
nvidia
/
llama-3.1-nemotron-70b-reward
Leaderboard topping reward model supporting RLHF for better alignment with human preferences.
Text-to-text
Reward Model