Explore
Models
Blueprints
Docs
Forums
Login
Search Results
Searching for:
RLHF
text: RLHF
Clear Filters
Filters (1)
Sort
Clear filters
Sorting by Most Recent
All ( - )
Models ( - )
Blueprints ( - )
nvidia
/
llama-3.1-nemotron-70b-reward
Leaderboard topping reward model supporting RLHF for better alignment with human preferences.
Text-to-text
Reward Model