nvidia/llama-3.1-nemotron-70b-reward
PREVIEW
Leaderboard-topping reward model that supports RLHF for better alignment with human preferences.
Tags: rlhf, reward model, text-to-text