Explore
Models
Blueprints
GPUs
Docs
?
Login
Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Publisher
Use Case
NIM Type
text: RLHF
Clear Filters
Filters (1)
Sort
Clear filters
Sorting by Most Recent
nvidia
llama-3.1-nemotron-70b-reward
Leaderboard topping reward model supporting RLHF for better alignment with human preferences.
text-to-text
reward model
rlhf
nvidia