llama3-70b-instruct Model by Meta

Key system requirements and supported features to consider when self-hosting the llama3-70b-instruct model are listed below.

GPU Memory Requirements

Deploying this NIM with less GPU memory than the recommended amount requires setting the environment variable NIM_RELAX_MEM_CONSTRAINTS=1.
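As a rough illustration of how the variable might be passed at launch, the sketch below builds a container start command with NIM_RELAX_MEM_CONSTRAINTS=1 set. It assumes the NIM is started with `docker run`, which this page does not state; the helper name `docker_run_args` and the image name are hypothetical placeholders, and only the variable name and value come from the text above.

```python
import shlex


def docker_run_args(image: str, relax_mem_constraints: bool = True) -> list[str]:
    """Build an argument list for launching a container.

    Assumption: the NIM is launched via `docker run`. The image name is
    supplied by the caller; this helper does not know the real image path.
    """
    args = ["docker", "run", "--rm", "--gpus", "all"]
    if relax_mem_constraints:
        # Needed when the GPU has less memory than the recommended amount.
        args += ["-e", "NIM_RELAX_MEM_CONSTRAINTS=1"]
    args.append(image)
    return args


if __name__ == "__main__":
    # "example/nim-image:tag" is a placeholder, not a real image reference.
    print(shlex.join(docker_run_args("example/nim-image:tag")))
```

The list can be handed to `subprocess.run` as-is; building it as a list rather than a single string avoids shell quoting issues.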

Supported Features

Feature	Supported
LoRA Customization	✅
Fine-tuning Customization	❌
Tool Calling	❌
TensorRT-LLM Local Engine Building	❌