deepseek-r1-distill-qwen-32b Model by Deepseek-ai

The following are key system requirements and supported features to consider when self-hosting the deepseek-r1-distill-qwen-32b model.

GPU Memory Requirements

Precision	Minimum GPU Memory	Recommended GPU Memory
bf16	64 GB	72 GB
fp8	32 GB	36 GB

Deploying this NIM with less than the recommended amount of GPU memory requires setting the environment variable NIM_RELAX_MEM_CONSTRAINTS=1

Feature	Supported
LoRA Customization	❌
Fine-tuning Customization	✅
Tool Calling	❌
TensorRT-LLM Local Engine Building	✅