Copyright © 2026 NVIDIA Corporation

nvidia

llama-3.1-nemotron-nano-8b-v1

Downloadable

Leading reasoning and agentic AI accuracy model for PC and edge.

advanced reasoning, function calling, instruction following, math

The following are key system requirements and supported features to consider when self-hosting the llama-3.1-nemotron-nano-8b-v1 model.

GPU Memory Requirements

Precision   Minimum GPU Memory   Recommended GPU Memory
bf16        16 GB                33 GB
fp8         8 GB                 16 GB

Deploying this NIM with less than the recommended amount of GPU memory for the chosen precision requires setting the environment variable NIM_RELAX_MEM_CONSTRAINTS=1.
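
The memory rule above can be sketched as a small pre-flight check. This is an illustrative helper, not part of the NIM tooling: the function name plan_deployment and the choice to reject GPUs below the minimum are assumptions, while the thresholds and the NIM_RELAX_MEM_CONSTRAINTS variable come from the requirements listed here.

```python
# Thresholds in GB, copied from the GPU memory table:
# precision -> (minimum, recommended)
GPU_MEMORY_GB = {
    "bf16": (16, 33),
    "fp8": (8, 16),
}


def plan_deployment(precision: str, available_gb: float) -> dict:
    """Return extra environment variables needed for the given GPU memory.

    Hypothetical helper: raises ValueError for an unknown precision or for
    memory below the listed minimum (an assumption about how to treat it).
    """
    if precision not in GPU_MEMORY_GB:
        raise ValueError(f"unknown precision: {precision!r}")

    minimum, recommended = GPU_MEMORY_GB[precision]
    if available_gb < minimum:
        raise ValueError(
            f"{available_gb} GB is below the {minimum} GB minimum for {precision}"
        )

    # Between minimum and recommended: the relaxed-memory flag is required.
    if available_gb < recommended:
        return {"NIM_RELAX_MEM_CONSTRAINTS": "1"}

    # At or above the recommended amount: no extra variable is needed.
    return {}
```

For example, plan_deployment("bf16", 24) returns the relaxed-memory variable, while plan_deployment("bf16", 40) returns an empty dictionary. The returned mapping can be merged into whatever environment is passed to the container launch command.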

Feature Support

Feature                              Supported
LoRA Customization                   ✅
Fine-tuning Customization            ✅
Tool Calling                         ✅
TensorRT-LLM Local Engine Building   ✅

Links

  • Documentation
  • NVIDIA NGC Catalog