LM Studio on DGX Spark

30 MIN

Deploy LM Studio and serve LLMs on a Spark device

SymptomCauseFix
API returns "model not found" errorModel not downloaded or loaded in LM StudioRun lms ls to verify download status, then load model with lms load {model-name}
lms command not foundPATH issue assuming successful installationRefresh your shell by running source ~/.bashrc
Model load fails - CUDA out of memoryModel too large for available VRAMSwitch to a smaller model or a different quantization

NOTE

DGX Spark uses a Unified Memory Architecture (UMA), which enables dynamic memory sharing between the GPU and CPU. With many applications still updating to take advantage of UMA, you may encounter memory issues even when within the memory capacity of DGX Spark. If that happens, manually flush the buffer cache with:

sudo sh -c 'sync; echo 3 > /proc/sys/vm/drop_caches'

For latest known issues, please review the DGX Spark User Guide.