Setup Nanochat on Dual-Spark
This playbook shows you how to run Andrej Karpathy’s Nanochat on Spark. Nanochat is popularized as being the best ChatGPT that $100 can buy. This playbook makes it possible to train and run Nanochat locally on your dual-Spark setup.
You’ll set up a local, end-to-end ChatGPT-like training pipeline, including pre-training, mid-training, post-training, and optional reinforcement learning. You will also be able to chat with your model through a simple web UI.
nvidia-smidocker run --rm --gpus all nvcr.io/nvidia/pytorch:25.11-py3 nvidia-smiThe reference training scripts can be found in the Nanochat repository here on GitHub
Duration: Upto 5 days depending on model size and number of training stages.
Risks:
Rollback:
$HOME/.cache/nanochat