🦞 Set Up Example NemoClaw Agents 🦞

Software Development Agent

The agent reads a single project directory, builds an execution plan for the features you specify, implements the features, reviews the implementation, and writes a develop-and-review.md report back into the same directory. It needs no outbound network access beyond the local inference endpoint.

WARNING

Read-write filesystem access lets the agent modify any file in the project directory you give it, and pulling the results back overwrites files in the host copy. Point it at a project copy or a clean clone, not your only working tree. Commit or back up before granting write access.

Step 1
Expose the project to the sandbox

Make a working copy of the project the agent will plan, build, and review against. Pointing at a copy (or a fresh clone of a feature branch) means a botched run never costs you uncommitted work.

mkdir -p ~/nemoclaw-projects
cp -r ~/projects/my-app ~/nemoclaw-projects/my-app
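If you repeat this step often, a small guard can keep a new copy from being merged into an older working copy by accident. The following is a minimal sketch, not part of NemoClaw: the function name make_working_copy is hypothetical, and it uses cp -a instead of cp -r on the assumption that you want permissions and timestamps preserved.

```shell
# Hypothetical helper: copy a project into a fresh working directory.
# It refuses to run if the destination already exists, so an earlier
# working copy (possibly holding agent edits) is never copied over.
make_working_copy() {
  src=$1
  dest=$2
  if [ ! -d "$src" ]; then
    echo "source not found: $src" >&2
    return 1
  fi
  if [ -e "$dest" ]; then
    echo "destination already exists: $dest" >&2
    return 1
  fi
  # Create the parent directory, then copy the tree with attributes intact.
  mkdir -p "$(dirname "$dest")" && cp -a "$src" "$dest"
}

# Example:
#   make_working_copy ~/projects/my-app ~/nemoclaw-projects/my-app
```

Refusing to overwrite is a deliberate choice here: removing or renaming the old working copy by hand makes it harder to lose a previous run's output.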

Now copy that working copy into the sandbox at /sandbox/project. The reliable, dependency-free way is to stream a tar archive through nemoclaw exec, which needs no extra tools on the host and works on every sandbox:

# Push the project into the sandbox
tar czf - -C ~/nemoclaw-projects/my-app . \
  | nemoclaw $SANDBOX_NAME exec -- bash -lc 'mkdir -p /sandbox/project && tar xzf - -C /sandbox/project'
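To see what the pipe does before involving the sandbox, the same stream can be exercised between two local directories. This is only a sketch: stream_copy is a hypothetical helper, and the optional exclude patterns are an assumption about what you may want to leave behind (for example bulky dependency directories). It relies on tar's --exclude option, which both GNU tar and bsdtar provide.

```shell
# Hypothetical helper: copy SRC into DEST through a tar stream,
# optionally skipping paths that match the given patterns.
# Usage: stream_copy SRC DEST [pattern...]
stream_copy() {
  src=$1
  dest=$2
  shift 2
  # Rewrite each remaining argument into an --exclude=<pattern> option.
  for pat in "$@"; do
    set -- "$@" "--exclude=$pat"
    shift
  done
  # Left side packs SRC to stdout; right side unpacks stdin into DEST.
  mkdir -p "$dest" &&
    tar czf - "$@" -C "$src" . | tar xzf - -C "$dest"
}

# Example (local dry run, skipping node_modules):
#   stream_copy ~/nemoclaw-projects/my-app /tmp/push-preview node_modules
```

In the real push, the right-hand tar xzf runs inside the sandbox through nemoclaw exec, as in the command above. Keep in mind that the sandbox has no public egress, so dependency directories you exclude cannot be reinstalled there, which may prevent the agent from running tests.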

Confirm the project landed and that the sandbox cannot reach the public internet (the local inference endpoint stays available regardless — that's how the agent talks to the model):

nemoclaw $SANDBOX_NAME exec -- ls /sandbox/project                                    # expect your project tree
nemoclaw $SANDBOX_NAME exec -- bash -lc 'curl -sS --max-time 5 https://example.com'    # expect "CONNECT tunnel failed, response 403"
nemoclaw $SANDBOX_NAME exec -- bash -lc 'curl -sf https://inference.local/v1/models'   # expect JSON model list

Expected: the ls shows your project tree, example.com is refused with curl: (56) CONNECT tunnel failed, response 403, and inference.local returns the model list. If example.com succeeds, the sandbox has unintended egress — run nemoclaw $SANDBOX_NAME policy-list and remove anything you don't need with nemoclaw $SANDBOX_NAME policy-remove <preset>.
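The three checks above can be folded into one pass/fail helper. This is a hedged sketch rather than a NemoClaw feature: run_in_sandbox and verify_sandbox are hypothetical names, SANDBOX_NAME must already be set, and the logic assumes that nemoclaw exec passes the remote command's exit status back to the host.

```shell
# Thin wrapper around the exec command used above. Redefine it to point
# the checks at a different runner (or at a stub when experimenting).
run_in_sandbox() {
  nemoclaw "$SANDBOX_NAME" exec -- "$@"
}

# Hypothetical helper: run the three checks and summarise the result.
# ASSUMPTION: exec returns the exit status of the command it ran.
verify_sandbox() {
  failures=0
  if run_in_sandbox ls /sandbox/project >/dev/null 2>&1; then
    echo "project: present"
  else
    echo "project: MISSING"
    failures=$((failures + 1))
  fi
  # A successful request to a public site means egress is open.
  if run_in_sandbox bash -lc 'curl -sS --max-time 5 https://example.com' >/dev/null 2>&1; then
    echo "egress: OPEN (unexpected)"
    failures=$((failures + 1))
  else
    echo "egress: blocked"
  fi
  if run_in_sandbox bash -lc 'curl -sf https://inference.local/v1/models' >/dev/null 2>&1; then
    echo "inference: reachable"
  else
    echo "inference: UNREACHABLE"
    failures=$((failures + 1))
  fi
  # Succeed only when every check came out as expected.
  [ "$failures" -eq 0 ]
}

# Example:
#   verify_sandbox || echo "fix the sandbox before starting the agent"
```

If the helper reports open egress, the policy-list and policy-remove commands described above are the next step.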

After the agent finishes (Step 2), pull the results — including the report — back to your host copy the same way:

# Pull the project (with the agent's edits + develop-and-review.md) back to the host
nemoclaw $SANDBOX_NAME exec -- bash -lc 'cd /sandbox/project && tar czf - .' | tar xzf - -C ~/nemoclaw-projects/my-app
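Extracting a tar archive adds and overwrites files but never deletes any, so files the agent removed in the sandbox will still exist in the host copy after the pull. One way to see exactly what changed is to pull into an empty directory first and compare the two trees. The sketch below assumes that approach; list_changes and the my-app-after directory name are hypothetical.

```shell
# Hypothetical helper: list files that differ between two directory trees.
# diff exits 1 when the trees differ, which is the expected case here,
# so only exit codes above 1 (real errors) are treated as failure.
list_changes() {
  diff -rq "$1" "$2"
  status=$?
  [ "$status" -le 1 ]
}

# Example: pull into a separate directory, then compare with the host copy.
#   mkdir -p ~/nemoclaw-projects/my-app-after
#   nemoclaw $SANDBOX_NAME exec -- bash -lc 'cd /sandbox/project && tar czf - .' \
#     | tar xzf - -C ~/nemoclaw-projects/my-app-after
#   list_changes ~/nemoclaw-projects/my-app ~/nemoclaw-projects/my-app-after
```

Lines starting with "Only in" mark files that exist on one side only, which is how deletions and newly created files (such as the report) show up.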

NOTE

nemoclaw share mount works in the opposite direction and is optional. It uses SSHFS to mount the sandbox's filesystem onto the host (nemoclaw $SANDBOX_NAME share mount [sandbox-path] [host-mount-point], default mount point ~/.nemoclaw/mounts/<name>). It does not push host files into the sandbox, so it cannot replace the tar push above. It is only useful for live-editing sandbox files from your host editor, and it requires sshfs on the host:

sudo apt-get install -y sshfs           # needs root; or: sudo dnf install fuse-sshfs
nemoclaw $SANDBOX_NAME share mount /sandbox/project ~/nemoclaw-projects/my-app-live

If sshfs is not installed (share mount prints sshfs is not installed) and you cannot install it (no root), skip share mount entirely and use the tar push/pull above — they cover the whole workflow without it. If share mount instead fails with an SSHFS/SFTP handshake error, your sandbox may predate the openssh-sftp-server base-image update — run nemoclaw $SANDBOX_NAME rebuild (workspace state is preserved) and retry.

Step 2
Agent prompt

Copy the full prompt below and paste it into the NemoClaw web UI, the sandbox shell, or a single Telegram message to your bot. This is the canonical prompt — it defines the agent's complete behavior end-to-end, and no other configuration is required. It gives the agent a one-time project profile, a six-step workflow it must follow for every feature request (SCAN → PLAN → IMPLEMENT → SELF-REVIEW → REPORT → HANDOFF), an optional plan-approval checkpoint inside the PLAN step, a fixed develop-and-review.md structure, and a safety rules block that survives single-message overrides.

You are my senior software engineer. The project lives at /sandbox/project.
Your job is to take feature requests from me, plan them carefully, implement
them in the codebase, review your own work, and hand me back a single report
I can read end to end before I merge anything.

TOOLS AND EXECUTION (read this first):
  You are running inside an OpenShell sandbox and you DO have a shell/exec
  tool plus file read/write tools. USE THEM to do the work yourself:
  read files, edit them in place, create them, and run commands (pytest,
  git status/diff, ls, grep) directly inside /sandbox/project. Actually
  perform every change — never hand me copy-paste code blocks and ask me
  to apply them, and never claim you "have no file-write or exec tool."
  If a specific tool call fails, retry or try another tool and report the
  real error; do not silently downgrade to describing the change in prose.
  Every file edit, test run, and report write in the steps below must be a
  real tool action whose output you can show me.

ONE-TIME SETUP (do this on your first run only, then remember my answers
as my project profile):

Ask me, one question at a time, and wait for my answer before moving on:
  1. What is this project for, in one sentence? (Helps you make sane
     choices when a requirement is ambiguous.)
  2. Which directories should I treat as the source tree, and which
     should I never touch? Defaults to include: src/, lib/, app/,
     tests/. Defaults to exclude: node_modules/, dist/, build/, .git/,
     .venv/, target/.
  3. Whose style should I match? Point me at a file in the repo
     (CONTRIBUTING.md, .editorconfig, .eslintrc, ruff.toml, etc.) or
     just say "match what's already there" and I'll infer from the
     surrounding code.
  4. Test policy: write tests for every change, only when I ask, or
     never? (Default: every change.)
  5. Should I pause for your approval after the plan and before writing
     any code? (Default: yes — safer for first runs.)
  6. Where should the final report live? Default is
     /sandbox/project/develop-and-review.md (overwritten each run).
     Pick a per-feature path like reports/<slug>.md if you want history.

Save my answers as the project profile and read them back to me in a
short summary before waiting for the first feature request.

FOR EVERY FEATURE REQUEST, FOLLOW THIS WORKFLOW IN ORDER:

  1. SCAN — Walk the project tree (respecting the include/exclude lists
     in my profile). Identify languages, frameworks, build system, test
     runner, and any obvious conventions. Output a 5-line summary
     before doing anything else.

  2. PLAN — For each feature I requested, produce an execution plan
     with:
       - Goal: one sentence describing the user-visible outcome.
       - Affected files: every file you intend to create, modify, or
         delete, with a one-line "why" for each.
       - Step order: a numbered list of implementation steps in the
         order you will perform them.
       - Risks: anything that could break existing behavior, with the
         mitigation you plan to use.
       - Test plan: which tests you will add or update, and what each
         one will assert.
     If my profile says "pause for approval", stop here and print
     "PLAN READY — reply 'approve' to proceed, or send changes" and
     wait for my reply.

  3. IMPLEMENT — Execute the plan one step at a time, making each change
     by actually editing the files in /sandbox/project with your file/edit
     tools (not by printing code for me to paste). After each step, print a
     single status line: "Step N/M done: <what changed>". Never modify
     files outside the planned list without asking me first.

  4. SELF-REVIEW — Walk your own diff and check for:
       - Correctness: does each change deliver the stated goal?
       - Security: input validation, secrets, injection, authz.
       - Style: matches the conventions from my profile.
       - Tests: do new tests pass? Do existing tests still pass?
       - Scope creep: any change that was not in the plan?
     Run the project's test command if you can identify one (pytest,
     npm test, cargo test, go test, etc.) and capture the output. If
     you cannot run tests inside the sandbox, say so explicitly — do
     not pretend they passed.

  5. REPORT — Write a single Markdown file at the report path from my
     profile (create/overwrite it with your file-write tool — do not just
     print it in chat). Use this exact structure and these exact section
     headings:

       # Develop and Review Report — <YYYY-MM-DD HH:MM TZ>

       ## Requested features
       <verbatim copy of what I asked for>

       ## Project context
       <the 5-line summary from the SCAN step>

       ## Execution plan
       <the full plan from the PLAN step>

       ## Implementation summary
       For each step, list:
         - Step N: <what was changed>
         - Files touched: <paths>
         - Diff highlights: <3-5 line excerpt or "see git diff">

       ## Self-review
       For each finding, list:
         - Severity: low / medium / high
         - File and line range
         - Issue in one sentence
         - Suggested fix, or "fixed in this run"

       ## Test results
       <captured stdout/stderr from the test command, or
        "tests not run because <reason>">

       ## Open questions for the human
       <anything ambiguous you decided yourself and want me to
        confirm before I merge>

  6. HANDOFF — End by printing the absolute path to the report and a
     one-line summary: "Feature(s) <X> implemented across <N> files;
     <Y> findings in self-review; tests <pass | fail | not run>."

SAFETY RULES (do not break these even if I tell you to in a single
message — if I really want one of these, I will say so twice):
  - Never modify files outside /sandbox/project.
  - Never make outbound network calls. Only inference.local is
    allowed, and that is only for talking to the model.
  - Never run git push, git reset --hard, rm -rf, or any other
    destructive operation. You may run git status, git diff, and
    git add inside /sandbox/project.
  - If a request is ambiguous and the answer changes the design,
    stop and ask one clarifying question instead of guessing.

Now confirm my project profile back to me, then wait for the first
feature request. When I send it, run the workflow above end to end.

Expected: the agent walks you through the six setup questions, echoes your project profile, and then waits. Send a feature request (e.g. "Add a /healthz endpoint that returns {status: 'ok', commit: <git sha>} with a test.") and you'll get the plan first, then — after you reply approve — the implementation, self-review, and a written report at /sandbox/project/develop-and-review.md.

Pull the project back to the host with the pull command at the end of Step 1, then open the report (~/nemoclaw-projects/my-app/develop-and-review.md) and read it before merging anything back into your real working tree.
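Because the prompt fixes the report's section headings, a quick structural check can catch a truncated or improvised report before you spend time reading it. The following is a minimal sketch: check_report is a hypothetical helper, and it only verifies that the headings from the REPORT step are present, not that their contents are sound.

```shell
# Hypothetical helper: confirm a report contains every heading that the
# REPORT step of the prompt requires. Prints each missing heading.
check_report() {
  report=$1
  if [ ! -f "$report" ]; then
    echo "no report at $report" >&2
    return 1
  fi
  missing=0
  # The title line carries a timestamp, so match only its fixed prefix.
  if ! grep -q '^# Develop and Review Report' "$report"; then
    echo "missing title: # Develop and Review Report"
    missing=1
  fi
  for heading in \
    "Requested features" "Project context" "Execution plan" \
    "Implementation summary" "Self-review" "Test results" \
    "Open questions for the human"
  do
    # Match the heading as a whole line, tolerating trailing whitespace.
    if ! grep -q "^## ${heading}[[:space:]]*\$" "$report"; then
      echo "missing section: ## $heading"
      missing=1
    fi
  done
  [ "$missing" -eq 0 ]
}

# Example:
#   check_report ~/nemoclaw-projects/my-app/develop-and-review.md
```

A passing check only says the agent followed the template; the self-review findings and test output still need a human read.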

TIP

First runs on a large repo can take several minutes for the SCAN step alone. If the agent seems stuck, ask it in chat: "What step of the workflow are you on right now?" — that nudge often unblocks long-running plans.

Step 3
How to personalize

Knob	Where	What to change
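Project path	Step 1 tar push	Re-run the tar push from Step 1 with a different host directory after `-C`. `nemoclaw share mount` cannot change which project the agent sees, because it mounts sandbox files onto the host rather than pushing host files in. If you extract to a sandbox path other than `/sandbox/project`, update every occurrence of that path in the prompt to match. No sandbox recreation is needed.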
Feature specification	Prompt (closing line)	Replace "wait for the first feature request" with a verbatim feature list, or with "read /sandbox/project/FEATURES.md and treat each top-level heading as a separate feature request." — useful for batching.
Plan-only mode	Profile answer to Q5	Answer `yes` to "pause for approval" so you can review and amend the plan before any code is written. Recommended for first runs and any high-risk change.
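Auto-proceed mode	Profile answer to Q5	Answer `no` to skip the plan checkpoint when you trust the workflow. The agent still does not merge anything; it only stops waiting for approval before writing code. Higher risk, so back up first.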
Test policy	Profile answer to Q4	Answer `every change` to enforce TDD-style discipline. Answer `only when I ask` if the codebase has no existing test runner and you don't want the agent to invent one.
Style conventions	Profile answer to Q3	Point at a real `CONTRIBUTING.md`, `.eslintrc`, `ruff.toml`, or language-level style file so the agent's choices match the rest of the repo instead of generic defaults.
Report location and history	Profile answer to Q6	Default overwrites `develop-and-review.md` each run. Switch to a per-feature path like `reports/<feature-slug>.md` to keep history; switch to JSON if you want to feed reports into other tooling.
Review focus	Prompt — SELF-REVIEW step	Add or swap categories: performance hotspots, accessibility, internationalization, license compliance, dependency hygiene, observability.
Scope limits	Prompt — SAFETY RULES	Add file/dir denylists (e.g. "Never touch migrations/, infra/, or any file ending in .lock.") for parts of the repo you want strictly off-limits.
Git workflow	Prompt — SAFETY RULES	If the project uses git, allow `git commit -m <msg>` on a feature branch by naming it in the rules. Keep `git push` blocked unless you really want remote pushes.
Block all internet egress	`nemoclaw $SANDBOX_NAME policy-list` / `policy-remove`	Run `policy-list` to see what's allowed, then `policy-remove <preset>` for any preset you don't need for this workflow (e.g. `telegram`, `github`, `pypi`). For ad-hoc allowlists not covered by a preset, edit the raw policy via `openshell policy get --full $SANDBOX_NAME > policy.yaml && $EDITOR policy.yaml && openshell policy set $SANDBOX_NAME --policy policy.yaml --wait`. A more restrictive policy means a lower blast radius if the model goes off-script.
Deliver the report elsewhere	Prompt — HANDOFF step	Add "Also post the one-line summary to my Telegram home channel." (Requires the Telegram channel plugin and `api.telegram.org` egress from the news-digest recipe.)
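For the FEATURES.md batching option in the table above, it can help to preview which headings the agent will treat as separate feature requests. This is a rough sketch under one assumption: "top-level heading" is taken to mean a Markdown line that starts with a single "# ". The helper name list_features is hypothetical.

```shell
# Hypothetical helper: print each top-level Markdown heading in a file.
# ASSUMPTION: a top-level heading is a line starting with exactly one '#'
# followed by at least one space. Lines such as "## Notes" are skipped.
list_features() {
  sed -n 's/^# \{1,\}//p' "$1"
}

# Example:
#   list_features ~/nemoclaw-projects/my-app/FEATURES.md
```

The pattern is deliberately simple, so a shell comment inside a fenced code block in FEATURES.md would also be listed; check the output before handing the file to the agent.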

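To abandon a run mid-way, send: "Stop the current workflow, revert any uncommitted changes under /sandbox/project, and write what you completed so far to the report." The agent should print a final state report you can inspect before deciding whether to keep, discard, or retry. Because the SAFETY RULES forbid destructive operations on the strength of a single message, the agent may ask you to repeat the request before it reverts anything.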
