AgentSuiteLocal

User Manual — AgentSuiteLocal v1.0.0

AgentSuiteLocal runs seven specialist AI agents on your machine. Each agent takes a one-sentence business goal, walks a five-stage pipeline (intake → extract → spec → execute → QA), and writes a folder of structured markdown artifacts to your disk. You review them, approve what’s good, and the approved output feeds every future run on that project.

That’s the whole loop. Everything in this app is a UI on top of it. All processing happens locally — nothing leaves your machine unless you explicitly configure a cloud API key.

Manual version: v1.0.0 · Last updated: 2026-05-08


1. Installation walkthrough

Installer welcome screen The installer welcome screen — your first view when you launch AgentSuiteLocal for the first time.

The installer runs once and walks 6 short steps. After setup completes it doesn’t appear again unless you reinstall.

Windows SmartScreen note: Windows may show “Windows protected your PC” before the installer opens because AgentSuiteLocal is free, open-source beta software and the installer is unsigned. If you downloaded it from the official GitHub Releases page, click More info and then Run anyway to continue.

Step Screen What happens
1 Welcome Intro screen. Click Get started.
2 License & privacy Read the license. Check the box. Click I agree.
3 Hardware & model tier The app probes your CPU, RAM, and disk and recommends a tier — Light (8 GB RAM, gemma4:e2b), Balanced (16 GB RAM, gemma4:e4b, recommended), or Pro (32 GB RAM, gemma4:26b). Pick one.
4 Ollama & model download Confirms Ollama is running, then pulls the model for the tier you chose. Includes a 3-attempt retry loop with backoff. If Ollama isn’t running yet, open the Ollama app and the screen unlocks automatically.
5 Smoke test Runs four sequential checks: Ollama reachable, model loaded, API healthy, real test inference. Each check shows a green tick or a fix card with a specific action. Skip smoke test is available with a confirmation if you need to bypass it (the app may not work after skipping).
6 You’re set up Click Launch AgentSuiteLocal to open the main app.

What changed from earlier versions. The installer was 11 screens before v0.7.1, then 5 in v0.7.1 (UX-1), then 6 in v0.8.8 (smoke test re-introduced). Agent selection, the cloud-fallback API key, and Python runtime checks now live in Settings, not the installer. If you have inputs from a v0.7.0-or-earlier walkthrough, expect the screens to look different.


2. First run — from Dashboard to approved artifact

Dashboard overview The Dashboard shows your projects, recent runs, engine status, and any runs waiting for your review.

Step 1: Open the app. You’ll see the Dashboard with an engine status card and a “Start a new run” prompt.

Step 2: Click New run (or click an agent card in the Agents view). The New Run screen appears.

Step 3: Fill in three fields:

Step 4: Click Start run. The New Run screen transitions to the Live Run view.

Step 5: Wait — or come back later. The pipeline runs five stages. Each lights up when it starts and shows a checkmark when complete. The whole pipeline takes 9–16 minutes on typical hardware. Closing the Live Run view does not stop the run — it continues in the background. You can also click Cancel at any time to stop the run and save partial artifacts.

Step 6: Review. When QA completes, the Approval Gate opens automatically.

Approval gate The approval gate shows every artifact with rendered markdown preview, QA scores across nine dimensions, and the Approve / Reject buttons.

Step 7: Read the artifacts. The file tree on the left lists every artifact the agent produced. Click any file to read it in the centre panel. The right panel shows QA scores across nine dimensions.

Step 8: Approve or reject. If the QA composite score meets the gate threshold (default 7.0), the Approve & promote button is enabled. Click it to copy every artifact to your kernel. If the score is below the threshold, or if the content is wrong, click Reject and run again with a sharper goal — or click Override & approve to bypass the gate with a confirmation.


3. Agent reference

Founder

Design

Product

Engineering

Marketing

Trust / Risk

CIO


4. The live view — understanding what you’re seeing

Live run view The live run view shows the five-stage pipeline as it progresses, with elapsed timers and a live output log.

Stage indicators

Each of the five stages has a row in the left panel:

Timing

Cancel and reconnect

QA scores

After the QA stage, nine dimensions appear in the right panel:

Each dimension scores 0–10. The composite is a weighted average. The default approval threshold is 7.0 (configurable in Settings).

If the model returns fewer than 9 dimensions, an amber notice appears: “Partial QA scores — the model returned N of 9 dimensions.” This is common with smaller models and doesn’t mean the output is wrong — it means the QA rubric response was truncated.


5. The kernel — what it is and how to use it

Kernel view The Kernel view shows all your approved artifacts grouped by project and agent. Click any file to preview it inline.

The kernel is the canonical store of approved artifacts. Every run loads the kernel for its project as context before starting.

Location on disk: ~/AgentSuite/.agentsuite/_kernel/{project}/{agent}/{YYYY-MM-DD-HHMMSS}/

When you approve a run, all its artifacts are copied to the kernel with a timestamped folder. Subsequent runs on the same project see this content as prior context.

The Kernel view lists all approved exports grouped by project, then agent, then reverse-chronological. Each entry shows: agent name, project, timestamp, artifact count, total size, and an Open folder button. Clicking an artifact opens an inline preview rendered as markdown.

Comparing versions: if a project/agent combination has more than one approved export, a Compare button appears. It opens a side-by-side diff: left = selected version, right = latest. A file selector lets you pick which artifact to diff.

Exporting: in the Approval Gate and Run Detail views, an Export dropdown offers:


6. Pipelines

Pipelines chain multiple agents end-to-end. The output of each agent is passed as context to the next.


7. Model management

Model management The Model Management screen lets you pull new models, see which is active, and delete ones you no longer need.

Open Settings → Models (or click Models in the sidebar) to manage your local models.

Installed models: lists all models Ollama has pulled, with size and last-used date. Set as active switches the model used for future runs; Delete (with confirmation) removes a model from disk.

Recommended models:

Tier Model Disk Min RAM
Light gemma4:e2b ~3 GB 8 GB
Balanced gemma4:e4b ~5 GB 16 GB
Pro gemma4:26b ~17 GB 32 GB

Click Pull next to any recommended model to download it. A live progress bar shows download progress streamed directly from Ollama.

Any model already in your Ollama library shows up in the Models view — click Set as active to switch. Power users can use Ollama directly to manage advanced model variants outside the app.


8. Settings reference

Run configuration

| Setting | Default | Description | |———|———|————-| | Model tier | balanced | Which tier to use for all runs | | Run timeout | 15 min | Max time before a run is killed with an error | | QA gate threshold | 7.0 | Minimum composite QA score to enable the Approve button | | Workspace path | ~/AgentSuite | Where runs and the kernel are stored | | Desktop notifications | on | OS toast when a run reaches a terminal state | | Telemetry | off | Local-only JSONL log of app events |

Cloud fallback

When an Anthropic API key is present, the model dropdown gains three cloud options: Claude 3.5 Haiku, Claude 3.5 Sonnet, and Claude Opus 4. Selecting a cloud model routes all future runs through the Anthropic API. A permanent warning reads: “Cloud runs send your goal and context to Anthropic’s servers and incur API costs. Local runs are always free.”

The API key is stored in the OS credential store (Windows Credential Manager, macOS Keychain, Linux Secret Service) — never in settings.json. See SECURITY.md.

Notifications and telemetry

Desktop notifications fire on terminal-state transitions (waiting, approved, rejected, error, cancelled, timeout) and respect your OS Do Not Disturb. Telemetry, when enabled, writes to ~/.agentsuitelocal/usage.jsonl only — nothing leaves your machine.

Uninstall

Settings → Danger zone → Uninstall AgentSuiteLocal opens a three-phase flow:

  1. Shows your workspace size and asks whether to delete all run data.
  2. Asks whether to delete the Ollama model.
  3. Calls the system uninstaller (Windows: Add/Remove Programs entry; macOS: removes the .app).

9. Troubleshooting

The installer is stuck on “Ollama & model download” and Continue never lights up. The AI engine isn’t running yet. Look for the Ollama icon in your system tray (Windows) or menu bar (macOS) — a small llama head. If you don’t see it, open the Ollama app you installed. Once the icon appears, the installer should unlock within a few seconds. If you haven’t installed Ollama, get it from ollama.com/download.

Model download fails after 3 attempts. Check your internet connection. Ollama pulls from Ollama’s servers — if those are unreachable, the download fails. Try again when connectivity is restored. If the download partially completes but the screen reports “Model download appears incomplete”, click the one-click re-pull button.

A run starts but immediately shows an error. The most common cause is Ollama not running. Check the engine status card on the Dashboard — it shows “Not running” if Ollama has stopped. Restart Ollama and try again. If the same step fails twice, open an issue with the error message.

Run shows “Timed out after 15 minutes.” The model stopped responding mid-run. Options: (a) increase the run timeout in Settings → Run timeout, (b) switch to a lighter model tier, (c) reduce the size of your inputs folder, (d) verify Ollama is still running — it can crash under memory pressure.

The run finished but QA score is below 7.0. Usually the goal was too vague. Sharpen it, add an inputs folder with real context, or switch to a higher model tier. A score of 6.5+ on a small model often improves to 8.0+ on Balanced with the same goal.

Approve button is greyed out despite a score above 7.0. The QA gate threshold may have been raised in Settings above the current score. Either lower the threshold or click Override & approve (amber button) to bypass with a confirmation dialog.

QA dimensions show “partial QA scores” notice. The model returned fewer than 9 dimensions — common with smaller models. Scores shown are still valid; the composite is calculated from what was returned. Switch to Balanced or Pro for full 9-dimension scores.

The artifacts look cut off or much shorter than expected. Usually the Light tier running out of context window. Switch to Balanced or Pro and re-run.

“Connection lost — reconnecting” appears in the Live view. The SSE connection dropped. The app reconnects automatically (up to 10 attempts with backoff: 1s → 2s → 4s → 8s → 16s → 30s). The run continues in the background. If reconnection fails, navigate to Runs to check status.

Kernel files are not showing up in the Kernel view. A run must be approved before its artifacts appear in the Kernel. Verify the run shows “approved” in Runs. If the kernel is empty after approval, check the workspace path in Settings — the kernel lives under that path.

Retry button doesn’t pre-fill the goal field. Runs created before v0.7.0 may not have a goal field. Fill it in manually for those runs. All v0.7.0+ runs save the goal automatically.

Desktop notifications are not appearing. Windows: Settings → System → Notifications → AgentSuiteLocal. macOS: System Settings → Notifications → AgentSuiteLocal. Also confirm the Desktop notifications toggle is on in AgentSuiteLocal Settings.

The app opens to the installer on every launch. The setup-complete state is stored in the browser’s localStorage under agentsuite_setup_complete. If the value is cleared (private browsing, profile reset, etc.) the installer reappears. Walk through the installer once and it persists.

PDF export fails. PDF export uses reportlab, which is pure Python and requires no system runtime. A failure usually means the run has no artifact files yet. Ensure the run has completed at least one step and reached the approval gate before exporting. If the error persists, the backend log will show the underlying exception.

Antivirus flags the installer or .exe. Some AV tools flag PyInstaller-bundled executables as suspicious — a known false positive. Add the install folder to your AV exclusion list, or upload the .exe to virustotal.com to verify independently.

Building from source? Common build problems (PyInstaller, Vite top-level await, iscc not found, pytest import errors) are documented in CONTRIBUTING.md, not here.


10. FAQ

See docs/FAQ.md for the full list. Short answers to the most common ones:


Found something confusing or wrong in this manual? Open an issue — user manuals only stay honest when readers push back.