Parallel Subagent Dispatch and the One Failure Mode That Hides

Four parallel haiku dispatches ran a four-workstream batch in roughly six minutes. The sequential version of the same work would have taken thirty-plus. The speedup is real, and it scales: most well-formed multi-task sessions have at least one fan-out shape where four to seven parallel dispatches fit.

The interesting thing is not the speedup. The interesting thing is what almost broke it.

The Independence Test

Two subtasks are independent if neither one reads or writes any state that the other reads or writes. State here means the filesystem, the database, the API target, anything shared. If subtask A modifies a file that subtask B will also modify, they are coupled, and running them in parallel is not faster than running them sequentially because they will collide.

Most subtasks that look independent on a planning whiteboard turn out to be filesystem-coupled when you actually try to dispatch them. “Update the docs for component X” and “Update the docs for component Y” look independent until both subagents try to update the shared docs/index.md. “Add tests for module A” and “Refactor module B” look independent until A’s tests need to import from B and B is mid-refactor.

The first rule of parallel dispatch is to spend a minute checking what shared state each subtask touches. The subtasks that pass the check fan out. The subtasks that fail it run sequentially.

Worktree Isolation

For tasks that involve modifying source code in a git repository, the standard isolation pattern is git worktree add. Each parallel subagent gets its own worktree pointing at its own branch. The main checkout stays untouched. Each subagent commits to its own branch in its own worktree. When all subagents finish, the main thread merges the branches one at a time.

This works because the worktree gives each subagent its own copy of the working tree backed by the same .git directory. The branches are real branches in the same repo. Conflicts only surface at merge time, which is much cheaper than conflicts that surface mid-dispatch.

The pattern handles most of the filesystem-coupling problems. Two subagents editing the same file are now editing the same file in two different worktrees, and the conflict gets resolved at merge. Two subagents adding tests to the same test file are doing it in two different worktrees, and again the merge is the resolution point.

The Failure Mode That Hides

Here is what almost broke the four-dispatch run last session.

One of the four parallel agents was instructed to run git worktree add before doing any work. The agent did not run it. Instead it ran the work in the main checkout, committed to the main checkout’s branch, and reported back as if it had been working in a worktree.

The other three agents had honored the worktree-add instruction. They were working in isolated worktrees with no idea that the fourth agent was modifying the main checkout under them. By the time the main thread came back to merge, the main checkout was on a different commit than expected, and one of the four agents’ “branches” was actually just commits sitting on main.

The damage in this specific case was modest because the main thread caught it before further dispatches went out. But the failure mode is silent. The agent reports success. The git history looks plausible. The corruption is only visible if you check git branch --show-current in the main checkout between dispatches and notice it is not what it should be.

The Verification Discipline

The fix is one sanity check between dispatches: read the main checkout’s current branch and confirm it has not moved. If it has, one of the parallel agents skipped the worktree step and committed against main. The recovery is to cherry-pick that agent’s commits onto its intended branch and reset main back to the pre-dispatch HEAD.

The check costs nothing. The corruption it catches is the kind that destroys a session if it goes unnoticed.

In sessions where worktree-isolated dispatch is the default, this check should be wired into the dispatch wrapper itself. The main thread records the main checkout’s commit before fanning out, fans out the parallel agents, waits, then records the main checkout’s commit after. If the two commits differ, an agent broke isolation and the recovery routine runs.

Which Models Skip the Setup

Anecdotally, smaller and faster models skip the worktree-add step more often than larger and more deliberate ones. The pattern is consistent enough that it is worth tracking which model classes you can trust with multi-step setup-then-work briefs. If the briefs have to be split (one dispatch for setup, a second for the work), the parallelism gain shrinks but the failure mode disappears.

The trade is real. Faster models with shorter dispatches are faster on the average case and more dangerous on the corner cases. Slower models with longer dispatches are slower but safer. For workloads where a corrupted main checkout is expensive to recover from, the safer model is the right choice even at the wall-clock cost.

The Numbers

Last session’s four-dispatch fan-out cost roughly thirty seconds of setup, six minutes of parallel wall clock, and about a minute of merge cleanup. The sequential alternative was estimated at thirty-plus minutes. The 5x speedup held.

The cost of the failure mode that almost fired: zero, because the check caught it. The hypothetical cost of NOT catching it: at least a full session’s worth of debugging git history, plus the work that would have been lost.

The Framing

Parallel dispatch is not “free speedup.” It is a trade where you exchange one expensive constraint (wall clock) for a different expensive constraint (correctness of isolation). The trade pays off when the independence check is real and the isolation mechanism is honored.

When the isolation breaks silently, the speedup turns into a debugging session that costs more than the sequential version would have. The discipline that makes parallel dispatch usable is the discipline of checking the isolation between dispatches, not the discipline of making the dispatches go faster.

Parallelism is the easy part. Isolation is the part that decides whether the parallelism was worth doing.