Kjell Tore Guttormsen f93d6abdae feat: initial open marketplace with llm-security, config-audit, ultraplan-local

2026-04-06 18:47:49 +02:00

13 KiB

Raw Blame History

ultraplan-local Roadmap

Vision

ultraplan-local is a deep planning specialist. It does one thing: creates plans so thorough they can be implemented without questions.

The plan is the product. Everything else exists to make the plan better.

What we ARE

The most thorough planning process available as a Claude Code plugin
Autonomous: gathers all information itself, needs no human help along the way
Plans that stand on their own — implementable by someone who has never seen the codebase

What we are NOT

Not a project engine (that's Harness)
Not a behavior framework (that's Superpowers)
Not an execution engine, team manager, or issue tracker
Not optimized for infrastructure-as-code (Terraform, Helm, Pulumi) — the agents are designed for application code. IaC projects get a result, but agents like architecture-mapper and test-strategist provide less value there.

Quality Goals

A plan from ultraplan-local should:

Be implementable without asking questions
Have testable verification criteria for each step
Contain no placeholders, TBDs, or vague instructions
Include TDD structure where the project uses tests
Have a quantitative assessment of its own quality (score A-D)

v0.4.0 — Information-Complete and Plan Quality (DONE)

Completed 2026-04-06. See CHANGELOG.md for details.

Delivered:

3 new agents: task-finder, git-historian, spec-reviewer
All agents run for all codebase sizes (turns scale, not agent count)
No-placeholder rule in plan-critic (TBD/TODO = blocker)
Quantitative plan scoring (A-D grades, 5 weighted dimensions)
[ASSUMPTION] marking with threshold warning (>3 = warning)
Spec-reviewer as new phase before exploration

v1.0.0 — Production-Ready Plugin

Two pillars: (1) features that close real user friction, and (2) repo infrastructure for a credible open-source project.

Each feature item has a Rationale tracing back to a role simulation or research finding.

Pillar 1: Plugin Features

1. `--quick` mode

New mode that skips the exploration phase. Plans directly from interview plus minimal file checking (Glob/Grep to verify file paths mentioned in the conversation).

/ultraplan-local --quick Add rate limiting to the API

Flow: interview → spec → plan (without agent swarm) → adversarial review → done.

Useful when:

The developer knows the code well and needs structure, not mapping
The codebase is small and simple
The time/cost of full exploration isn't worth it

Rationale: Solo developer simulation revealed that 6 agents on 12 files feels like overkill when the developer already knows the code. git-historian provides zero value for solo projects with short history.

Changes: commands/ultraplan-local.md (new mode parsing), agents/planning-orchestrator.md (new quick path that skips Phase 2).

2. `--export pr` for shareable plan output

Generates a PR-ready summary from an existing plan:

/ultraplan-local --export pr .claude/plans/ultraplan-2026-04-06-rate-limiting.md

Output: a markdown block formatted as a PR description (Summary, Changes, Test plan) that can be copied directly into a PR.

Possible export formats:

pr — PR description with summary and test plan
issue — issue comment with plan summary
markdown — clean plan without internal metadata (score, revisions)

Rationale: OSS contributor simulation showed that the plan is a local file with no easy way to share. The user wanted to share with a maintainer for approval before implementation.

Changes: commands/ultraplan-local.md (new --export mode parsing and output format).

3. task-finder categorization

Update the task-finder agent to categorize findings into three levels:

Category	Meaning	Example
Must-change	Files that must be modified to implement the task	`src/auth/middleware.ts`
Must-respect	Interfaces and contracts that must be honored	`src/types/auth.d.ts`
Reference	Useful context, but no changes needed	`src/utils/jwt.ts`

Rationale: Senior engineer simulation (2000+ files) revealed that task-finder reported 47 files in a flat list. Without prioritization, it's useless for planning.

Changes: agents/task-finder.md (updated output format and instructions).

4. Adaptive interview depth

The interview adapts to the user's response depth:

Detailed answers (>2 sentences, technical language): ask fewer, more targeted questions. Assume the user is senior and knows what they want.
Short/uncertain answers (<1 sentence, "don't know"): ask simpler questions, offer alternatives instead of open-ended questions. For bugs: focus on reproduction ("What do you see?" / "What did you expect?") instead of technical requirements.

Rationale: Junior developer simulation showed that the interview assumes the user understands the problem. The junior didn't know enough to answer open-ended questions well, resulting in a thin spec and a C-grade plan.

Changes: commands/ultraplan-local.md (updated Phase 2 interview instructions).

5. Complete `plugin.json` metadata

Add missing fields for marketplace readiness:

{
  "name": "ultraplan-local",
  "version": "1.0.0",
  "description": "...",
  "author": "Kjell Tore Guttormsen",
  "homepage": "https://git.fromaitochitta.com/open/ultraplan-local",
  "repository": "https://git.fromaitochitta.com/open/ultraplan-local.git",
  "license": "MIT",
  "keywords": ["planning", "implementation", "agents", "adversarial-review"]
}

Rationale: Plugin ecosystem research showed that plugin.json is missing 5 of the fields that marketplace and discovery tools use. Highest leverage gap for distribution.

Changes: .claude-plugin/plugin.json.

6. Documented IaC limitation in README

Add a section in README under "When to use" that explicitly states that ultraplan-local is designed for application code, and that IaC projects (Terraform, Helm, Pulumi, CDK) get reduced value from the exploration agents.

Rationale: DevOps simulation showed that architecture-mapper looks for src/lib/controllers (irrelevant for Terraform), test-strategist doesn't know infra testing tools, and the plan misses Terraform-specific steps like state locking.

Changes: README.md (new section in the "When to use" section).

Pillar 2: Repo Infrastructure

7. Forgejo issue templates

Create .forgejo/ISSUE_TEMPLATE/ with two YAML templates:

bug_report.yaml:

Plugin version (required)
Claude Code version
Reproduction steps
Expected vs actual behavior
Auto-label: type: bug

feature_request.yaml:

Problem description
Proposed solution
Alternatives considered
Auto-label: type: enhancement

Rationale: Forgejo audit showed no .gitea/ or .forgejo/ infrastructure. Standard for an open-source project that accepts issues.

8. Label set in Forgejo

Create via Forgejo API or UI:

Label	Color	Use
`type: bug`	red	Something is broken
`type: enhancement`	blue	New feature or improvement
`type: docs`	green	Documentation only
`status: confirmed`	yellow	Verified/accepted
`status: wontfix`	gray	Closed without action
`good first issue`	purple	Low complexity, well scoped

Rationale: No labels exist. Necessary for triage.

9. Forgejo Release for v1.0.0

Create a Release object (not just a git tag) with CHANGELOG content attached. Use v1.0.0 as the tag name.

Rationale: Repo audit showed that commits exist but no Release objects. Releases are the first thing users see on a Forgejo project.

10. README badges

Add badges to README:

![Version](https://img.shields.io/badge/version-1.0.0-blue)
![License](https://img.shields.io/badge/license-MIT-green)
![Platform](https://img.shields.io/badge/platform-Claude%20Code-purple)

Rationale: Quality signal on first visit. Standard for open source.

11. CONTRIBUTING.md tailored for solo project

Rewrite to be honest about the contribution model:

"This is a solo project. Issues are welcome. PRs are considered but not expected."
Remove section about PR workflow
Keep: how to report bugs, suggest improvements

Rationale: Current CONTRIBUTING.md implies that PRs are welcome, but the project is marked as solo. Dishonest signaling.

v1.3.0 — Session-Aware Parallel Execution (DONE)

Completed 2026-04-06. See CHANGELOG.md for details.

Delivered:

/ultraexecute-local auto-detects ## Execution Strategy in plans
Multi-session parallel orchestration via claude -p per wave
--fg flag: force sequential execution, ignore Execution Strategy
--session N flag: execute only session N (used by child processes)
Phase 2.5 (Execution strategy decision) and Phase 2.6 (Multi-session orchestration)
Execution Strategy section in plan template (sessions, waves, scope fences)
planning-orchestrator generates Execution Strategy for plans with > 5 steps
File overlap analysis to group steps into sessions and waves

v1.2.0 — Disciplined Plan Executor (DONE)

Completed 2026-04-06. See CHANGELOG.md for details.

Delivered:

/ultraexecute-local command: 9-phase workflow for disciplined plan execution
4 modes: execute, --resume, --dry-run, --step N
Per-step protocol: implement → verify → on-failure → checkpoint
Progress file for crash recovery and resume
Entry/exit condition checking for session specs
Scope fence enforcement (never-touch protection)
JSON summary block for headless log parsing
Stats tracking to ultraexecute-stats.jsonl
Positioning: Harness = project engine, Kiur = TDD, Ultraexecute = plan executor

v1.1.0 — Headless Multi-Session Execution (DONE)

Completed 2026-04-06. See CHANGELOG.md for details.

Delivered:

--decompose mode: splits plan into self-contained headless sessions
--export headless format: shortcut to decompose
session-decomposer agent: analyzes step dependencies, groups into sessions, generates dependency graph + launch script
Session spec template with scope fences, entry/exit conditions, failure handling
Failure recovery per step in plan template: On failure + Checkpoint
Headless readiness as new dimension in plan-critic (9 dimensions, rebalanced weights)

Future (after v1.1, unprioritized)

Based on competitive analysis and simulations. Each item has a rationale for why it's not in v1.0.

Feature	Source	Why not v1.0
Plan auto-update during execution	Windsurf differentiator	Major architecture change — the plan is currently static after generation. Requires hooks that observe execution and update the plan file. Windsurf spent months on this.
Issue integration (`--issue #42`)	OSS contributor simulation	Tracker-dependent (Linear, Forgejo, GitHub, Jira). Too ambitious for first stable release.
Plan diff on re-planning	Senior engineer simulation	Useful but not a blocker. Can be solved with `diff` on two plan files manually.
Cost estimate in plan summary	Senior engineer simulation	Requires reliable token counting. Claude Code API doesn't expose this directly.
IDE sidebar for plan	Windsurf differentiator	Requires VS Code extension — entirely different technology stack.
IaC-adapted agents	DevOps simulation	Niche need. Solved with documented limitation in v1.0.
Bug mode (`--bug`)	Junior simulation	Can be partially solved with adaptive interview (v1.0 item 4). Dedicated mode is overkill for first release.
Solution memory	Roadmap v0.4.0 future	Secondary — plan quality should stand on its own without history.

Competitive Position

What ultraplan-local has that nobody else does

Feature	Copilot Workspace	Cursor	Windsurf	ultraplan-local
Adversarial review (plan-critic + scope-guardian)	No	No	No	Yes
Quantitative plan scoring (A-D)	No	No	No	Yes
No-placeholder enforcement (hard blocker)	No	No	No	Yes
`[ASSUMPTION]` marking with threshold warning	No	No	No	Yes
Spec-driven headless mode (`--spec`)	No	No	No	Yes
TDD-structured steps (RED-GREEN-REFACTOR)	No	No	No	Yes
Full interview phase for requirements gathering	No	No	Partial	Yes
12 specialized agents	No	No	No	Yes
Session decomposition into headless sessions	No	No	No	Yes
Failure recovery per step (On failure/Checkpoint)	No	No	No	Yes
Parallel wave-based execution (`launch.sh`)	No	No	No	Yes

Known gaps vs competitors

Gap	Who has it	Status
Plan updates during execution	Windsurf	Future — major architecture change
PR-native output	Copilot Workspace	v1.0 — `--export pr`
Issue integration	Copilot Workspace	Future — tracker-dependent
Sandbox execution during planning	Cursor	Out of scope — different architecture
IDE sidebar	Windsurf	Future — requires VS Code extension

Compatibility

Harness users: Plans from ultraplan are detailed enough to manually decompose into Harness feature_list.json
Superpowers users: TDD task structure matches Superpowers' plan format. Plans are compatible with the executing-plans skill.

13 KiB Raw Blame History

ultraplan-local Roadmap

Vision

What we ARE

What we are NOT

Quality Goals

v0.4.0 — Information-Complete and Plan Quality (DONE)

v1.0.0 — Production-Ready Plugin

Pillar 1: Plugin Features

1. --quick mode

2. --export pr for shareable plan output

3. task-finder categorization

4. Adaptive interview depth

5. Complete plugin.json metadata

6. Documented IaC limitation in README

Pillar 2: Repo Infrastructure

7. Forgejo issue templates

8. Label set in Forgejo

9. Forgejo Release for v1.0.0

10. README badges

11. CONTRIBUTING.md tailored for solo project

v1.3.0 — Session-Aware Parallel Execution (DONE)

v1.2.0 — Disciplined Plan Executor (DONE)

v1.1.0 — Headless Multi-Session Execution (DONE)

Future (after v1.1, unprioritized)

Competitive Position

What ultraplan-local has that nobody else does

Known gaps vs competitors

Compatibility

13 KiB

Raw Blame History

1. `--quick` mode

2. `--export pr` for shareable plan output

5. Complete `plugin.json` metadata