ktg-plugin-marketplace

Author	SHA1	Message	Date
Kjell Tore Guttormsen	79b6e29073	feat(humanizer): update audit/analysis command templates group A [skip-docs] Wave 5 Step 13. Threads the humanizer vocabulary through five audit/ analysis command templates and adds a shape test that locks the structure in place. - commands/posture.md, tokens.md, feature-gap.md (findings-renderers): reference userImpactCategory/userActionLanguage/relevanceContext; remove hardcoded A/B/C/D/F-to-prose tables (humanizer owns the grade-context vocabulary now via the stderr scorecard headline). - commands/manifest.md, whats-active.md (inventory CLIs): add --raw pass-through for CLI-surface consistency. --raw is a no-op in these CLIs, but the flag is threaded through so users get uniform behaviour. - All five files: --raw flag parsed from $ARGUMENTS and passed verbatim to the underlying scanner CLI when present. tests/commands/group-a-shape.test.mjs (new, +5 tests, 767 → 772): - structural: every file has a bash invocation block, Read tool reference, and --raw/$ARGUMENTS plumbing - findings-renderers only: at least one humanized field referenced; no hardcoded "[grade] grade is..." prose tables	2026-05-01 19:41:08 +02:00
Kjell Tore Guttormsen	07629e9dae	test(humanizer): default-output snapshot test (SC-5) [skip-docs] Step 12 of v5.1.0 humanizer Wave 4. Adds tests/snapshot-default-output .test.mjs and seeds three snapshots in tests/snapshots/default-output/ that capture humanized default-mode output for representative CLIs. Coverage: - scan-orchestrator: stdout JSON envelope (humanized findings); time fields normalized. - token-hotspots-cli: stdout JSON envelope (humanized payload.findings); duration_ms normalized. - posture: stderr humanized scorecard; (Xms) durations normalized. Snapshot envelope is uniform on disk: { kind: 'json', payload: ... } for JSON streams and { kind: 'text', payload: '...' } for stderr text. This keeps the snapshot files self-describing and easy to read. Re-seeding requires UPDATE_SNAPSHOT=1 — drift fails the test by design, so any humanizer prose change is intentional and re-approved. Tests: 764 to 767 (+3 SC-5 cases). Full suite passes. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 18:21:31 +02:00
Kjell Tore Guttormsen	20b867adc1	test(humanizer): --raw backwards-compatibility test (SC-7) [skip-docs] Step 11 of v5.1.0 humanizer Wave 4. Adds tests/raw-backcompat.test.mjs mirroring the SC-6 contract for the --raw flag — the explicit "v5.0.0 verbatim" escape hatch. - 4 fixture-deterministic CLIs (scan-orchestrator, posture, token-hotspots-cli, fix-cli) get strict byte-equal against tests/snapshots/v5.0.0/<cli>.json with time fields normalized. - drift-cli is checked under the same contract guarded by ensureDriftBaseline. - 3 environment-aware CLIs (plugin-health, manifest, whats-active) are checked for mode-equivalence (--raw equals --json). - Posture additionally asserts its --raw stderr scorecard reproduces tests/snapshots/v5.0.0-stderr/posture.txt verbatim, modulo (Xms) duration markers normalized to (0ms). - Cross-cutting suite asserts --raw findings carry no humanizer fields on any CLI. Tests: 751 to 764 (+13 SC-7 cases). Full suite passes. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 18:20:04 +02:00
Kjell Tore Guttormsen	12af13a703	test(humanizer): JSON backwards-compatibility test (SC-6) [skip-docs] Step 10 of v5.1.0 humanizer Wave 4. Adds tests/json-backcompat.test.mjs asserting that --json output of every CLI remains backwards-compatible with the v5.0.0 contract. Coverage strategy mirrors Wave 3 cli-humanizer test discovery: - 4 fixture-deterministic CLIs (scan-orchestrator, posture, token-hotspots-cli, fix-cli) get strict byte-equal byte-equal --json vs frozen tests/snapshots/v5.0.0/ snapshot, with time-varying fields (timestamp, target path, duration_ms, generatedAt, durationMs) normalized. - drift-cli is checked with the same byte-equal contract guarded by an ensureDriftBaseline precondition; the test silently skips when the baseline cannot be created. - 3 environment-aware CLIs (plugin-health-scanner, manifest, whats-active) read live config-cascade state, so frozen snapshots drift as the marketplace evolves. They are verified by mode- equivalence (--json equals --raw) instead — the same approach established in Wave 3 cli-humanizer.test.mjs. A cross-cutting suite asserts --json output of the 4 deterministic CLIs never carries humanizer fields (userImpactCategory, userActionLanguage, relevanceContext) on any finding, walking both top-level findings arrays and scanners[].findings paths. Tests: 739 to 751 (+12 SC-6 cases). Full suite passes. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 18:18:29 +02:00
Kjell Tore Guttormsen	8b146bf489	feat(humanizer): scenario read-test corpus + runner (SC-4) [skip-docs] Step 9 of v5.1.0 humanizer Wave 4. Adds tests/scenario-read-test.mjs runner, tests/scenario-read-test.test.mjs wrapper, and 5 scenario fixtures in tests/scenarios/ that feed deterministic raw findings through humanizeFinding and assert the humanized title/description/recommendation match brief-owner-approved regex patterns encoding the ground-truth what/why/whatNext answers. Corpus selection (per brief criteria): - 01-tok-cascade.json - TOK/CPS category (token efficiency) - 02-cps-volatile.json - TOK/CPS category (cache prefix stability) - 03-cnf-conflict.json - CNF category (conflicts) - 04-gap-no-claude-md.json - GAP category (feature gap) - 05-set-invalid-json.json - SET category, AND its v5.0.0 title + description carry tier1 'invalid' (the brief criterion 'one finding whose v5.0.0 description uses a forbidden word'). Runner mechanics: - Loads scenarios matching ^\\d{2}-[a-z0-9-]+\\.json$ in sorted order. - Calls humanizeFinding(scannerInput) and matches each humanized field against its declared pattern (case-insensitive regex). - Verifies humanizer-added structural fields (userImpactCategory, userActionLanguage, relevanceContext) are non-empty strings. - Per session decision (1a) acceptance is deterministic regex matching without a runtime human approval gate. Wrapper adds 3 tests: scenario-match (binds runner to node --test), category-coverage (TOK/CPS, CNF, GAP, SET all present), and tier1-presence (at least one v5.0.0 title or description contains a tier1 forbidden word). Tests: 736 to 739 (+3 SC-4 tests). Full suite passes. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 18:16:23 +02:00
Kjell Tore Guttormsen	c5c937e94e	feat(humanizer): forbidden-words lint runner + test wrapper (SC-3) [skip-docs] Step 8 of v5.1.0 humanizer Wave 4. Adds tests/lint-default-output.mjs runner and tests/scanners/lint-default-output.test.mjs wrapper that exercise SC-3 against the 6 prose CLIs (scan-orchestrator, posture, token-hotspots-cli, plugin-health-scanner, drift-cli, fix-cli) running in default (humanized) mode against tests/fixtures/marketplace-medium. Lint scope is stderr only — JSON envelope keys ("scanner", "severity") are structural, not prose. Humanized prose fields embedded inside JSON are already covered by tests/lib/humanizer-data.test.mjs tier1/tier3 checks. Code references inside backticks pass the lint (stripBacktickSpans) so technical identifiers can appear when wrapped. Default-mode prose fixes to land lint at zero violations: - scan-orchestrator: top banner switches to "Config-Audit v2.2.0" and per-scanner progress wraps "[XXX] Label" in backticks. --raw and --json paths preserve the v5.0.0 verbatim banner via new opts.humanizedProgress flag on runAllScanners. - plugin-health-scanner: top banner switches to "Plugin Health v2.1.0" in default mode; --raw/--json keep "Plugin Health Scanner v2.1.0". - scoring.mjs generateHealthScorecard humanized branch: area names (CLAUDE.md, Hooks, MCP, Settings, Rules, Imports, Conflicts, Token Efficiency, Plugin Hygiene) are wrapped in backticks; dot-padding compensates so column alignment matches v5.0.0 layout. - posture / drift-cli / fix-cli: thread humanizedProgress flag through their runAllScanners calls so default mode emits humanized progress and --raw/--json preserve the v5.0.0 stderr snapshot. Test infrastructure only — user-facing docs land in Wave 5/6 once commands and agents consume the humanized payload. Tests: 735 to 736 (+1 SC-3 wrapper). Full suite passes. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 18:11:15 +02:00
Kjell Tore Guttormsen	5eecb968d8	feat(humanizer): wire humanizer into 6 remaining CLIs with --raw Adds --raw flag to all 6 remaining CLIs and wires humanization into the default rendering path. --json and --raw both bypass humanization for v5.0.0 byte-equal output; default mode humanizes findings/diff/prose. token-hotspots-cli: humanizes payload.findings before stdout JSON write. plugin-health-scanner: humanizes finding titles in stderr brief summary; --json/--raw write byte-identical v5.0.0-shape result to stdout. drift-cli: humanizes diff.{newFindings,resolvedFindings,unchangedFindings, movedFindings} before formatDiffReport; --raw applies to save and list modes too. Baselines remain raw v5.0.0 on disk. fix-cli: humanizes manual-finding titles in stderr fix-plan prose; both --json and --raw produce identical machine-readable JSON to stdout. manifest, whats-active: --raw is a no-op (no findings, inventory only) but parsed for CLI surface consistency. Decision on missing --output-file flag for drift-cli/fix-cli/plugin-health: deferred. SC-6/SC-7 tests in Wave 4 will use stdout-redirect (the simpler Alt B path) since these CLIs already write JSON to stdout in machine modes. Test cli-humanizer.test.mjs covers all 6 CLIs. Three CLIs that read environment state (plugin-health, manifest, whats-active) verify mode-equivalence (--json == --raw) instead of frozen-snapshot byte-equal, because their output reflects current marketplace state which drifts as plugins are added since the Wave 0 capture. Wave 3 / Step 7 of v5.1.0 humanizer. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 17:47:09 +02:00
Kjell Tore Guttormsen	70ff900578	feat(humanizer): wire humanizer into posture and scoring scorecard generateHealthScorecard signature: 2-arg → 3-arg (areaScores, opportunityCount, options = {}). options.humanized=true renders friendlier title, grade-context line per overall grade, and rephrased opportunity line. options.humanized=false (or 2-arg call) preserves v5.0.0 verbatim output for backwards-compat. topActions also gets an optional options.humanized that swaps recommendations through humanizeFinding lookup. posture.mjs main(): --json → write JSON to stdout, suppress stderr scorecard --raw → write JSON to stdout (byte-identical to --json), write v5.0.0 verbatim scorecard to stderr default → humanized scorecard to stderr, no stdout posture.test.mjs scorecard-prose assertions re-anchored to --raw mode (the explicit v5.0.0 path) — Wave 0 audit only covered finding-title strings; scorecard prose surfaces here for the first time. Wave 3 / Step 6 of v5.1.0 humanizer. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 17:38:03 +02:00
Kjell Tore Guttormsen	5ff6594976	feat(humanizer): wire humanizer into scan-orchestrator main with --raw bypass Adds --json and --raw flags to scan-orchestrator CLI main(). Default mode runs humanizeEnvelope(env) before serialization; --json and --raw bypass the humanizer for v5.0.0 byte-equal output (SC-6 / SC-7 paths). Save-baseline path always writes the raw v5.0.0-shape envelope so future humanizer-data updates do not trigger false-positive drift findings. runAllScanners() unchanged — it remains the v5.0.0-shape source of truth for in-process callers (posture, scoring, drift, etc.). Wave 3 / Step 5 of v5.1.0 humanizer. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 17:31:37 +02:00
Kjell Tore Guttormsen	dff278f02a	test(humanizer): replace title-string assertions with ID-based checks Wave 2 / Step 4 of v5.1.0 plain-language UX humanizer rollout. Re-anchors 34 title-string assertions across 4 test files so they survive Wave 3's title/description/recommendation rewriting at the CLI layer. Anchoring strategy per scanner: - GAP findings: scanner + category + recommendation substring (humanizer preserves stable identifiers like CLAUDE.md, .mcp.json, hook in rec). Hardcoded CA-GAP-NNN IDs for positive checks. - HKV findings: scanner + evidence regex (evidence preserved verbatim). - SET findings: scanner + evidence regex (evidence preserved verbatim). - PLH findings: scanner + hardcoded CA-PLH-NNN IDs (no evidence on most PLH findings, so ID is the only stable anchor for specific cases; negative checks use scanner + title-substring spanning raw + humanized). Per docs/v5.1.0-test-audit.md classification: only (b) WILL BREAK assertions modified. (a) shape-only assertions (error-message formatting, pure existence checks) untouched. tests/lib/output.test.mjs and tests/lib/diff-engine.test.mjs and tests/scanners/fix-engine.test.mjs unchanged (synthetic test inputs, not scanner output). Test count unchanged: 689/689 pass. IDs harvested via deterministic runtime dump per fixture (resetCounter + scan).	2026-05-01 17:22:55 +02:00
Kjell Tore Guttormsen	1a45caf18b	feat(humanizer): translation module with category, action, relevance Wave 1 / Step 3 of v5.1.0 plain-language UX humanizer. scanners/lib/humanizer.mjs exports three pure functions: - humanizeFinding(f) -> new finding object with translated title/description/recommendation + three new fields (userImpactCategory, userActionLanguage, relevanceContext). - humanizeFindings(findings) -> mapped array. - humanizeEnvelope(env) -> walks env.scanners[].findings. Plus computeRelevanceContext(filePath) as a named export for unit testing. Field semantics: - userImpactCategory: from scanner prefix per research/02 line 124 (Configuration mistake / Conflict / Wasted tokens / Dead config / Missed opportunity / Other). - userActionLanguage: from severity per research/02 line 134 (Fix this now / Fix soon / Fix when convenient / Optional cleanup / FYI). - relevanceContext: deterministic file-path heuristic — looks for /tests/fixtures/ or /test/fixtures/ substring (test-fixture-no-impact), .local. basename (affects-this-machine-only), defaults to affects-everyone. No subprocess, no network. Lookup order per scanner: static[title] -> patterns regex match -> _default -> fall through to original strings (when scanner prefix absent). Original id, scanner, severity, file, line, evidence, category, autoFixable, and optional details are preserved exactly. Pure — verified by deepEqual of input before/after. Test (32 cases): purity, field preservation across all paths, known/unknown scanner handling, all 5 severities, all 6 categories, relevance heuristic for 4 path types, envelope walking, ANSI-free guarantee. All pass. Regression: 689/689 tests (657 + 32 new = 54 new across Wave 1). Project: .claude/projects/2026-05-01-config-audit-ux-redesign/	2026-05-01 17:03:49 +02:00
Kjell Tore Guttormsen	02ee2a8b83	feat(humanizer): translation table for 12 scanners + plugin-health Wave 1 / Step 2 of v5.1.0 plain-language UX humanizer. scanners/lib/humanizer-data.mjs exports TRANSLATIONS keyed by scanner prefix (CML, SET, HKV, RUL, MCP, IMP, CNF, GAP, TOK, CPS, DIS, COL, PLH). Each scanner has: - static: exact-title -> {title, description, recommendation} - patterns: array of {regex, translation} for template-literal titles - _default: graceful fallback for unknown findings Architectural change vs. plan: keys translations by exact scanner title (not finding ID). Reason: finding IDs are sequence-based (global counter in lib/output.mjs:34), not stable per finding-type — two runs can produce different IDs for the same logical issue. Title strings ARE stable (defined as string literals or template patterns in the scanner source). Translations follow research/03 SR-1..SR-17: - active voice, second person, present tense - sentences <= 25 words - tier1 absolute prohibitions and tier3 domain jargon are kept out of prose - tier1/tier3 terms are permitted inside `backtick spans` (code references like filenames and field names) — established technical-doc convention Test (12 cases): all 13 scanners covered; every static and pattern entry has the 3 required fields; tier1 and tier3 forbidden-word checks pass (with backtick-span exclusion); reference-stable imports. All pass. Regression: 657/657 tests (645 + 12 new). Project: .claude/projects/2026-05-01-config-audit-ux-redesign/	2026-05-01 17:00:59 +02:00
Kjell Tore Guttormsen	8c07fe3493	feat(humanizer): forbidden-words data file (tier1/2/3) Wave 1 / Step 1 of v5.1.0 plain-language UX humanizer. tests/lint-forbidden-words.json defines the SC-3 forbidden-words vocabulary used by the lint runner (Wave 4 / Step 8) and the humanizer-data translation guard (Wave 1 / Step 2). - Tier 1: 19 absolute prohibitions (failure if matched in default output) — sourced from Microsoft Writing Style Guide, Federal Plain Language, GOV.UK, Google Developer Style, Apple HIG. - Tier 2: 24 strong-avoidance terms (warning if matched) — same sources plus Mailchimp. - Tier 3: 12 domain-specific jargon terms (failure if matched in default output, allowed in --raw and --json paths) — sourced from research/03 jargon table. Counts diverge from plan.md (18/21/11) — JSON tracks the brief's verbatim lists at research/03 lines 200-202 plus tier3 hook entry from the brief's table. Plan revision noted in audit-doc. Test: 10 cases verifying parse, count, schema completeness, spot checks per tier, no cross-tier duplicates. All pass. Regression: 645/645 tests (635 + 10 new). Project: .claude/projects/2026-05-01-config-audit-ux-redesign/	2026-05-01 16:53:37 +02:00
Kjell Tore Guttormsen	2397ffb5e4	chore(humanizer): pre-flight snapshots + test audit for v5.1.0 Wave 0 / Step 0 of the v5.1.0 plain-language UX humanizer plan. Captures v5.0.0 baseline output for all 8 CLIs at tests/snapshots/v5.0.0/ — these snapshots are immutable references for SC-6 (--json byte-equal) and SC-7 (--raw byte-equal) tests in later waves. - 5 CLIs captured via --output-file: scan-orchestrator, posture, token-hotspots-cli, manifest, whats-active - 3 CLIs captured via stdout redirect (no --output-file support): drift-cli (after baseline seed), fix-cli, plugin-health-scanner - Posture stderr scorecard captured separately for SC-7 stderr-mode comparison docs/v5.1.0-test-audit.md classifies all 42 .title references in 7 known test files: 34 will break under humanization (literal string equality / substring), 8 are safe (test fixtures or error formatting). This document is the change list for Step 4. Project: .claude/projects/2026-05-01-config-audit-ux-redesign/	2026-05-01 16:47:13 +02:00
Kjell Tore Guttormsen	395a9bd947	docs(config-audit): v5 implementation log — Session 5 release result v5.0.0 SHIPPED 2026-05-01. Tag config-audit/v5.0.0 pushed to Forgejo. SC-6b release-gate PASS at -0.85% delta (CLAUDE.md actual 589 vs estimated 594, well within ±5% gate). Per-step: - Step 28: README/CLAUDE.md straggler-sweep + self-audit counter alignment - Step 29: version bump 4.0.0 → 5.0.0 + consolidated CHANGELOG - Step 30: full audit + live SC-6b gate + tag (incl. one in-step bug fix for hotspot.path exposure, required to make calibration measurable) 635 tests still green throughout. No blockers carried forward. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 09:48:40 +02:00
Kjell Tore Guttormsen	6cfca82885	fix(config-audit): expose hotspot.path for --accurate-tokens calibration + SC-6b PASS The v5.0.0-rc.1 N5 implementation looked up hotspot.path in calibrateAgainstApi() but token-hotspots.mjs only emitted hotspot.source — calibration silently produced 0 actual_tokens because every iteration hit the `if (!hotspot?.path) continue` guard. Fix: file-backed hotspots now expose `path: h.absPath` in the JSON output. MCP-server hotspots intentionally leave path unset — their tokens are runtime tool-schema (formula-based: 500 + toolCount × 200), not file content readable by count_tokens. SC-6b release-gate verified against tests/fixtures/marketplace-large: - Actual (count_tokens, claude-opus-4-7): 589 tokens for CLAUDE.md - Estimated (4-bytes/token byte heuristic): 594 tokens - Delta: -5 tokens / -0.85% — well within ±5% gate. PASS. CHANGELOG: documented the fix + SC-6b result inline under [5.0.0]. All 635 tests still green. No estimateTokens tuning required for v5.0.0. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 09:45:56 +02:00
Kjell Tore Guttormsen	dcf8087972	chore(config-audit): bump version to 5.0.0 - .claude-plugin/plugin.json: 4.0.0 → 5.0.0 - README.md: version badge bump + v5.0.0 row in Version History - CHANGELOG.md: consolidated `## [5.0.0]` entry covering alpha.1, alpha.2, beta.1, rc.1 — Summary, Added, Changed, Removed, Breaking changes, Migration notes, Tests, Notes (incl. SC-6b deferred-to-implementation-log) - root README.md: Config-Audit row v4.0.0 → v5.0.0; counts updated (8→12 scanners, 17→18 commands, 543→635 tests, 4→6 patterns, +manifest command, +--accurate-tokens, +CPS/DIS/COL coverage) No code changes in this commit. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 09:39:08 +02:00
Kjell Tore Guttormsen	5bf500e1a8	docs(config-audit): straggler sweep for v5.0.0 — sync all badge counts Reconcile README/CLAUDE.md/commands/agents to filesystem truth ahead of v5.0.0 release. Self-audit --check-readme now passes (counts: scanners 12, commands 18, tests 635, knowledge 8, agents 6, hooks 4). Self-audit (scanners/self-audit.mjs): - Exclude plugin-health-scanner.mjs from countScannerShape (it is a "standalone" scanner per README/CLAUDE.md taxonomy; orchestrated scanners stay at 12) - countTestCases: spawn `node --test` and parse the `tests N` line so the badge reflects test cases (635), not test files (36). countTestFiles kept as fallback when subprocess fails. README.md: - Badges: scanners 9→12, commands 17→18, tests 543→635 - Body counts updated: 8 quality scanners → 12 deterministic scanners; 8 quality areas → 10 (incl. Plugin Hygiene from N6); 9 Node.js scanners → 12 - Scanner table extended with CPS / DIS / COL rows; TOK row reflects the v5 Pattern E/F/N1 expansion (sonnet-era removed) - CLI table adds manifest, whats-active, --accurate-tokens, --with-telemetry-recipe - Knowledge table adds opus-4.7-patterns.md and cache-telemetry-recipe.md - Scanner Lib table notes WEIGHTS export, severity-weighted scoring, tokenizer-api - Action Engines table adds manifest.mjs, whats-active.mjs, token-hotspots-cli.mjs - Test count text 486→635, file count 27→36 (12 lib + 23 scanner + 1 hook) - Tokens command: 4-pattern phrasing → 6 patterns + --accurate-tokens - Adds /config-audit manifest and /config-audit whats-active to command tables CLAUDE.md: - Posture row: 8 → 10 quality areas - Tokens row: 4 patterns (incl. sonnet-era) → 6 patterns + --accurate-tokens - Adds /config-audit manifest entry - Scanner table: TOK description rewritten; CPS, DIS, COL rows added - Scanner Lib table: tokenizer-api.mjs added; v5 annotations on severity, output, scoring, active-config-reader - Action Engines table: manifest.mjs added; token-hotspots-cli.mjs flags expanded - Knowledge table: cache-telemetry-recipe.md added; configuration-best-practices notes Opus-4.7 cache-stability rewrite - Finding ID examples extended with CA-TOK-005, CA-CPS-001, CA-DIS-001, CA-COL-001 - Test count text 543→635, file count 31→36 commands/help.md: tokens/manifest added to Core commands/posture.md: 8 → 10 quality areas commands/config-audit.md: argument-hint adds tokens/manifest; router adds tokens and manifest; "Running 8 configuration scanners" → 12 agents/feature-gap-agent.md: 8 → 10 quality areas No production code paths changed beyond self-audit's badge-counting heuristic. All 635 tests still green. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 09:34:43 +02:00
Kjell Tore Guttormsen	17af3d55f6	docs(config-audit): v5 implementation log — Session 4 rc.1 result	2026-05-01 09:19:04 +02:00
Kjell Tore Guttormsen	1ce26fea41	docs(config-audit): CHANGELOG 5.0.0-rc.1 entry	2026-05-01 09:15:52 +02:00
Kjell Tore Guttormsen	b7414303de	feat(config-audit): --accurate-tokens API calibration (v5 N5) [skip-docs]	2026-05-01 09:15:02 +02:00
Kjell Tore Guttormsen	df6e012903	docs(config-audit): cache-telemetry recipe + --with-telemetry-recipe flag (v5 M7)	2026-05-01 09:12:17 +02:00
Kjell Tore Guttormsen	e1e23edbcd	docs(config-audit): knowledge rensing — Opus 4.7 cache-stability guidance (v5 M8)	2026-05-01 09:10:32 +02:00
Kjell Tore Guttormsen	346b4c4fb7	docs(config-audit): v5 implementation log — Session 3 beta.1 result Session 3 (beta.1) shipped 7 steps in one session: N1 (CA-TOK-005), N2 (manifest CLI), N3 (CPS), N4 (DIS), N6 (COL) + namespace research spike + CHANGELOG entry. 586 → 625 tests, all green. Per-step result table + notable observations and deviations recorded. No blockers carried into Session 4. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 07:50:22 +02:00
Kjell Tore Guttormsen	5a1e7cb510	docs(config-audit): CHANGELOG 5.0.0-beta.1 + N1 breaking note beta.1 wrap entry covering N1-N4 + N6 (Steps 18-22b). Includes explicit Known breaking changes section on CA-TOK-* glob suppression matching CA-TOK-005, and notes plugin-vs-built-in collision is deferred to v5.0.1. Tests: 586 → 625 (+39). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 07:47:12 +02:00
Kjell Tore Guttormsen	cd25c1e934	feat(config-audit): cross-plugin collision scanner COL (v5 N6) [skip-docs] New COL scanner detects skill-name collisions across plugins and between user-level skills (~/.claude/skills/) and plugin-bundled skills. Skill identity is the directory basename — matches how enumerateSkills resolves names. Detection rules (per docs/v5-namespace-research.md, confidence: medium): - Plugin-vs-plugin same skill name → severity low (CA-COL-001) - User-vs-plugin same skill name → severity medium (CA-COL-001) - Plugin-vs-built-in collisions: out of scope for v5.0.0 (insufficient verification — recorded for v5.0.1 follow-up). Findings carry details.namespaces array with {source, name, path} for every conflicting source — supports per-collision reporting downstream. output.mjs: finding() helper now passes through optional `details` field (scanner-specific structured payload). scoring.mjs: COL → "Plugin Hygiene" (new area, 10 total). Posture test updated from 9 → 10 area scores. .gitignore: docs/v5-namespace-research.md is local-only (Step 22a research output, gitignored per plan). Fixture collision-plugins/fake-home/ has user skill `review` colliding with plugin-a + plugin-b's `review` (medium severity), plus plugin-c's unique `summarize` (no collision). [skip-docs] reason: v5 plan fences off README/CLAUDE.md badge updates to Session 5; Forgejo pre-commit-docs-gate hook requires this tag. Tests: 617 → 625 (+8). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 07:46:15 +02:00
Kjell Tore Guttormsen	cc349d6fe1	feat(config-audit): disabled-in-schema scanner DIS (v5 N4) [skip-docs] New DIS scanner detects tools that appear in BOTH permissions.deny and permissions.allow within the same settings.json file. The deny list wins, so allow entries are dead config but still load on every turn and confuse intent. Tool identity = bare name (everything before "("). `Bash(npm:*)` and `Bash` are treated as the same tool, so a deny on `Bash` flags any `Bash(...)` allow entry. Severity: low. Wired into scan-orchestrator + scoring (area: Settings). Fixture denied-tools-in-schema has Bash in both arrays; healthy-project serves as the negative case. [skip-docs] reason: v5 plan fences off README/CLAUDE.md badge updates to Session 5; Forgejo pre-commit-docs-gate hook requires this tag. Tests: 611 → 617 (+6). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 07:39:58 +02:00
Kjell Tore Guttormsen	65087e624f	feat(config-audit): cache-prefix stability scanner CPS (v5 N3) [skip-docs] New CPS scanner walks CLAUDE.md cascade and flags volatile content between lines 31 and 150 — the cache-prefix window beyond TOK Pattern A's top-30 territory. Volatile content anywhere in the cached prefix forces a fresh cache write from that line down on every turn. Volatile-pattern set extends TOK Pattern A with: - shell-exec lines (! prefix) — common in CLAUDE.md to inject git/date - ${VAR} substitutions — vary per-shell, defeat cache reuse Severity: medium per finding. Skips lines 1-30 to avoid duplicating Pattern A's range; CPS' value is in the 31-150 zone. Wired into scan-orchestrator + scoring SCANNER_AREA_MAP. CPS shares the "Token Efficiency" area with TOK; scoreByArea now deduplicates by area name and combines counts across scanners contributing to the same area, so the 9-area scorecard contract holds. Fixtures volatile-mid-section/{volatile-line-60, volatile-line-200} verify both positive (line 60) and out-of-window (line 200) cases. [skip-docs] reason: v5 plan fences off README/CLAUDE.md badge updates to Session 5; Forgejo pre-commit-docs-gate hook requires this tag. Tests: 604 → 611 (+7). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 07:37:54 +02:00
Kjell Tore Guttormsen	0420b8cc4a	feat(config-audit): /config-audit manifest command (v5 N2) [skip-docs] New scanners/manifest.mjs CLI + commands/manifest.md slash command. Reads activeConfig and produces a flat, ranked list of every token source (CLAUDE.md cascade entries, plugins, skills, MCP servers, hooks) sorted DESC by estimated_tokens. CLAUDE.md per-file tokens are derived by distributing claudeMd.estimatedTokens across the cascade proportional to bytes. Tests cover both real-config (plugin root) and fixture (rich-repo with patched HOME containing 2 plugins + 3 skills + .mcp.json) paths, plus error handling (nonexistent path → exit 3, --output-file). Builds on readActiveConfig from M1 (v5 alpha.2). [skip-docs] reason: v5 plan fences off README/CLAUDE.md badge updates to Session 5; Forgejo pre-commit-docs-gate hook requires this tag on feat commits without doc changes. Tests: 593 → 604 (+11). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 07:32:54 +02:00
Kjell Tore Guttormsen	b2407a09b3	feat(config-audit): CA-TOK-005 MCP tool-schema budget (v5 N1) [skip-docs] Adds detectMcpToolBudget detection block in TOK scanner. Tiered severity per project-local .mcp.json server based on toolCount: - < 20: no finding - 20-49: low - 50-99: medium - 100+: high - null (manifest unparseable): low + "tool count unknown" message Scoped to source==='.mcp.json' to keep findings actionable for the audited path; plugin/user-level MCP servers are surfaced by the manifest scanner (Step 19 / N2). 5 fixtures (mcp-budget/{14,25,60,120,unknown}-tools) use inline `tools` arrays in .mcp.json — no node_modules needed for these tests. Tests assert title+severity (not exact ID) since TOK IDs are sequential per scan, not semantic per pattern. [skip-docs] reason: v5 plan fences off README/CLAUDE.md badge updates to Session 5; Forgejo pre-commit-docs-gate hook requires this tag on feat commits without doc changes. Tests: 586 → 593 (+7). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 07:29:57 +02:00
Kjell Tore Guttormsen	dd0d4bf738	docs(config-audit): v5 implementation log — Session 2 alpha.2 result	2026-05-01 07:14:18 +02:00
Kjell Tore Guttormsen	55cedbea2c	docs(config-audit): CHANGELOG 5.0.0-alpha.2 entry	2026-05-01 07:10:52 +02:00
Kjell Tore Guttormsen	3c79f95e9a	feat(config-audit): self-audit --check-readme flag (v5 F6) [skip-docs] Filesystem counts are the source of truth; README badges parsed via line-anchored substring (badge/<kind>-<N>-...). Emits readmeCheck object with counts/badges/mismatches. CLI: node scanners/self-audit.mjs --check-readme [--json] API: runSelfAudit({ checkReadme: true }) → result.readmeCheck Helper: checkReadmeBadges(pluginDir) for per-fixture testing New fixture: readme-desynced/ (commands/foo + bar, README claims 1). Note: alpha phase does NOT require result.readmeCheck.passed === true. Self-test of real plugin currently fails (scanners 10 vs 9, tests 31 vs 543); will be reconciled in Session 5 Step 28 (README sync). 582 → 586 tests, all green.	2026-05-01 07:09:26 +02:00
Kjell Tore Guttormsen	910567d661	feat(config-audit): HKV flags verbose hook output (v5 M5) [skip-docs] Static heuristic — counts console.log / process.stdout.write lines per referenced hook script. > 50 → low CA-HKV-NNN finding. New fixtures: - hooks-verbose/ (61 verbose lines → triggers) - hooks-quiet/ (5 lines → no finding) 580 → 582 tests, all green.	2026-05-01 07:05:45 +02:00
Kjell Tore Guttormsen	7181862644	chore(config-audit): allow fake node_modules in tests/fixtures (v5 M1) [skip-docs] The mcp-tool-heavy fixture relies on node_modules/mcp-heavy/package.json being committed so the v5 M1 tool-count detection test runs deterministically. Add an unignore rule for tests/fixtures/**/node_modules/.	2026-05-01 07:02:54 +02:00
Kjell Tore Guttormsen	1422daf895	feat(config-audit): MCP tool-count detection with manifest fallback (v5 M1) [skip-docs] readActiveMcpServers now resolves tool count via: 1. In-config tools array 2. Cached tools/list at \$HOME/.claude/config-audit/mcp-cache/<name>.json 3. node_modules/<pkg>/package.json (resolved from npx <pkg>) 4. Fallback: { toolCount: null, toolCountUnknown: true } estimateTokens uses detected toolCount (heavy server > light server). New fixture: mcp-tool-heavy/ with mocked node_modules/mcp-heavy/package.json (20 tools). 576 → 580 tests, all green.	2026-05-01 07:02:08 +02:00
Kjell Tore Guttormsen	9a44df22ac	feat(config-audit): TOK flags skill description > 500 chars (v5 M2) [skip-docs] - New Pattern F in TOK: low-severity finding when SKILL.md description > 500 chars - Scoped to discovery.files (project-local) — activeConfig.skills walk would pull in user/plugin skills out of project scope - New fixtures: skill-bloated (594-char desc) + skill-tight (46-char baseline) 574 → 576 tests, all green.	2026-05-01 06:58:42 +02:00
Kjell Tore Guttormsen	25ca6139b4	feat(config-audit): TOK flags CLAUDE.md cascade > 10k tokens (v5 M4) [skip-docs] - New Pattern E in TOK: emits medium finding when activeConfig.claudeMd.estimatedTokens > 10_000 - Uses cascade tokens, file count, and calibration note as evidence - New fixtures: large-cascade (37k bytes / 14475 cascade tokens) + small-cascade (5k baseline) 572 → 574 tests, all green.	2026-05-01 06:53:12 +02:00
Kjell Tore Guttormsen	9330124f5c	feat(config-audit): flag additionalDirectories > 2 (v5 M6) [skip-docs] - Add 'additionalDirectories' to KNOWN_KEYS - Emit low severity finding when length > 2 - New fixtures: additional-dirs-many (3 entries) + additional-dirs-ok (2) 569 → 572 tests, all green.	2026-05-01 06:50:24 +02:00
Kjell Tore Guttormsen	58d6b5b9ea	feat(config-audit): recalibrate TOK severities for tokens/turn (v5 F7) [skip-docs] - Pattern A (cache-breaking volatile top): medium → high - Pattern B (redundant permissions): low → medium - Pattern C (deep @import chain): medium → low - Add calibration_note evidence on every TOK finding - Table-driven severity tests (identify by title, IDs are sequential) 563 → 569 tests, all green. Doc sweep deferred to Session 5 (Step 28).	2026-05-01 06:47:32 +02:00
Kjell Tore Guttormsen	3aba15c566	docs(config-audit): v5 implementation log — Session 1 alpha.1 result Per-step result table for Steps 1-9 + 8b with commit SHAs and notable deviations (Step 6 baseline switch to sonnet-era, Step 8 surprise on sonnet-era discovery scope, PathGuard hook false positive on test fixtures). 543 → 563 tests, all green, no blockers carried forward.	2026-05-01 06:37:08 +02:00
Kjell Tore Guttormsen	919bd213f8	docs(config-audit): CHANGELOG 5.0.0-alpha.1 entry Summarizes F1-F5 scope: TOK ↔ readActiveConfig integration, 'mcp' kind in estimateTokens (15 → ≥500), severity-weighted scoreByArea, dead-code removal in TOK hotspots, Pattern D / CA-TOK-004 removal. Includes migration notes for downstream consumers (CA-TOK-* globs still suppress 001-003; scoringVersion field added for v4→v5 detection).	2026-05-01 06:34:06 +02:00
Kjell Tore Guttormsen	08a9ead51a	docs(config-audit): remove CA-TOK-004 references after F5 (v5) knowledge/opus-4.7-patterns.md: - Pattern 4 row removed from the catalogue table - "Pattern 4 (sonnet-era)" detection note removed - Threshold-calibration note no longer mentions pattern 4 - Added a short pointer explaining the v5 F5 removal commands/tokens.md: - "CA-TOK-001..004" → "CA-TOK-001..003" in two places	2026-05-01 06:33:01 +02:00
Kjell Tore Guttormsen	2810ee6f62	feat(config-audit): remove TOK Pattern D detectSonnetEra (v5 F5) Pattern D was the v4 sonnet-era signature: 'config is structurally clean but uses no Opus-4.7-specific features'. Two problems: - It triggered on any minimal config that happened to lack skills/MCP - The advice was generic and not actionable The hotspots ranking and per-pattern findings (A/B/C) cover the same ground with concrete, file-anchored signal. Dropping the noise. BREAKING (intentional): scanners no longer emit the sonnet-era info finding. Suppression entries and downstream tooling that reference the v4 finding ID should be updated. Doc sweep follows in Step 8b. Tests: sonnet-era fixture now asserts zero findings.	2026-05-01 06:31:43 +02:00
Kjell Tore Guttormsen	0d8a9af3d6	fix(config-audit): remove TOK dead take + hotspot padding (v5 F4) The buildHotspots padding loop and unused 'take' variable were dead code from the v3 hotspots-min contract. Replaced with a clean ranked.slice(0, HOTSPOTS_MAX). Tiny fixtures may now return fewer than 3 hotspots, which is the honest answer; the contract now only asserts <= 10. Tests: +2 cases — every hotspot.source is unique (no padding); length never exceeds HOTSPOTS_MAX.	2026-05-01 06:29:33 +02:00
Kjell Tore Guttormsen	34669d596c	feat(config-audit): TOK consumes readActiveConfig (v5 F1) Removes the v4 'void readActiveConfig' placeholder and wires the active-config snapshot into the TOK scanner. Per-turn behavior changes: - Each enabled MCP server becomes its own hotspot entry (richer than the parent .mcp.json file alone) - total_estimated_tokens now includes MCP server cost - result.activeConfig exposes a small summary (claudeMdEstimatedTokens, mcpServerCount, pluginCount, skillCount) Failures of readActiveConfig are non-fatal — the scanner falls back to the discovery-only path used in v4. Tests: +3 cases on the new tok-active-config fixture (.mcp.json with 2 servers, CLAUDE.md, plugin skeleton).	2026-05-01 06:27:34 +02:00
Kjell Tore Guttormsen	ce7c42f517	fix(config-audit): MCP token callers use 'mcp' kind (v5 F2) Two MCP enumeration paths in readActiveMcpServers now pass kind='mcp' to estimateTokens with optional toolCount derived from def.tools array (populated when callers cache MCP discovery — Step 14 wires that up). Hook callers keep kind='item' (no schema overhead). Visible effect: every active MCP server jumps from estimatedTokens=15 to >= 500 (or higher when toolCount is known). The whats-active output and TOK hotspots now reflect actual MCP cost. Tests: assert mcpServers[].estimatedTokens >= 500 in fixture.	2026-05-01 06:22:54 +02:00
Kjell Tore Guttormsen	48d560a209	feat(config-audit): add 'mcp' kind to estimateTokens (v5 F2) Differentiate MCP servers from generic 'item' (flat 15) — they actually cost 500+ tokens per turn for protocol metadata and tool schemas. estimateTokens(bytes, 'mcp', {toolCount}) returns max of: - 500 token floor (base overhead) - ceil(bytes / 3.5) (json-rate when bytes known) - 500 + toolCount * 200 (when tool count is detected; Step 14 wires this) Caller-side migration in next commit (Step 5). Tests: +4 cases for mcp kind.	2026-05-01 06:21:30 +02:00
Kjell Tore Guttormsen	a65c7f4080	feat(config-audit): severity-weighted scoreByArea (v5 F3) Replace count-based pass-rate with severity-weighted penalty: - penalty = sum(count[s] * WEIGHTS[s]) - maxBudget = max(10, findingCount * 4) - passRate = max(0, 100 - penalty / maxBudget * 100) A few lows no longer crater an area's grade; a single high or critical consumes a large fraction of budget. Mirrors the operator intuition that severity, not count, is the signal. BREAKING (intentional): scoring semantics differ from v4 for non-clean configs. Add scoringVersion: 'v5' to the returned struct so consumers can detect the version. baseline-all-a remains all-A (no critical/high on that fixture). Tests: +6 cases for severity weighting; existing "many findings" test updated to use highs (where v5 still drops the grade as expected).	2026-05-01 06:20:08 +02:00
Kjell Tore Guttormsen	e5efc2ff64	feat(config-audit): export WEIGHTS from severity.mjs (v5 F3 prep) Promote WEIGHTS const to named export with Object.freeze for downstream use in scoring.mjs (severity-weighted scoreByArea, F3). Tests: +2 cases asserting WEIGHTS shape.	2026-05-01 06:16:28 +02:00

1 2

75 commits