ktg-plugin-marketplace

Author	SHA1	Message	Date
Kjell Tore Guttormsen	b7d64a6d2b	docs(llm-security): tre doc-nivåer oppdatert for v7.6.1 CLAUDE.md OBLIGATORISK-regel: enhver feature-endring som pusher til Forgejo MÅ oppdatere alle tre doc-nivåer i SAMME commit eller umiddelbart etter. v7.6.1-fix-commit (`f9b555a`) bumpet kun versjons-badgen — denne oppfølgings-commit-en lukker doc-gapet. - plugins/llm-security/README.md: ny [7.6.1] history-tabell-rad - plugins/llm-security/CLAUDE.md: header bumpet v7.6.0 → v7.6.1 + ny v7.6.1-blurb (alle 6 fix-detaljer) - README.md (rot): llm-security versjons-rad bumpet v7.6.0 → v7.6.1 + v7.6.1 history-bullet over v7.6.0-bullet Ingen kodeendringer. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-06 14:44:55 +02:00
Kjell Tore Guttormsen	f006143fb8	feat(llm-security): playground v7.6.0 — Tier 3 referanse-case komplett Komplett integrasjon av playground-design-system Tier 3-komponenter i playground-en. Playground er nå referanse-case for hva DS-en kan levere når alle komponenter brukes som tilsiktet. Levert over 5 sesjoner med atomic commits per sesjon. Endringer i v7.6.0 (fase 1-7): - Fjernet ~30 duplikat-CSS-deklarasjoner (DS vinner cascade) - Page-shell harmonisert (page__header-klynge på alle 4 overflater) - Scope-identitet via badge--scope-security - verdict-pill-lg erstatter custom verdict-pill - Onboarding wizard via Tier 3 form-progress + fp-step - Tier 3 spesialkomponenter integrert: - tfa-flow + tfa-leg + tfa-arrow (toxic-flow-rapport) - mat-ladder + mat-step (posture-modenhet) - suppressed-group (narrative-audit) - codepoint-reveal + cp-tag/cp-zw/cp-bidi (UNI-funn) - top-risks + top-risk[data-severity] (rangert funn-listing) - recommendation-card[data-severity] (clean/harden/audit/posture/ pre-deploy/plugin-audit advisory) - risk-meter (band-visualisering 0-100 på 5 archetypes) - card--severity-{level} (findings-cards modifier) 5 nye DS-helpers + mapSeverityToCardLevel + parseNarrativeAudit. renderRecommendationsList utvidet med severity-param. renderHarden-rewrite fra diff-row-struktur til recommendation-card med action-mapping. Ingen scanner/hook-atferd berørt. Kun visuelt og strukturelt. A11Y-rapport oppdatert (WCAG 2.1 AA bekreftet, severity-soft fargepar verifisert, semantiske elementer erstatter generic div). Versjon bumpet v7.5.0 → v7.6.0: - plugins/llm-security/.claude-plugin/plugin.json - plugins/llm-security/package.json - plugins/llm-security/README.md (badge + Playground-seksjon + history) - plugins/llm-security/CLAUDE.md (header + ny v7.6.0-blurb) - plugins/llm-security/CHANGELOG.md ([7.6.0] entry) - README.md (rot — llm-security-rad + history-bullet) - plugins/llm-security/playground/llm-security-playground.html (footer) Filendring playground.html totalt over 5 sesjoner: 10209 → 10677 linjer (+468 netto). Per-sesjons-commits: `9ef0c48` (Sesjon 1, fase 1-2), `2481133` (Sesjon 2, fase 3-4), `fbda041` (Sesjon 3, fase 5a-d), `e9e5cee` (Sesjon 4, fase 5e-h). Verifisering bekreftet: - 18/18 renderere passerer regresjons-smoke-test mot dft-komplett-demo - Grep-criteria oppfylt: top-risks 5, recommendation-card 32, risk-meter 7 (5 archetypes), card--severity- 4, verdict-pill-lg 20, fp-step 12, badge--scope-security 5, tfa-flow 3, mat-ladder 2, suppressed-group 8, codepoint-reveal 12 - Window-globaler intakt, JS parse OK, demo-state JSON parse OK Kjent begrensning: parsed.findings er tom for deep-scan/audit demo- fixturer (parser-begrensning, defensiv design — dokumentert i CHANGELOG + A11Y-rapport, sporet for v7.6.x patch). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-06 14:12:59 +02:00
Kjell Tore Guttormsen	ce3891bdd0	feat(llm-security): playground Fase 3 — v7.5.0 med 18 parsere/renderere Single-file SPA playground har nå parser + renderer for alle 18 produces_report=true-kommandoer (Fase 2: 10 høy-prio + Fase 3: 8 gjenstående: mcp-inspect, supply-check, pre-deploy, diff, watch, registry, clean, threat-model). 18 markdown test-fixtures fungerer som kontrakt-anker for parser-utvikling. Komplett demo-prosjekt `dft-komplett-demo` har alle 18 rapporter ferdig parsed inline — klikk-gjennom uten "parser ikke implementert"- paneler. 2 nye archetypes i KEY_STATS_CONFIG: kanban-buckets (clean) og matrix-risk (threat-model). Bug-fix: normalizeVerdictText sjekker nå GO-WITH-CONDITIONS / CONDITIONAL / BETINGET FØR plain GO så betinget verdict (pre-deploy med åpne vilkår) ikke kollapser til ALLOW. Eksponert 11 window-globaler for testing/automasjon (__store, __navigate, __loadDemoState, __PARSERS, __RENDERERS, __CATALOG, __inferVerdict, __inferKeyStats, __renderPageShell, __handlePasteImport, __scheduleRender). 12 Playwright-genererte screenshots i playground/screenshots/v7.5.0/. A11Y-rapport (WCAG 2.1 AA): 0 blokkerende, 3 mindre forbedringer flagget for v7.5.x patch (skip-link, heading-hierarki på project, aria-live toast). Versjonsbump 7.4.0 -> 7.5.0 i 10 filer (package.json, plugin.json, CLAUDE.md header, README badge, CHANGELOG-entry, 3 scanner VERSION- konstanter, ROADMAP, marketplace-rot README). Ingen scanner- eller hook-behavior-changes — purely additive surface. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-05 22:15:47 +02:00
Kjell Tore Guttormsen	8f1bf9b7b4	chore(llm-security): v7.4.0 — examples + e2e suite minor Bumps from v7.3.1 to v7.4.0. Purely additive surface — no scanner or hook behavior changes, no breaking changes. Headline content (already merged on main since v7.3.1): - examples/ utvidelse — seven runnable demonstration walkthroughs shipped over three sessions (sesjon 1 pre-existing prompt-injection-showcase + lethal-trifecta-walkthrough, mcp-rug-pull, supply-chain-attack, poisoned-claude-md, bash-evasion-gallery, toxic-agent-demo, pre-compact-poisoning). Each is self-contained: README + fixture + run-script + expected-findings testable contract. State-isolation pattern (PID-suffixed JSONL or env-overrides like LLM_SECURITY_MCP_CACHE_FILE) keeps the user's real cache and /tmp state untouched. - tests/e2e/ — three new suites totalling 45 tests: attack-chain.test.mjs (17), multi-session.test.mjs (9), scan-pipeline.test.mjs (19). Test count 1777 to 1822. These exercise the framework as a coordinated system rather than as isolated unit-tests. Version sync (8 files): - package.json - .claude-plugin/plugin.json - CLAUDE.md (header) - README.md (badge + Recent versions tabellen new row) - CHANGELOG.md (Unreleased to [7.4.0] - 2026-05-05 with summary) - scanners/dashboard-aggregator.mjs VERSION constant - scanners/ide-extension-scanner.mjs VERSION constant - scanners/posture-scanner.mjs VERSION constant Stabilization-stance unchanged. v8.0.0 remains the planned deprecation-cleanup release. v7.x continues as the stable line. Tests: 1822/1822 grønne lokalt etter bump. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-05 15:34:02 +02:00
Kjell Tore Guttormsen	b6d912200e	feat(llm-security): add pre-compact-poisoning example for PreCompact hook [skip-docs] Runnable demonstration of hooks/scripts/pre-compact-scan.mjs (the only PreCompact hook in the plugin) detecting both a CRITICAL injection pattern and an AWS-shaped credential inside a synthetic JSONL transcript, exercised across all three values of LLM_SECURITY_PRECOMPACT_MODE plus a benign-transcript control case in block mode that proves the gate is not a brick wall. The transcript is generated at runtime in a per-invocation tempdir under os.tmpdir() and the directory is removed in a finally block, so the user's real ~/.claude/projects/.../transcripts/ are never touched. The AWS-shaped key uses the same 'AK' + 'IA' + ... fragmentation idiom as tests/e2e/attack-chain.test.mjs so this source contains no literal credentials and pre-edit-secrets does not block writes during development. Nine independent assertions (9/9 must pass): - block mode + poisoned: exit 2, decision=block JSON, reason text covers both injection and AWS labels (3 assertions) - warn mode + poisoned: exit 0, systemMessage JSON, no decision field (2 assertions) - off mode + poisoned: exit 0, no JSON on stdout (2 assertions) - block mode + benign: exit 0, no decision=block JSON (2 assertions) OWASP / framework mapping: LLM01, LLM02, ASI01, AT-1, AT-3. Docs updated: plugin README "Other runnable examples", plugin CLAUDE.md "Examples" tabellen, CHANGELOG [Unreleased] Added. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-05 15:23:10 +02:00
Kjell Tore Guttormsen	92fb0087fa	feat(llm-security): add toxic-agent-demo example for TFA scanner [skip-docs] Single-component lethal-trifecta walkthrough that drives scanners/toxic-flow-analyzer.mjs against a deliberately misconfigured fixture plugin. The fixture agent declares tools: [Bash, Read, WebFetch], which alone covers all three trifecta legs (input surface + data access + exfil sink). No hooks/hooks.json is shipped, so TFA's mitigation logic finds no active guards and emits a CRITICAL "Lethal trifecta:" finding without downgrade. Plugin marker is plugin.fixture.json (recognised by isPlugin()) rather than .claude-plugin/plugin.json — the latter is blocked by the plugin's own pre-write-pathguard hook, and plugin.fixture.json exists in isPlugin() specifically so example fixtures can self-mark without touching guarded paths. Three independent assertions (3/3 must pass): direct trifecta present and CRITICAL; finding mentions the exfil-helper component; description confirms "no hook guards detected" (proves the mitigation path stayed inactive). expected-findings.md documents the contract. OWASP / framework mapping: ASI01, ASI02, ASI05, LLM01, LLM02, LLM06. Docs updated: plugin README "Other runnable examples", plugin CLAUDE.md "Examples" tabellen, CHANGELOG [Unreleased] Added. [skip-docs] is appropriate because examples don't change what the plugin "synes å dekke utad" — marketplace root README is unaffected. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-05 15:15:04 +02:00
Kjell Tore Guttormsen	ca5a8cec67	feat(llm-security): add 3 more runnable threat examples [skip-docs] Three new self-contained, runnable threat demonstrations under examples/, continuing the batch started in `583a78c`. Each example has README.md + run-*.mjs + expected-findings.md and uses state-isolation discipline so the user's real cache/state files are never polluted. - examples/supply-chain-attack/ — two-layer demonstration: pre-install-supply-chain (PreToolUse) blocks compromised event-stream version 3.3.6 and emits a scope-hop advisory for the @evilcorp scope; dep-auditor (DEP scanner, offline) flags 5 typosquat dependencies plus a curl-piped install-script vector in the fixture package.json. Maps to LLM03/LLM05/ASI04. - examples/poisoned-claude-md/ — all 6 memory-poisoning detectors fire on a deliberately poisoned CLAUDE.md plus a fixture agent file under .claude/agents (E15/v7.2.0 surface): detectInjection, detectShellCommands, detectSuspiciousUrls, detectCredentialPaths, detectPermissionExpansion, detectEncodedPayloads. No agent runtime needed — scanner imported directly. Maps to LLM01/LLM06/ASI04. - examples/bash-evasion-gallery/ — one disguised variant per T1 through T9 evasion technique fed through pre-bash-destructive, verified BLOCK after bash-normalize strips the evasion. T8 base64-pipe-shell uses its own BLOCK_RULE. The canonical destructive form uses a path token rather than the bare slash (regex word-boundary requires it). Source-string fragmentation pattern reused from the e2e attack-chain test. Maps to LLM06/ASI01/LLM01. Plugin README "Other runnable examples" section + plugin CLAUDE.md "Examples" table + CHANGELOG Unreleased/Added all updated. Marketplace root README unchanged ([skip-docs] for marketplace-level gate — plugin's outward coverage is unchanged, only demonstrations were added).	2026-05-05 15:01:20 +02:00
Kjell Tore Guttormsen	583a78c6cc	feat(llm-security): add lethal-trifecta + mcp-rug-pull example contents [skip-docs] Companion to `8df5d5c` (which only carried the doc updates — the example directories themselves were left out of staging by mistake). This commit adds the actual example mappes: - examples/lethal-trifecta-walkthrough/{README.md, run-trifecta.mjs, expected-findings.md} - examples/mcp-rug-pull/{README.md, run-rug-pull.mjs, expected-findings.md} Plus plugin CLAUDE.md "Examples (runnable demonstrations)" section with a 4-row table covering malicious-skill-demo, prompt-injection- showcase, lethal-trifecta-walkthrough, and mcp-rug-pull plus the state-isolation discipline notes. Marketplace root README unchanged since plugin's outward coverage is unchanged ([skip-docs] covers the marketplace-level gate).	2026-05-05 14:45:39 +02:00
Kjell Tore Guttormsen	f835777c1e	test(llm-security): add e2e suite proving framework works as coordinated system Three new files in tests/e2e/ (45 tests, 1777 -> 1822): - attack-chain.test.mjs (17): full hook stack against attack payloads in sequence -- prompt injection at the gate; T1/T5/T8 bash evasions; pathguard on .env / .ssh; secrets hook on AWS-shaped keys and PEM headers; markdown link-title and HTML-comment poisoning in tool output; trifecta accumulation over a single session with dedup on the next benign call. - multi-session.test.mjs (9): state persistence across simulated session boundaries. Uses the fact that a hook child's process.ppid equals the test runner's process.pid, so writing the session state file directly simulates "previous session" history. Covers slow-burn trifecta (legs spread >50 calls), MCP cumulative description drift via LLM_SECURITY_MCP_CACHE_FILE override, and pre-compact transcript poisoning in warn / block / clean / missing-file modes. - scan-pipeline.test.mjs (19): scan-orchestrator + all 10 scanners + toxic-flow correlator against poisoned-project (BLOCK / 95 / Extreme) and grade-a-project (WARNING / 48 / High). Asserts envelope shape, verdict, risk_score, severity counts, OWASP coverage, scanner enumeration, and a narrative-coherence cross-check that the BLOCK scan strictly outranks the WARNING scan along every axis. Test files build credential-shaped payloads at runtime via concatenation so they contain no literal matches for the pre-edit-secrets regexes (memory rule feedback_secrets_hook_test_fixtures.md). Doc updates in same commit per marketplace policy: - CLAUDE.md header: 1777+ -> 1822+ tests, mentions tests/e2e/ - README.md badge tests-1777 -> tests-1822, body text updated - CHANGELOG.md: new [Unreleased] Added section describing scope No version bump. No behavior changes outside tests/.	2026-05-05 12:06:57 +02:00
Kjell Tore Guttormsen	8ca391fdb2	fix(llm-security): correct distribution URLs to marketplace path The plugin lives in ktg-plugin-marketplace and is distributed via the Claude Code marketplace mechanism. There is no standalone open/claude-code-llm-security repo; references to it were aspirational and never realized. - package.json: homepage now deep-links to plugins/llm-security/ in the marketplace; repository.url uses the marketplace repo with directory field (npm convention for monorepo plugins); bugs.url routes to marketplace issue tracker. - CLAUDE.md: "Public Repository" section replaced with "Distribution" section documenting the marketplace install path. - CONTRIBUTING.md: issue tracker URL points at marketplace issues with [llm-security] prefix convention. - CHANGELOG.md: v7.3.1 entry rewritten to reflect actual change (URLs corrected to marketplace, not "fixed from one wrong URL to another wrong URL"). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 06:20:54 +02:00
Kjell Tore Guttormsen	62a9335772	chore(llm-security): v7.3.1 — stabilization patch for forkers and downstream users No behavior changes. Sets the public stance, tightens documentation, and removes coherence drift so anyone forking or downloading the plugin gets a consistent starting point. Added: - CONTRIBUTING.md — public fork-and-own guide. Why PRs are not accepted, how to fork well, what is welcome via issues. - README "Project scope" section — out-of-scope table naming what is fork-and-own territory (web dashboard, fleet policy, runtime firewall, IDE LSP, compliance pack, ticketing, multi-tenancy, ML detectors, marketplace UI, SSO/SCIM/RBAC) with commercial alternatives. - package.json: bugs.url, CONTRIBUTING/SECURITY/CHANGELOG in files whitelist for npm publishing. Changed: - SECURITY.md rewritten. Supported-versions table from stale 5.1.x to current reality (7.3.x active, 7.0-7.2 best-effort, <7.0 EOL). Best-effort solo response timeline. Scope expanded to bin/. - Scanner VERSION constants synced to plugin version. Was 6.0.0 in dashboard-aggregator and posture-scanner. - package.json repository.url corrected from fromaitochitta/ to open/. - README "Feedback & contributing" links to CONTRIBUTING.md. Fixed: - pre-compact-scan size-cap timing test ceiling raised 500ms -> 1000ms. Was a flake on Intel Mac and CI under load. Design target unchanged (<500ms, documented in CLAUDE.md). Notes: - First patch on the stabilization line (post-2026-05-01). - Wave E attack-simulator scenarios deferred indefinitely; coverage remains at 72. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 06:14:03 +02:00
Kjell Tore Guttormsen	c4183b8b4d	chore(release): bump to v7.3.0 Batch C release. Closes 12 implementation tasks (E3, E8-E14, 8.4, 8.6, 8.7, 8.10) across four execution waves: A (bash + decoder), B (supply chain + workflow scanner), C (MCP cumulative drift), D (code quality). Wave E (9 new attack-simulator scenarios for the new defenses) deferred to v7.3.1 — defenses are unit-tested per wave; the deferred work adds attack-simulator regression coverage on top, not the primary safety net. Tests: 1665+ → 1777 (Wave A-D cumulative, +112). Version sync targets touched: - package.json - .claude-plugin/plugin.json - CLAUDE.md (header) - README.md (badge + new release-history row) - scanners/ide-extension-scanner.mjs (VERSION constant) - ../../README.md (marketplace root plugin entry) - CHANGELOG.md (new [7.3.0] section per Keep a Changelog, all 12 task IDs covered individually under Added/Changed/Documentation/Tests/Notes) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 05:28:45 +02:00
Kjell Tore Guttormsen	97c5c9d934	docs(claude-md): 8.10 — fix hooks count + add doc-consistency test for hook-table sync	2026-04-30 17:12:49 +02:00
Kjell Tore Guttormsen	ba5f2b64ad	feat(policy-loader): 8.7 — env-var deprecation warnings (v8.0.0 removal)	2026-04-30 17:11:07 +02:00
Kjell Tore Guttormsen	001df2ebe8	feat(commands): E14 part 3 — /security mcp-baseline-reset slash command Wave C step C3: closes E14 with the user-facing reset command. After a legitimate MCP server upgrade the sticky baseline (added in C1) becomes a stale "what the tool used to say" anchor and every subsequent post-mcp-verify advisory will re-flag the change. /security mcp-baseline-reset lets the user acknowledge the upgrade so the next call seeds a fresh baseline. New files: - scanners/mcp-baseline-reset.mjs — small CLI wrapper around clearBaseline / listBaselines. Modes: --list (read-only), --target <name>, no-args (all). Outputs JSON summary on stdout. Exit 0 always (idempotent). - commands/mcp-baseline-reset.md — dispatcher following mcp-inspect.md shape. Frontmatter: name=security:mcp-baseline-reset, sonnet model, Read/Bash/AskUserQuestion tools. 4-step body (list -> confirm scope -> execute -> confirm result). - tests/scanners/mcp-baseline-reset.test.mjs — 10 CLI tests across --list, --target, clear-all, idempotency, history preservation, and bare-positional sugar. Updated: - commands/security.md — new row in commands table after mcp-inspect. - CLAUDE.md — new commands-table row + new v7.3.0 narrative section describing the baseline schema, cumulative-drift detection, reset semantics, and the LLM_SECURITY_MCP_CACHE_FILE override. - Plugin README.md — new MCP-baseline-reset row in commands table, scanner count 12 standalone -> 13 standalone, new "MCP Description Drift (E14, v7.3.0)" subsection explaining the sticky baseline, cumulative threshold, reset semantics, and env-var override. - Root marketplace README.md — scanner count 22 -> 23 (10 orchestrated + 13 standalone), command count 19 -> 20, test count 1511 -> 1768. Wave C complete: 1738 -> 1768 tests (+30 across C1/C2/C3). Per plan, Wave C does NOT bump the plugin version — that lands at the wave-bundle release. The advisory text in post-mcp-verify already references the new command path so the user has a ready remediation step. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-30 16:49:01 +02:00
Kjell Tore Guttormsen	427b68eca9	feat(post-mcp-verify): E14 part 2 — cumulative-drift MEDIUM advisory [skip-docs] Wave C step C2: surface the cumulative-drift signal from checkDescriptionDrift() (added in C1) as a separate MEDIUM advisory with finding category mcp-cumulative-drift. Independent of the existing per-update drift advisory — a slow-burn rug-pull that keeps each update below the 10% per-update threshold but cumulatively drifts >=25% from the sticky baseline now triggers the new advisory without ever crossing the per-update bar. The advisory references /security mcp-baseline-reset (added in C3) so the user knows how to acknowledge a legitimate MCP server upgrade. CLAUDE.md updates: - post-mcp-verify hooks-table row mentions per-update + cumulative drift - mcp-description-cache lib bullet documents baseline schema, history, cumulative threshold policy key, and LLM_SECURITY_MCP_CACHE_FILE override. Tests: 2 new hook tests using LLM_SECURITY_MCP_CACHE_FILE for cache isolation. Existing 68 still pass; total 70. Plugin README and root marketplace README updates land in C3 alongside the new /security mcp-baseline-reset slash command (combined Wave-C doc update per plan §"Wave C — Touch" list). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-30 16:40:52 +02:00
Kjell Tore Guttormsen	3b57dfbf6d	chore(release): bump to v7.2.0 Batch B release — closes critical-review B-tier scanner defects (B3, B5, B6, B7) and the v7.2.0 evasion-arsenal hardening patches (E1, E4, E5, E7, E15, E16, E17, E18). Tests 1522 → 1665+, attack simulator 64 → 72 (100 % pass). Version updates across the 6 sync targets: - package.json - .claude-plugin/plugin.json - CLAUDE.md (header + test count: 1511 → 1665+) - README.md (badge + Version History row) - scanners/ide-extension-scanner.mjs (VERSION constant) - ../../README.md (marketplace root) CHANGELOG [7.2.0] entry per Keep a Changelog with full Added / Changed / Documentation / Tests / Notes breakdown. Refs: Batch B Wave 6 / Step 15	2026-04-29 15:40:15 +02:00
Kjell Tore Guttormsen	f0a1d4024a	feat(post-session-guard): E17 — configurable escalation window + 20-call MEDIUM advisory Critical-review §4 E17 finding: pre-v7.2.0 the delegation-after-input advisory fired only within a 5-call window. Attackers who deliberately waited 6+ calls before delegating bypassed detection. Window was also hardcoded — operators couldn't tune it for their environment. Two coordinated changes: 1. LLM_SECURITY_ESCALATION_WINDOW env var (primary window override) - parseInt(env) \|\| getPolicyValue('trifecta', 'escalation_window', 5) - Mirrors the established pattern from LLM_SECURITY_TRIFECTA_MODE et al. - Setting env=3 narrows; env=8 expands. 2. Secondary 20-call MEDIUM advisory (slow-burn variant) - DELEGATION_ESCALATION_WINDOW_MEDIUM = 20 (hardcoded — same value for all operators; tunable in a future patch if needed) - checkEscalationAfterInput now returns `tier: 'primary'\|'secondary'\|null` - formatEscalationWarning emits a different message for secondary — mentions "slow-burn", references env-var, distinct from the primary "DeepMind Category 4" framing Hook reads max(WINDOW_SIZE, secondary+5) entries to cover the wider window. Existing duplicate-suppression (`escalation_warning` state entry) covers both tiers. Audit-trail event captures `tier` field. Tests: +5 cases in tests/hooks/post-session-guard.test.mjs: - secondary window catches 9-call distance (slow-burn) - secondary boundary at exactly 20 calls - primary regression guard (1-call distance) - env=3 narrows primary (4-call distance becomes secondary) - env=8 expands primary (7-call distance stays primary) Updated existing test "does NOT trigger when input_source is >5 calls ago" — now requires >20 calls (secondary window catches 6-20). Suite: 1644 → 1672 (+28 from new tests + extended scope). All green. CLAUDE.md hooks table updated to document both windows and the env var.	2026-04-29 14:26:18 +02:00
Kjell Tore Guttormsen	3cd68dc9fb	docs(severity): B3 — document info as scoring-inert (v7.2.0 prep) Critical-review §2 B3 finding: `riskScore({info: N}) = 0` silently masks info-volume findings. The behavior was correct (info is scoring-inert by design) but undocumented. Operators reading a report with N info findings had no way to know they contribute zero to verdict/band. Three coordinated edits: - scanners/lib/severity.mjs JSDoc — explicit "Info severity" subsection spelling out: scoring-inert, surfaced in owaspCategorize aggregates, treat as observability telemetry not verdict input. @param updated to mark info as accepted but ignored. - CLAUDE.md v7.0.0 risk-score-v2 line — one-sentence anchor pointing to severity.mjs JSDoc. - tests/lib/severity.test.mjs — anchor test alongside the existing 4-critical=93 anchor: asserts riskScore({info: 50}) === 0, riskScore({info: 1000}) === 0, verdict({info: 100}) === 'ALLOW', riskBand(riskScore({info: 500})) === 'Low'. Decision: skip the optional `infoScore()` helper from the brief. No current consumer would use it; doc-only fix keeps API surface minimal. Revisit if a consumer emerges. Tests: 1522 → 1523 (+1 anchor block, 4 assertions). All green.	2026-04-29 13:56:11 +02:00
Kjell Tore Guttormsen	b18cb329ef	docs(llm-security): v7.1.1 — narrative coherence patch Documents the v7.1.1 narrative-coherence patch in CLAUDE.md (mini-block appended after the v7.0.0 paragraph) and CHANGELOG.md (new [7.1.1] section per Keep a Changelog convention, placed above [7.1.0]). Plan: .claude/plans/ultraplan-2026-04-29-report-coherence.md Brief: .claude/ultraplan-spec-2026-04-29-report-coherence.md Verification gates passed: - npm test: 1522/1522 (was 1511; +11 from new narrative test) - node --test tests/lib/severity.test.mjs: 86/86 (co-monotonicity sweep at lines 252-303 unchanged and green) - node --test tests/scanners/skill-scanner-narrative.test.mjs: 11/11 - Orchestrator against fixture: WARNING / 48 / 1 HIGH (HITL trap caught correctly, no whiplash) - SARIF inline check via toSARIF import: sarif-version 2.1.0, runs: 1 - Zero remaining v1 cutoffs in agent + template Out of scope but flagged for Batch B (deferred to v7.2.0): - commands/scan.md:113-114 retains v1 risk formula Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-29 12:57:54 +02:00
Kjell Tore Guttormsen	621db144bd	chore(release): bump llm-security to v7.1.0 Closes A4 of v7.1.0 critical-review patch — release artefacts. - Version bump 7.0.0 → 7.1.0 across active version sources: * package.json * .claude-plugin/plugin.json * CLAUDE.md header * README.md badge * scanners/ide-extension-scanner.mjs (VERSION constant) * marketplace root README plugin entry - Marketplace root README test count: 1487 → 1511. - CHANGELOG.md: new [7.1.0] - 2026-04-29 section above [7.0.0], documenting B1, B2, B4, B8, honesty-sweep (7 phrases), and test-count delta (+24 → 1511 total). - docs/security-hardening-guide.md: §6 last-updated bump + new v7.1.0 calibration note on hook-level fixes (pathguard regex hole, distributed-trifecta block-mode bypass). Historical references to "7.0.0" intentionally preserved in: - CHANGELOG [7.0.0] entries (history) - README.md version-history table v5.0.0/v7.0.0 rows (history) - CLAUDE.md §"v7.0.0 — Severity-dominated risk scoring" (describes what changed at v7.0.0 release) - scanners/ JSDoc comments noting "v7.0.0+" formula provenance - agents/ + tests/ + knowledge/ provenance comments Pre-existing untracked/modified tracker noise (.gitignore, marketplace.json, config-audit/docs, ultraplan-local/docs) is not part of this commit per the v7.1.0 NEXT-SESSION-PROMPT handoff. Tests: 1511/1511 green.	2026-04-29 11:57:16 +02:00
Kjell Tore Guttormsen	a46308b1e9	docs(llm-security): A3 honesty-sweep — 7 sitater nedtonet (critical-review §9) Closes A3 of v7.1.0 critical-review patch. Each rewrite preserves the underlying claim where it is accurate but removes hype/overreach language. Historical CHANGELOG/README version-table rows are intentionally left as-is (they document what was claimed at the time of release, not what is true today). Changes (CLAUDE.md, commands/ide-scan.md, knowledge/mitigation-matrix.md, docs/security-hardening-guide.md): - "Trustworthy scoring (BREAKING)" → "Severity-dominated risk scoring (v2 model, BREAKING)". Removes hype framing; describes the actual mechanism. - "Context-aware entropy scanner" → "Rule-based entropy scanner with file-extension skip, 8 line-level suppression rules, and configurable policy". No ML/context inference; just rules. - "1487 tests" → "1511 unit and integration tests; mutation-testing coverage not published". Updated count after A1+A2 (+24) and added qualifier. - "Fully Schrems II compatible" → "Schrems II compatible in default offline mode. Optional OSV.dev enrichment (`supply-chain-recheck --online`) transmits package identifiers to a Google-operated API and is a separate compliance consideration." Acknowledges the OSV.dev opt-in caveat. - "Rule of Two enforcement" → "Rule of Two detection (configurable; default warn; blocks on high-confidence trifectas in opt-in `block` mode; distributed trifectas detected but not blocked by default)". "Enforcement" implied block; default is warn. - "Hardened ZIP extractor" → suffix " — no fuzz-testing results published to date". Caps and class-of-attacks rejected are accurate; absence of formal fuzz coverage now stated. - "defense-in-depth" — preserved as framing, but quantified in security-hardening-guide §4: "three independent detection layers with documented bypass classes". Each layer named, each layer's known bypasses pointed to (critical-review §4 evasion arsenal). Tests: 1511/1511 green (no behavioural change).	2026-04-29 11:52:55 +02:00
Kjell Tore Guttormsen	4aa5318bcb	fix(llm-security): A2 batch — JSDoc arithmetic + co-monotonicity test + CaMeL nedton Closes A2 of v7.1.0 critical-review patch (docs/critical-review-2026-04-20.md): - B4 (severity JSDoc): 4 critical = 93, not 90. Fixed in scanners/lib/severity.mjs:23 and CHANGELOG.md v7.0.0 tier description. The actual computation has always been 93 (70 + log2(5)*10 = 93.22 → round); only the docs were wrong. - §5.4 co-monotonicity: new sweep test in tests/lib/severity.test.mjs over 15 representative count vectors. Asserts that (verdict, riskBand) agree under the v7.0.0 contract for every case — catches future drift between riskScore tiers, verdict cutoffs, and riskBand cutoffs. Includes a B4 anchor test (riskScore {critical: 4} === 93) so doc/code drift fails loudly. - B8 (CaMeL claims toned down): post-session-guard.mjs:646 comment block and CLAUDE.md:184 Defense Philosophy bullet now describe the implementation honestly — opportunistic byte-matching of truncated output fingerprints (first 200 bytes, SHA-256/16-hex), not semantic data-flow tracking. Trivially bypassed by mutation, summarisation, or re-encoding. Inspired by CaMeL (DeepMind 2025), but not a CaMeL capability-tracking implementation. Tests: 1495 → 1511 (+16: 15 sweep cases + 1 B4 anchor). All green.	2026-04-29 11:49:08 +02:00
Kjell Tore Guttormsen	765bc74f52	feat(llm-security): v7.0.0 commit 7 — rule 18 (markdown image URL suppression) E2E verification against content-heavy repo (`content-claude-code`) revealed 413 entropy findings (8 HIGH / 405 MEDIUM) from markdown image CDN URLs in JSON content indexes — e.g., `![Image 1: Title](https://www-cdn.anthropic.com/images/.../cf1dd2167fcf12f5882333ddc58a5bc1f0026952.svg)`. These are legitimate content-repo artifacts, not credentials. The 40-char hash segment in the CDN URL trips Shannon entropy (H=5.29 over 300 chars), and rule 13 (inline <svg>) doesn't match since there's no literal `<svg>` tag — the `.svg` is just a URL path suffix. Added rule 18 `MARKDOWN_IMAGE = /!\[[^\]]\]\(\shttps?:\/\//` — matches `![alt](http…)` / `![alt](https…)`. Line-level (not string-level) so URL is not over-specific. E2E impact on `content-claude-code`: - Before: BLOCK / 65 / 8H 437M 0L - After: WARNING / 56 / 3H 427M 0L Hyperframes unchanged: BLOCK / 80 / 1C 4H 92M — real CRITICAL SQL-injection and HIGH findings still detected. Tests: 2 new (positive + negative fixture) bringing entropy-context to 26, total suite 1485 → 1487. Docs updated to "rules 11-18" and "8 new line-suppression rules". Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-19 22:37:39 +02:00
Kjell Tore Guttormsen	6f86de937a	feat(llm-security)!: v7.0.0 commit 6 — tests, docs, version bump Final commit in the trustworthy-scoring series. Bundles verdict cutoff alignment, the last suite of tests, and all documentation touch-points that quote version numbers or describe v7.0.0 behaviour. Verdict/band co-monotonicity - `scanners/lib/severity.mjs` — verdict cutoffs moved from 61/21 to 65/15 so `BLOCK >= 65`, `WARNING >= 15` locks onto the v2 riskBand() boundaries. Prevents "BLOCK / Medium band" contradictions under the v2 formula. Scanner hardening (bug fixes from v7.0.0 testing) - `scanners/entropy-scanner.mjs` — `policy_source` now uses `existsSync('.llm-security/policy.json')` instead of value-based check. Old heuristic always reported 'policy.json' because DEFAULT_POLICY now carries an `entropy.thresholds` section. - `scanners/lib/file-discovery.mjs` — `.sass` and GPU shader extensions (`.glsl, .frag, .vert, .shader, .wgsl`) added to TEXT_EXTENSIONS. Without this, shader files were invisible to file-discovery, so they were never counted as skipped by the entropy-scanner extension filter. Tests - `tests/scanners/entropy-context.test.mjs` (new, 24 tests) — A. File-ext skip (4), B. Line-level rules 11-17 (8), C. Policy overrides (3). Fixtures generate 80-char base64 payloads at runtime via `crypto.randomBytes` to dodge the plugin's own pre-edit credential hook on the test source. - `tests/lib/severity.test.mjs` — rewritten with v2 scoring table (70 tests total, was 52). - `tests/lib/output.test.mjs:243` — "1 critical = score 80" under v2 (was 25 under v1). - Full suite: 1485/1485 green (was 1461). Docs - `CHANGELOG.md` — v7.0.0 entry with BREAKING CHANGES section. - `README.md` (plugin + marketplace root) — version badge, history table, plugin-card version string, test count. - `CLAUDE.md` — header version, "v7.0.0 — Trustworthy scoring" summary paragraph at the top. - `docs/security-hardening-guide.md` — new section 6 "Calibration & false positives" documenting v2 formula, context-aware entropy scanner, typosquat allowlist, and §6.4 tuning workflow. Existing "Recommended baseline" section renumbered to §7. Version bump - `6.6.0 -> 7.0.0` across package.json, .claude-plugin/plugin.json, scanners/ide-extension-scanner.mjs VERSION const, README badge, CLAUDE.md header, marketplace root README card. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-19 22:26:35 +02:00
Kjell Tore Guttormsen	80c3e2d39a	chore(release): bump llm-security to v6.6.0	2026-04-18 11:04:42 +02:00
Kjell Tore Guttormsen	9f893c3858	feat(llm-security): OS sandbox for /security ide-scan <url> (v6.5.0) VSIX fetch + extract for URL targets now runs in a sub-process wrapped by sandbox-exec (macOS) or bwrap (Linux), reusing the same primitives proven by the v5.1 git-clone sandbox. Defense-in-depth — even if our own zip-extract.mjs ever has a bypass, the kernel refuses any write outside the per-scan temp directory. New files: - scanners/lib/vsix-fetch-worker.mjs — sub-process worker. Argv: --url --tmpdir; emits one JSON line on stdout (ok/sha256/size/source/extRoot or ok:false/error/code). Silent on stderr. Exit 0/1. - scanners/lib/vsix-sandbox.mjs — wrapper. Exports buildSandboxProfile, buildBwrapArgs, buildSandboxedWorker, runVsixWorker. 35s timeout, 1 MB stdout cap. Changes: - scanners/ide-extension-scanner.mjs: fetchAndExtractVsixUrl is now sandbox-aware (useSandbox option, default true). In-process logic preserved as fallback. New meta.source.sandbox field: 'sandbox-exec' \| 'bwrap' \| 'none' \| 'in-process'. - scan(target, { useSandbox }) defaults to true; tests pass false because globalThis.fetch mocks do not cross process boundaries. - Windows fallback: in-process with meta.warnings advisory. Tests: - 8 new tests in tests/scanners/vsix-sandbox.test.mjs (per-platform profile generation, worker arg construction, live worker exit behavior on invalid URLs — no network). - Existing URL tests updated to opt out of sandbox (useSandbox: false). - 1344 → 1352 tests, all green. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-17 17:28:57 +02:00
Kjell Tore Guttormsen	fe0193956d	feat(llm-security): /security ide-scan <url> — Marketplace/OpenVSX/direct VSIX (v6.4.0) Pre-installation verification of VS Code extensions via URL — fetch a remote VSIX, extract it in a hardened sandbox, and run the existing IDE scanner pipeline against it. No npm dependencies. Sources: - VS Code Marketplace (publisher.gallery.vsassets.io direct download) - OpenVSX (open-vsx.org official API) - Direct .vsix HTTPS URLs Defenses: - HTTPS-only, TLS verified, manual redirect with per-source host whitelist - 30s total timeout via AbortController - 50MB compressed cap, 500MB uncompressed, 100x expansion ratio - Zero-dep ZIP extractor: zip-slip, absolute paths, drive letters, NUL bytes, symlinks (Unix mode 0xA000), depth limits, ZIP64 rejected, encrypted rejected - SHA-256 streamed during fetch, surfaced in meta.source - Temp dir cleanup in all paths (try/finally) Files: - scanners/lib/vsix-fetch.mjs (HTTPS fetcher, host whitelist, streaming SHA-256) - scanners/lib/zip-extract.mjs (zero-dep parser with hardening caps) - knowledge/marketplace-api-notes.md (endpoint reference) - 3 test files (48 tests added: vsix-fetch, zip-extract, ide-extension-url) Tests: 1296 → 1344 (all green). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-17 17:16:26 +02:00
Kjell Tore Guttormsen	6252e55700	feat(llm-security): add /security ide-scan — VS Code / JetBrains extension prescan (v6.3.0) New standalone scanner (prefix IDE) discovers installed VS Code extensions across forks (Cursor, Windsurf, VSCodium, code-server, Insiders, Remote-SSH) and runs 7 IDE-specific threat checks: blocklist match (CRITICAL), theme-with-code, sideload (unsigned .vsix), dangerous uninstall hook (HIGH), wildcard activation, extension-pack expansion, typosquat (MEDIUM). Per-extension reuse of UNI/ENT/NET/TNT/MEM/SCR scanners with bounded concurrency. Offline-first; --online opt-in. JetBrains discovery stubbed for v1.1. 22 new tests (1296 total, was 1274). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-17 16:23:35 +02:00
Kjell Tore Guttormsen	80b4952f2c	chore(release): v6.2.0 — bash-normalize T5/T6, PreCompact hook, hardening guide	2026-04-17 14:55:26 +02:00
Kjell Tore Guttormsen	3bcd0d4bc4	docs(claude-md): link Defense Philosophy to Opus 4.7 system card	2026-04-17 14:50:07 +02:00
Kjell Tore Guttormsen	98ba545759	docs(claude-md): reframe bash-normalize as defense-in-depth layer	2026-04-17 14:37:03 +02:00
Kjell Tore Guttormsen	2c33e9cc64	feat(ci): add CI/CD integration — --fail-on, --compact, pipeline templates Add threshold-based exit codes (--fail-on <severity>) and compact output mode (--compact) to scan-orchestrator and CLI. Pipeline templates for GitHub Actions, Azure DevOps, GitLab CI with SARIF upload. CI/CD guide with Schrems II/NSM compliance documentation. npm publish preparation (files whitelist, .npmignore). Policy ci section for distributable CI defaults. Version 6.1.0. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-10 14:59:05 +02:00
Kjell Tore Guttormsen	dea17a1c11	chore(release): bump to v6.0.0 — CAISS-readiness release with compliance, governance, CLI Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-10 14:03:10 +02:00
Kjell Tore Guttormsen	708c898754	feat(llm-security): sandboxed remote cloning v5.1.0 Harden git clone attack surface for remote scans with defense-in-depth: Layer 1 (all platforms): 8 git config flags disable hooks, symlinks, filter/smudge drivers, fsmonitor, local file protocol. 4 env vars isolate from system/user git config and block interactive prompts. Layer 2 (OS sandbox): macOS sandbox-exec and Linux bubblewrap (bwrap) restrict file writes to only the specific temp directory. bwrap probe-tests availability before use. Graceful fallback on Windows and Ubuntu 24.04+ (git config hardening only). Additional: post-clone 100MB size check, UUID-unique evidence filenames, evidence file cleanup, cleanup guarantee in scan/plugin-audit commands. 32 new tests (1147 total). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-07 17:08:32 +02:00
Kjell Tore Guttormsen	f93d6abdae	feat: initial open marketplace with llm-security, config-audit, ultraplan-local	2026-04-06 18:47:49 +02:00

36 commits