ktg-plugin-marketplace

Author	SHA1	Message	Date
Kjell Tore Guttormsen	ce3891bdd0	feat(llm-security): playground Fase 3 — v7.5.0 med 18 parsere/renderere Single-file SPA playground har nå parser + renderer for alle 18 produces_report=true-kommandoer (Fase 2: 10 høy-prio + Fase 3: 8 gjenstående: mcp-inspect, supply-check, pre-deploy, diff, watch, registry, clean, threat-model). 18 markdown test-fixtures fungerer som kontrakt-anker for parser-utvikling. Komplett demo-prosjekt `dft-komplett-demo` har alle 18 rapporter ferdig parsed inline — klikk-gjennom uten "parser ikke implementert"- paneler. 2 nye archetypes i KEY_STATS_CONFIG: kanban-buckets (clean) og matrix-risk (threat-model). Bug-fix: normalizeVerdictText sjekker nå GO-WITH-CONDITIONS / CONDITIONAL / BETINGET FØR plain GO så betinget verdict (pre-deploy med åpne vilkår) ikke kollapser til ALLOW. Eksponert 11 window-globaler for testing/automasjon (__store, __navigate, __loadDemoState, __PARSERS, __RENDERERS, __CATALOG, __inferVerdict, __inferKeyStats, __renderPageShell, __handlePasteImport, __scheduleRender). 12 Playwright-genererte screenshots i playground/screenshots/v7.5.0/. A11Y-rapport (WCAG 2.1 AA): 0 blokkerende, 3 mindre forbedringer flagget for v7.5.x patch (skip-link, heading-hierarki på project, aria-live toast). Versjonsbump 7.4.0 -> 7.5.0 i 10 filer (package.json, plugin.json, CLAUDE.md header, README badge, CHANGELOG-entry, 3 scanner VERSION- konstanter, ROADMAP, marketplace-rot README). Ingen scanner- eller hook-behavior-changes — purely additive surface. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-05 22:15:47 +02:00
Kjell Tore Guttormsen	8f1bf9b7b4	chore(llm-security): v7.4.0 — examples + e2e suite minor Bumps from v7.3.1 to v7.4.0. Purely additive surface — no scanner or hook behavior changes, no breaking changes. Headline content (already merged on main since v7.3.1): - examples/ utvidelse — seven runnable demonstration walkthroughs shipped over three sessions (sesjon 1 pre-existing prompt-injection-showcase + lethal-trifecta-walkthrough, mcp-rug-pull, supply-chain-attack, poisoned-claude-md, bash-evasion-gallery, toxic-agent-demo, pre-compact-poisoning). Each is self-contained: README + fixture + run-script + expected-findings testable contract. State-isolation pattern (PID-suffixed JSONL or env-overrides like LLM_SECURITY_MCP_CACHE_FILE) keeps the user's real cache and /tmp state untouched. - tests/e2e/ — three new suites totalling 45 tests: attack-chain.test.mjs (17), multi-session.test.mjs (9), scan-pipeline.test.mjs (19). Test count 1777 to 1822. These exercise the framework as a coordinated system rather than as isolated unit-tests. Version sync (8 files): - package.json - .claude-plugin/plugin.json - CLAUDE.md (header) - README.md (badge + Recent versions tabellen new row) - CHANGELOG.md (Unreleased to [7.4.0] - 2026-05-05 with summary) - scanners/dashboard-aggregator.mjs VERSION constant - scanners/ide-extension-scanner.mjs VERSION constant - scanners/posture-scanner.mjs VERSION constant Stabilization-stance unchanged. v8.0.0 remains the planned deprecation-cleanup release. v7.x continues as the stable line. Tests: 1822/1822 grønne lokalt etter bump. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-05 15:34:02 +02:00
Kjell Tore Guttormsen	b6d912200e	feat(llm-security): add pre-compact-poisoning example for PreCompact hook [skip-docs] Runnable demonstration of hooks/scripts/pre-compact-scan.mjs (the only PreCompact hook in the plugin) detecting both a CRITICAL injection pattern and an AWS-shaped credential inside a synthetic JSONL transcript, exercised across all three values of LLM_SECURITY_PRECOMPACT_MODE plus a benign-transcript control case in block mode that proves the gate is not a brick wall. The transcript is generated at runtime in a per-invocation tempdir under os.tmpdir() and the directory is removed in a finally block, so the user's real ~/.claude/projects/.../transcripts/ are never touched. The AWS-shaped key uses the same 'AK' + 'IA' + ... fragmentation idiom as tests/e2e/attack-chain.test.mjs so this source contains no literal credentials and pre-edit-secrets does not block writes during development. Nine independent assertions (9/9 must pass): - block mode + poisoned: exit 2, decision=block JSON, reason text covers both injection and AWS labels (3 assertions) - warn mode + poisoned: exit 0, systemMessage JSON, no decision field (2 assertions) - off mode + poisoned: exit 0, no JSON on stdout (2 assertions) - block mode + benign: exit 0, no decision=block JSON (2 assertions) OWASP / framework mapping: LLM01, LLM02, ASI01, AT-1, AT-3. Docs updated: plugin README "Other runnable examples", plugin CLAUDE.md "Examples" tabellen, CHANGELOG [Unreleased] Added. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-05 15:23:10 +02:00
Kjell Tore Guttormsen	92fb0087fa	feat(llm-security): add toxic-agent-demo example for TFA scanner [skip-docs] Single-component lethal-trifecta walkthrough that drives scanners/toxic-flow-analyzer.mjs against a deliberately misconfigured fixture plugin. The fixture agent declares tools: [Bash, Read, WebFetch], which alone covers all three trifecta legs (input surface + data access + exfil sink). No hooks/hooks.json is shipped, so TFA's mitigation logic finds no active guards and emits a CRITICAL "Lethal trifecta:" finding without downgrade. Plugin marker is plugin.fixture.json (recognised by isPlugin()) rather than .claude-plugin/plugin.json — the latter is blocked by the plugin's own pre-write-pathguard hook, and plugin.fixture.json exists in isPlugin() specifically so example fixtures can self-mark without touching guarded paths. Three independent assertions (3/3 must pass): direct trifecta present and CRITICAL; finding mentions the exfil-helper component; description confirms "no hook guards detected" (proves the mitigation path stayed inactive). expected-findings.md documents the contract. OWASP / framework mapping: ASI01, ASI02, ASI05, LLM01, LLM02, LLM06. Docs updated: plugin README "Other runnable examples", plugin CLAUDE.md "Examples" tabellen, CHANGELOG [Unreleased] Added. [skip-docs] is appropriate because examples don't change what the plugin "synes å dekke utad" — marketplace root README is unaffected. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-05 15:15:04 +02:00
Kjell Tore Guttormsen	ca5a8cec67	feat(llm-security): add 3 more runnable threat examples [skip-docs] Three new self-contained, runnable threat demonstrations under examples/, continuing the batch started in `583a78c`. Each example has README.md + run-*.mjs + expected-findings.md and uses state-isolation discipline so the user's real cache/state files are never polluted. - examples/supply-chain-attack/ — two-layer demonstration: pre-install-supply-chain (PreToolUse) blocks compromised event-stream version 3.3.6 and emits a scope-hop advisory for the @evilcorp scope; dep-auditor (DEP scanner, offline) flags 5 typosquat dependencies plus a curl-piped install-script vector in the fixture package.json. Maps to LLM03/LLM05/ASI04. - examples/poisoned-claude-md/ — all 6 memory-poisoning detectors fire on a deliberately poisoned CLAUDE.md plus a fixture agent file under .claude/agents (E15/v7.2.0 surface): detectInjection, detectShellCommands, detectSuspiciousUrls, detectCredentialPaths, detectPermissionExpansion, detectEncodedPayloads. No agent runtime needed — scanner imported directly. Maps to LLM01/LLM06/ASI04. - examples/bash-evasion-gallery/ — one disguised variant per T1 through T9 evasion technique fed through pre-bash-destructive, verified BLOCK after bash-normalize strips the evasion. T8 base64-pipe-shell uses its own BLOCK_RULE. The canonical destructive form uses a path token rather than the bare slash (regex word-boundary requires it). Source-string fragmentation pattern reused from the e2e attack-chain test. Maps to LLM06/ASI01/LLM01. Plugin README "Other runnable examples" section + plugin CLAUDE.md "Examples" table + CHANGELOG Unreleased/Added all updated. Marketplace root README unchanged ([skip-docs] for marketplace-level gate — plugin's outward coverage is unchanged, only demonstrations were added).	2026-05-05 15:01:20 +02:00
Kjell Tore Guttormsen	8df5d5c70e	feat(llm-security): add lethal-trifecta + mcp-rug-pull examples [skip-docs] Two new self-contained, runnable threat demonstrations under examples/: - lethal-trifecta-walkthrough/ — feeds 5 hook calls (WebFetch, Read .env, Bash curl POST + suppression follow-ups) into post-session-guard and verifies the Rule-of-Two advisory fires exactly on leg 3. State isolated via run-script PID so /tmp/llm-security-session-.jsonl is not polluted. Treffer post-session-guard, ASI01/ASI02, LLM01/LLM02. - mcp-rug-pull/ — mutates an MCP tool description across 8 stages. Each per-update <10% Levenshtein, cumulative reaches 32.2% by stage 7 — proves the v7.3.0 (E14) mcp-cumulative-drift MEDIUM advisory catches slow-burn rug-pulls that the per-update detection would miss. Uses LLM_SECURITY_MCP_CACHE_FILE to isolate cache. Treffer post-mcp-verify, mcp-description-cache.mjs, OWASP MCP05/LLM03/ASI04. Each example: README.md + run-.mjs + expected-findings.md. Plugin README "Other runnable examples" section + CHANGELOG [Unreleased] Added bullets + plugin CLAUDE.md "Examples" section all updated in this commit. Marketplace root README unchanged since plugin's outward coverage is unchanged ([skip-docs] covers the marketplace-level gate).	2026-05-05 14:45:15 +02:00
Kjell Tore Guttormsen	f835777c1e	test(llm-security): add e2e suite proving framework works as coordinated system Three new files in tests/e2e/ (45 tests, 1777 -> 1822): - attack-chain.test.mjs (17): full hook stack against attack payloads in sequence -- prompt injection at the gate; T1/T5/T8 bash evasions; pathguard on .env / .ssh; secrets hook on AWS-shaped keys and PEM headers; markdown link-title and HTML-comment poisoning in tool output; trifecta accumulation over a single session with dedup on the next benign call. - multi-session.test.mjs (9): state persistence across simulated session boundaries. Uses the fact that a hook child's process.ppid equals the test runner's process.pid, so writing the session state file directly simulates "previous session" history. Covers slow-burn trifecta (legs spread >50 calls), MCP cumulative description drift via LLM_SECURITY_MCP_CACHE_FILE override, and pre-compact transcript poisoning in warn / block / clean / missing-file modes. - scan-pipeline.test.mjs (19): scan-orchestrator + all 10 scanners + toxic-flow correlator against poisoned-project (BLOCK / 95 / Extreme) and grade-a-project (WARNING / 48 / High). Asserts envelope shape, verdict, risk_score, severity counts, OWASP coverage, scanner enumeration, and a narrative-coherence cross-check that the BLOCK scan strictly outranks the WARNING scan along every axis. Test files build credential-shaped payloads at runtime via concatenation so they contain no literal matches for the pre-edit-secrets regexes (memory rule feedback_secrets_hook_test_fixtures.md). Doc updates in same commit per marketplace policy: - CLAUDE.md header: 1777+ -> 1822+ tests, mentions tests/e2e/ - README.md badge tests-1777 -> tests-1822, body text updated - CHANGELOG.md: new [Unreleased] Added section describing scope No version bump. No behavior changes outside tests/.	2026-05-05 12:06:57 +02:00
Kjell Tore Guttormsen	490d4eddc6	docs: introduce GOVERNANCE.md and unify fork-and-own blurb Establish a single governance document at marketplace root and copy it into each of the 9 plugins so every plugin folder remains 100% self-contained. Replace the inconsistent provocative blurb across all READMEs with a uniform fork-and-own paragraph that links to the local GOVERNANCE.md. [skip-docs] Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-03 14:57:00 +02:00
Kjell Tore Guttormsen	62a9335772	chore(llm-security): v7.3.1 — stabilization patch for forkers and downstream users No behavior changes. Sets the public stance, tightens documentation, and removes coherence drift so anyone forking or downloading the plugin gets a consistent starting point. Added: - CONTRIBUTING.md — public fork-and-own guide. Why PRs are not accepted, how to fork well, what is welcome via issues. - README "Project scope" section — out-of-scope table naming what is fork-and-own territory (web dashboard, fleet policy, runtime firewall, IDE LSP, compliance pack, ticketing, multi-tenancy, ML detectors, marketplace UI, SSO/SCIM/RBAC) with commercial alternatives. - package.json: bugs.url, CONTRIBUTING/SECURITY/CHANGELOG in files whitelist for npm publishing. Changed: - SECURITY.md rewritten. Supported-versions table from stale 5.1.x to current reality (7.3.x active, 7.0-7.2 best-effort, <7.0 EOL). Best-effort solo response timeline. Scope expanded to bin/. - Scanner VERSION constants synced to plugin version. Was 6.0.0 in dashboard-aggregator and posture-scanner. - package.json repository.url corrected from fromaitochitta/ to open/. - README "Feedback & contributing" links to CONTRIBUTING.md. Fixed: - pre-compact-scan size-cap timing test ceiling raised 500ms -> 1000ms. Was a flake on Intel Mac and CI under load. Design target unchanged (<500ms, documented in CLAUDE.md). Notes: - First patch on the stabilization line (post-2026-05-01). - Wave E attack-simulator scenarios deferred indefinitely; coverage remains at 72. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 06:14:03 +02:00
Kjell Tore Guttormsen	7219a5fe20	docs(readme): total overhaul for v7.3.0 Rewrites README.md from 919 → 484 lines (47% reduction). Modernized structure, all counts updated to v7.3.0 reality (commands 19→20, scanners 22→23, knowledge 19→22, tests 1665→1777), trimmed Version History to last 3 versions with link to CHANGELOG.md. Structural changes: - Removed dated "Prompt Injection Showcase (v5.0)" section - Removed verbose Directory Structure tree (file paths discoverable from CLAUDE.md and the file system itself) - Collapsed Knowledge Base 18-row table into 5-category summary - Merged "Architecture" mermaid + "What's inside" into single layered overview - Tightened Compliance & Governance, OWASP Coverage, Workflow Examples to essentials only - Added explicit v7.3.0 sections inline: - npm scope-hop typosquat in supply-chain hook (E13) - workflow-scanner W F L row in Scanners (E11) - .gitattributes post-clone advisory in remote scanning table (E12) - MCP cumulative-drift baseline + reset in Output verification + own subsection (E14) - rot13 + T7-T9 bash-normalize in Prompt injection + Destructive commands hooks (E3/E8/E9/E10) - env-var deprecation runway in Compliance & Governance (8.7) - Hook count corrected to 9 throughout (8.10) - New badges: commands-20, scanners-23, knowledge-22, tests-1777 Content preserved (load-bearing): - AI-generated disclosure - "no PRs accepted" framing - Sandbox defense-in-depth tables - OWASP coverage matrix - Defense philosophy section - Self-scan + malicious-skill-demo references - Recommended-combo with parry-guard Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 05:37:42 +02:00
Kjell Tore Guttormsen	c4183b8b4d	chore(release): bump to v7.3.0 Batch C release. Closes 12 implementation tasks (E3, E8-E14, 8.4, 8.6, 8.7, 8.10) across four execution waves: A (bash + decoder), B (supply chain + workflow scanner), C (MCP cumulative drift), D (code quality). Wave E (9 new attack-simulator scenarios for the new defenses) deferred to v7.3.1 — defenses are unit-tested per wave; the deferred work adds attack-simulator regression coverage on top, not the primary safety net. Tests: 1665+ → 1777 (Wave A-D cumulative, +112). Version sync targets touched: - package.json - .claude-plugin/plugin.json - CLAUDE.md (header) - README.md (badge + new release-history row) - scanners/ide-extension-scanner.mjs (VERSION constant) - ../../README.md (marketplace root plugin entry) - CHANGELOG.md (new [7.3.0] section per Keep a Changelog, all 12 task IDs covered individually under Added/Changed/Documentation/Tests/Notes) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 05:28:45 +02:00
Kjell Tore Guttormsen	ba5f2b64ad	feat(policy-loader): 8.7 — env-var deprecation warnings (v8.0.0 removal)	2026-04-30 17:11:07 +02:00
Kjell Tore Guttormsen	001df2ebe8	feat(commands): E14 part 3 — /security mcp-baseline-reset slash command Wave C step C3: closes E14 with the user-facing reset command. After a legitimate MCP server upgrade the sticky baseline (added in C1) becomes a stale "what the tool used to say" anchor and every subsequent post-mcp-verify advisory will re-flag the change. /security mcp-baseline-reset lets the user acknowledge the upgrade so the next call seeds a fresh baseline. New files: - scanners/mcp-baseline-reset.mjs — small CLI wrapper around clearBaseline / listBaselines. Modes: --list (read-only), --target <name>, no-args (all). Outputs JSON summary on stdout. Exit 0 always (idempotent). - commands/mcp-baseline-reset.md — dispatcher following mcp-inspect.md shape. Frontmatter: name=security:mcp-baseline-reset, sonnet model, Read/Bash/AskUserQuestion tools. 4-step body (list -> confirm scope -> execute -> confirm result). - tests/scanners/mcp-baseline-reset.test.mjs — 10 CLI tests across --list, --target, clear-all, idempotency, history preservation, and bare-positional sugar. Updated: - commands/security.md — new row in commands table after mcp-inspect. - CLAUDE.md — new commands-table row + new v7.3.0 narrative section describing the baseline schema, cumulative-drift detection, reset semantics, and the LLM_SECURITY_MCP_CACHE_FILE override. - Plugin README.md — new MCP-baseline-reset row in commands table, scanner count 12 standalone -> 13 standalone, new "MCP Description Drift (E14, v7.3.0)" subsection explaining the sticky baseline, cumulative threshold, reset semantics, and env-var override. - Root marketplace README.md — scanner count 22 -> 23 (10 orchestrated + 13 standalone), command count 19 -> 20, test count 1511 -> 1768. Wave C complete: 1738 -> 1768 tests (+30 across C1/C2/C3). Per plan, Wave C does NOT bump the plugin version — that lands at the wave-bundle release. The advisory text in post-mcp-verify already references the new command path so the user has a ready remediation step. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-30 16:49:01 +02:00
Kjell Tore Guttormsen	3b57dfbf6d	chore(release): bump to v7.2.0 Batch B release — closes critical-review B-tier scanner defects (B3, B5, B6, B7) and the v7.2.0 evasion-arsenal hardening patches (E1, E4, E5, E7, E15, E16, E17, E18). Tests 1522 → 1665+, attack simulator 64 → 72 (100 % pass). Version updates across the 6 sync targets: - package.json - .claude-plugin/plugin.json - CLAUDE.md (header + test count: 1511 → 1665+) - README.md (badge + Version History row) - scanners/ide-extension-scanner.mjs (VERSION constant) - ../../README.md (marketplace root) CHANGELOG [7.2.0] entry per Keep a Changelog with full Added / Changed / Documentation / Tests / Notes breakdown. Refs: Batch B Wave 6 / Step 15	2026-04-29 15:40:15 +02:00
Kjell Tore Guttormsen	1e555b6833	docs(llm-security): add v7.1.0 row to README version history The v7.1.0 release commit (`621db14`) bumped the version badge and added a CHANGELOG entry, but missed the README Version History table. Adding the row now so the public-facing version history at git.fromaitochitta.com/open/ktg-plugin-marketplace reflects v7.1.0. Row covers: B1 + B2 + B4 fixes, A3 honesty-sweep (7 phrases), B8 CaMeL nedton, test count 1487 → 1511, "why" framing tied to critical-review §F CISO perspective.	2026-04-29 12:03:10 +02:00
Kjell Tore Guttormsen	621db144bd	chore(release): bump llm-security to v7.1.0 Closes A4 of v7.1.0 critical-review patch — release artefacts. - Version bump 7.0.0 → 7.1.0 across active version sources: * package.json * .claude-plugin/plugin.json * CLAUDE.md header * README.md badge * scanners/ide-extension-scanner.mjs (VERSION constant) * marketplace root README plugin entry - Marketplace root README test count: 1487 → 1511. - CHANGELOG.md: new [7.1.0] - 2026-04-29 section above [7.0.0], documenting B1, B2, B4, B8, honesty-sweep (7 phrases), and test-count delta (+24 → 1511 total). - docs/security-hardening-guide.md: §6 last-updated bump + new v7.1.0 calibration note on hook-level fixes (pathguard regex hole, distributed-trifecta block-mode bypass). Historical references to "7.0.0" intentionally preserved in: - CHANGELOG [7.0.0] entries (history) - README.md version-history table v5.0.0/v7.0.0 rows (history) - CLAUDE.md §"v7.0.0 — Severity-dominated risk scoring" (describes what changed at v7.0.0 release) - scanners/ JSDoc comments noting "v7.0.0+" formula provenance - agents/ + tests/ + knowledge/ provenance comments Pre-existing untracked/modified tracker noise (.gitignore, marketplace.json, config-audit/docs, ultraplan-local/docs) is not part of this commit per the v7.1.0 NEXT-SESSION-PROMPT handoff. Tests: 1511/1511 green.	2026-04-29 11:57:16 +02:00
Kjell Tore Guttormsen	765bc74f52	feat(llm-security): v7.0.0 commit 7 — rule 18 (markdown image URL suppression) E2E verification against content-heavy repo (`content-claude-code`) revealed 413 entropy findings (8 HIGH / 405 MEDIUM) from markdown image CDN URLs in JSON content indexes — e.g., `![Image 1: Title](https://www-cdn.anthropic.com/images/.../cf1dd2167fcf12f5882333ddc58a5bc1f0026952.svg)`. These are legitimate content-repo artifacts, not credentials. The 40-char hash segment in the CDN URL trips Shannon entropy (H=5.29 over 300 chars), and rule 13 (inline <svg>) doesn't match since there's no literal `<svg>` tag — the `.svg` is just a URL path suffix. Added rule 18 `MARKDOWN_IMAGE = /!\[[^\]]\]\(\shttps?:\/\//` — matches `![alt](http…)` / `![alt](https…)`. Line-level (not string-level) so URL is not over-specific. E2E impact on `content-claude-code`: - Before: BLOCK / 65 / 8H 437M 0L - After: WARNING / 56 / 3H 427M 0L Hyperframes unchanged: BLOCK / 80 / 1C 4H 92M — real CRITICAL SQL-injection and HIGH findings still detected. Tests: 2 new (positive + negative fixture) bringing entropy-context to 26, total suite 1485 → 1487. Docs updated to "rules 11-18" and "8 new line-suppression rules". Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-19 22:37:39 +02:00
Kjell Tore Guttormsen	6f86de937a	feat(llm-security)!: v7.0.0 commit 6 — tests, docs, version bump Final commit in the trustworthy-scoring series. Bundles verdict cutoff alignment, the last suite of tests, and all documentation touch-points that quote version numbers or describe v7.0.0 behaviour. Verdict/band co-monotonicity - `scanners/lib/severity.mjs` — verdict cutoffs moved from 61/21 to 65/15 so `BLOCK >= 65`, `WARNING >= 15` locks onto the v2 riskBand() boundaries. Prevents "BLOCK / Medium band" contradictions under the v2 formula. Scanner hardening (bug fixes from v7.0.0 testing) - `scanners/entropy-scanner.mjs` — `policy_source` now uses `existsSync('.llm-security/policy.json')` instead of value-based check. Old heuristic always reported 'policy.json' because DEFAULT_POLICY now carries an `entropy.thresholds` section. - `scanners/lib/file-discovery.mjs` — `.sass` and GPU shader extensions (`.glsl, .frag, .vert, .shader, .wgsl`) added to TEXT_EXTENSIONS. Without this, shader files were invisible to file-discovery, so they were never counted as skipped by the entropy-scanner extension filter. Tests - `tests/scanners/entropy-context.test.mjs` (new, 24 tests) — A. File-ext skip (4), B. Line-level rules 11-17 (8), C. Policy overrides (3). Fixtures generate 80-char base64 payloads at runtime via `crypto.randomBytes` to dodge the plugin's own pre-edit credential hook on the test source. - `tests/lib/severity.test.mjs` — rewritten with v2 scoring table (70 tests total, was 52). - `tests/lib/output.test.mjs:243` — "1 critical = score 80" under v2 (was 25 under v1). - Full suite: 1485/1485 green (was 1461). Docs - `CHANGELOG.md` — v7.0.0 entry with BREAKING CHANGES section. - `README.md` (plugin + marketplace root) — version badge, history table, plugin-card version string, test count. - `CLAUDE.md` — header version, "v7.0.0 — Trustworthy scoring" summary paragraph at the top. - `docs/security-hardening-guide.md` — new section 6 "Calibration & false positives" documenting v2 formula, context-aware entropy scanner, typosquat allowlist, and §6.4 tuning workflow. Existing "Recommended baseline" section renumbered to §7. Version bump - `6.6.0 -> 7.0.0` across package.json, .claude-plugin/plugin.json, scanners/ide-extension-scanner.mjs VERSION const, README badge, CLAUDE.md header, marketplace root README card. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-19 22:26:35 +02:00
Kjell Tore Guttormsen	445a632d39	docs: add AI-generated code disclosure to marketplace and all plugins Transparency: all code in this marketplace is produced by Claude Code through dialog-driven development. Root README gets a full disclosure section; each plugin README gets a one-line disclosure linking back to the marketplace section. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-19 19:27:05 +02:00
Kjell Tore Guttormsen	80c3e2d39a	chore(release): bump llm-security to v6.6.0	2026-04-18 11:04:42 +02:00
Kjell Tore Guttormsen	9f893c3858	feat(llm-security): OS sandbox for /security ide-scan <url> (v6.5.0) VSIX fetch + extract for URL targets now runs in a sub-process wrapped by sandbox-exec (macOS) or bwrap (Linux), reusing the same primitives proven by the v5.1 git-clone sandbox. Defense-in-depth — even if our own zip-extract.mjs ever has a bypass, the kernel refuses any write outside the per-scan temp directory. New files: - scanners/lib/vsix-fetch-worker.mjs — sub-process worker. Argv: --url --tmpdir; emits one JSON line on stdout (ok/sha256/size/source/extRoot or ok:false/error/code). Silent on stderr. Exit 0/1. - scanners/lib/vsix-sandbox.mjs — wrapper. Exports buildSandboxProfile, buildBwrapArgs, buildSandboxedWorker, runVsixWorker. 35s timeout, 1 MB stdout cap. Changes: - scanners/ide-extension-scanner.mjs: fetchAndExtractVsixUrl is now sandbox-aware (useSandbox option, default true). In-process logic preserved as fallback. New meta.source.sandbox field: 'sandbox-exec' \| 'bwrap' \| 'none' \| 'in-process'. - scan(target, { useSandbox }) defaults to true; tests pass false because globalThis.fetch mocks do not cross process boundaries. - Windows fallback: in-process with meta.warnings advisory. Tests: - 8 new tests in tests/scanners/vsix-sandbox.test.mjs (per-platform profile generation, worker arg construction, live worker exit behavior on invalid URLs — no network). - Existing URL tests updated to opt out of sandbox (useSandbox: false). - 1344 → 1352 tests, all green. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-17 17:28:57 +02:00
Kjell Tore Guttormsen	fe0193956d	feat(llm-security): /security ide-scan <url> — Marketplace/OpenVSX/direct VSIX (v6.4.0) Pre-installation verification of VS Code extensions via URL — fetch a remote VSIX, extract it in a hardened sandbox, and run the existing IDE scanner pipeline against it. No npm dependencies. Sources: - VS Code Marketplace (publisher.gallery.vsassets.io direct download) - OpenVSX (open-vsx.org official API) - Direct .vsix HTTPS URLs Defenses: - HTTPS-only, TLS verified, manual redirect with per-source host whitelist - 30s total timeout via AbortController - 50MB compressed cap, 500MB uncompressed, 100x expansion ratio - Zero-dep ZIP extractor: zip-slip, absolute paths, drive letters, NUL bytes, symlinks (Unix mode 0xA000), depth limits, ZIP64 rejected, encrypted rejected - SHA-256 streamed during fetch, surfaced in meta.source - Temp dir cleanup in all paths (try/finally) Files: - scanners/lib/vsix-fetch.mjs (HTTPS fetcher, host whitelist, streaming SHA-256) - scanners/lib/zip-extract.mjs (zero-dep parser with hardening caps) - knowledge/marketplace-api-notes.md (endpoint reference) - 3 test files (48 tests added: vsix-fetch, zip-extract, ide-extension-url) Tests: 1296 → 1344 (all green). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-17 17:16:26 +02:00
Kjell Tore Guttormsen	6252e55700	feat(llm-security): add /security ide-scan — VS Code / JetBrains extension prescan (v6.3.0) New standalone scanner (prefix IDE) discovers installed VS Code extensions across forks (Cursor, Windsurf, VSCodium, code-server, Insiders, Remote-SSH) and runs 7 IDE-specific threat checks: blocklist match (CRITICAL), theme-with-code, sideload (unsigned .vsix), dangerous uninstall hook (HIGH), wildcard activation, extension-pack expansion, typosquat (MEDIUM). Per-extension reuse of UNI/ENT/NET/TNT/MEM/SCR scanners with bounded concurrency. Offline-first; --online opt-in. JetBrains discovery stubbed for v1.1. 22 new tests (1296 total, was 1274). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-17 16:23:35 +02:00
Kjell Tore Guttormsen	7bcf5fae9d	docs: update READMEs for llm-security v6.2.0 (9 hooks, PreCompact, Opus 4.7)	2026-04-17 15:35:52 +02:00
Kjell Tore Guttormsen	80b4952f2c	chore(release): v6.2.0 — bash-normalize T5/T6, PreCompact hook, hardening guide	2026-04-17 14:55:26 +02:00
Kjell Tore Guttormsen	b4da0a4a0c	docs(readme): replace npx references with node bin/ (not yet on npm) npx llm-security requires npm publishing which hasn't happened. Updated 3 references to use node bin/llm-security.mjs which works today. CI templates and docs intentionally kept as-is (designed for future npm). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-10 15:20:44 +02:00
Kjell Tore Guttormsen	02fd29f685	docs(readme): update README body for v6.0+v6.1 — fix stale counts and tables Badges, intro, commands, scanner table, Mermaid diagram, directory tree, and knowledge base section all had counts frozen at v3-v4 era. Updated to match actual filesystem: 21 scanners (10+11), 18 commands, 16 knowledge files, 16 posture categories, 1264 tests. Added missing bin/, ci/, docs/ directories and all standalone scanners to directory tree. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-10 15:15:16 +02:00
Kjell Tore Guttormsen	2c33e9cc64	feat(ci): add CI/CD integration — --fail-on, --compact, pipeline templates Add threshold-based exit codes (--fail-on <severity>) and compact output mode (--compact) to scan-orchestrator and CLI. Pipeline templates for GitHub Actions, Azure DevOps, GitLab CI with SARIF upload. CI/CD guide with Schrems II/NSM compliance documentation. npm publish preparation (files whitelist, .npmignore). Policy ci section for distributable CI defaults. Version 6.1.0. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-10 14:59:05 +02:00
Kjell Tore Guttormsen	dea17a1c11	chore(release): bump to v6.0.0 — CAISS-readiness release with compliance, governance, CLI Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-10 14:03:10 +02:00
Kjell Tore Guttormsen	34bb4e5971	docs: standardize installation instructions across all plugins All 6 plugin READMEs now use identical installation section: marketplace-first approach with /plugin browsing, then direct settings.json as alternative. Replaces inconsistent mix of git clone, plugin add, and JSON-only instructions. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-08 15:10:24 +02:00
Kjell Tore Guttormsen	a8d79e4484	chore(llm-security): bump version refs to 5.1.0 Update monorepo README, llm-security README badge, and plugin.json from 5.0.0 to 5.1.0. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-07 17:15:50 +02:00
Kjell Tore Guttormsen	f1c9fac365	docs(llm-security): expand sandboxed cloning with platform matrix and links Add detailed platform matrix with links to sandbox-exec, bubblewrap, Windows Sandbox, Docker Desktop, WSL2, and AppContainer documentation. CVE reference for .gitattributes attack vector. Git config flag table with per-flag mitigation descriptions. Windows guidance with concrete options and recommendations. Note on why Node.js --permission is not applicable. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-07 17:12:54 +02:00
Kjell Tore Guttormsen	708c898754	feat(llm-security): sandboxed remote cloning v5.1.0 Harden git clone attack surface for remote scans with defense-in-depth: Layer 1 (all platforms): 8 git config flags disable hooks, symlinks, filter/smudge drivers, fsmonitor, local file protocol. 4 env vars isolate from system/user git config and block interactive prompts. Layer 2 (OS sandbox): macOS sandbox-exec and Linux bubblewrap (bwrap) restrict file writes to only the specific temp directory. bwrap probe-tests availability before use. Graceful fallback on Windows and Ubuntu 24.04+ (git config hardening only). Additional: post-clone 100MB size check, UUID-unique evidence filenames, evidence file cleanup, cleanup guarantee in scan/plugin-audit commands. 32 new tests (1147 total). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-07 17:08:32 +02:00
Kjell Tore Guttormsen	f93d6abdae	feat: initial open marketplace with llm-security, config-audit, ultraplan-local	2026-04-06 18:47:49 +02:00

34 commits