feat(llm-security-copilot): port llm-security v5.1.0 to GitHub Copilot CLI

Full port of llm-security plugin for internal use on Windows with GitHub Copilot CLI. Protocol translation layer (copilot-hook-runner.mjs) normalizes Copilot camelCase I/O to Claude Code snake_case format — all original hook scripts run unmodified. - 8 hooks with protocol translation (stdin/stdout/exit code) - 18 SKILL.md skills (Agent Skills Open Standard) - 6 .agent.md agent definitions - 20 scanners + 14 scanner lib modules (unchanged) - 14 knowledge files (unchanged) - 39 test files including copilot-port-verify.mjs (17 tests) - Windows-ready: node:path, os.tmpdir(), process.execPath, no bash Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-04-09 21:56:10 +02:00 · 2026-04-09 21:56:10 +02:00 · f418a8fe08
commit f418a8fe08
parent 901bf0ae12
169 changed files with 37631 additions and 0 deletions
--- a/plugins/llm-security-copilot/templates/archive/audit-report.md
+++ b/plugins/llm-security-copilot/templates/archive/audit-report.md
@ -0,0 +1,391 @@
+# Security Audit Report
+
+<!--
+TEMPLATE USAGE
+This is a reference document describing the expected output structure for `/security audit`.
+Agents use this as a formatting guide for a comprehensive project-wide audit.
+Fill every section with real findings. Do NOT output placeholder text.
+If a category is not applicable, mark it N/A and explain briefly why.
+-->
+
+---
+
+## Header
+
+| Field | Value |
+|-------|-------|
+| **Project** | [Name of the project or repository that was audited] |
+| **Repository** | [e.g. `github.com/org/repo`] |
+| **Audit date** | [ISO 8601 — e.g. 2026-02-19] |
+| **Auditor** | llm-security v[X.X] (automated) |
+| **Baseline** | Claude Code Security Baseline v1.0 + OWASP LLM Top 10 (2025) |
+| **Scope** | [Brief description — e.g. "Full project: source, skills, hooks, MCP configs, Docker, deployment"] |
+
+---
+
+## Executive Summary
+
+### Overall Grade: [A / B / C / D / F] ([X]%)
+
+```
+Security Posture   [==========]  X.0 / 9.0
+
+  PASS     |||       [n] categories
+  PARTIAL  ||||||    [n] categories
+  FAIL                [n] categories
+```
+
+| Severity | Count |
+|----------|------:|
+| Critical | [n] |
+| High | [n] |
+| Medium | [n] |
+| Low | [n] |
+| **Total** | **[n]** |
+
+**Summary:** [3–5 sentences covering the overall security posture: what the project does well, what the primary risks are, and the most urgent action required.]
+
+---
+
+## Category Assessment
+
+### Category 1 — Deny-First Configuration
+
+| Status | [PASS / PARTIAL / FAIL / N/A] |
+|--------|-------------------------------|
+
+**Evidence:**
+- [Bullet per observation — what was found, with file paths and line references where relevant]
+- [If PASS: confirm deny-first posture is correctly configured]
+- [If PARTIAL/FAIL: specify exactly what is missing or misconfigured]
+
+**Recommendations:**
+- [Specific, actionable recommendation — omit if PASS]
+
+---
+
+### Category 2 — Secrets Protection
+
+| Status | [PASS / PARTIAL / FAIL / N/A] |
+|--------|-------------------------------|
+
+**Evidence:**
+- [Bullet per observation]
+
+**Recommendations:**
+- [Specific, actionable recommendation — omit if PASS]
+
+---
+
+### Category 3 — Path Guarding
+
+| Status | [PASS / PARTIAL / FAIL / N/A] |
+|--------|-------------------------------|
+
+**Evidence:**
+- [Bullet per observation]
+
+**Recommendations:**
+- [Specific, actionable recommendation — omit if PASS]
+
+---
+
+### Category 4 — MCP Server Trust
+
+| Status | [PASS / PARTIAL / FAIL / N/A] |
+|--------|-------------------------------|
+
+**Evidence:**
+- [Bullet per MCP server found — source, auth status, scope assessment]
+- [Include trust verdict per server: Trusted / Suspect / Unknown]
+
+**Recommendations:**
+- [Specific, actionable recommendation — omit if PASS]
+
+---
+
+### Category 5 — Destructive Command Blocking
+
+| Status | [PASS / PARTIAL / FAIL / N/A] |
+|--------|-------------------------------|
+
+**Evidence:**
+- [Bullet per observation]
+
+**Recommendations:**
+- [Specific, actionable recommendation — omit if PASS]
+
+---
+
+### Category 6 — Sandbox Configuration
+
+| Status | [PASS / PARTIAL / FAIL / N/A] |
+|--------|-------------------------------|
+
+**Evidence:**
+- [Bullet per observation]
+
+**Recommendations:**
+- [Specific, actionable recommendation — omit if PASS]
+
+---
+
+### Category 7 — Human Review Requirements
+
+| Status | [PASS / PARTIAL / FAIL / N/A] |
+|--------|-------------------------------|
+
+**Evidence:**
+- [Bullet per observation]
+
+**Recommendations:**
+- [Specific, actionable recommendation — omit if PASS]
+
+---
+
+### Category 8 — Skill and Plugin Sources
+
+| Status | [PASS / PARTIAL / FAIL / N/A] |
+|--------|-------------------------------|
+
+**Evidence:**
+- [Bullet per observation — first-party vs third-party, lock file status, marketplace trust]
+
+**Recommendations:**
+- [Specific, actionable recommendation — omit if PASS]
+
+---
+
+### Category 9 — Session Isolation
+
+| Status | [PASS / PARTIAL / FAIL / N/A] |
+|--------|-------------------------------|
+
+**Evidence:**
+- [Bullet per observation]
+
+**Recommendations:**
+- [Specific, actionable recommendation — omit if PASS]
+
+---
+
+## Scan Findings
+
+Findings grouped by severity, sorted Critical → High → Medium → Low.
+Each finding ID is formatted `SCN-[NNN]` (e.g. `SCN-001`).
+
+---
+
+### Critical Findings ([n])
+
+> Omit this section if no Critical findings.
+
+#### SCN-001 — [Short title]
+
+| Field | Value |
+|-------|-------|
+| **File** | `[path/to/file:line]` |
+| **OWASP** | [e.g. LLM06:2025 Excessive Agency] |
+
+[Full description paragraph: what was found, why it is a risk, what an attacker could do with it.]
+
+```
+[Exact code or config excerpt that triggered the finding — redact actual secret values]
+```
+
+**Remediation:** [Concrete, actionable fix. Include example code or config snippet where helpful.]
+
+---
+
+#### SCN-002 — [Short title]
+
+| Field | Value |
+|-------|-------|
+| **File** | `[path/to/file:line]` |
+| **OWASP** | [OWASP reference] |
+
+[Description paragraph.]
+
+```
+[Evidence excerpt]
+```
+
+**Remediation:** [Fix.]
+
+---
+
+### High Findings ([n])
+
+> Omit this section if no High findings.
+
+#### SCN-[NNN] — [Short title]
+
+| Field | Value |
+|-------|-------|
+| **File** | `[path/to/file:line]` |
+| **OWASP** | [OWASP reference] |
+
+[Description paragraph.]
+
+```
+[Evidence excerpt]
+```
+
+**Remediation:** [Fix.]
+
+---
+
+### Medium Findings ([n])
+
+> Omit this section if no Medium findings.
+
+#### SCN-[NNN] — [Short title]
+
+| Field | Value |
+|-------|-------|
+| **File** | `[path/to/file:line]` |
+| **OWASP** | [OWASP reference] |
+
+[Description paragraph.]
+
+**Remediation:** [Fix.]
+
+---
+
+### Low Findings ([n])
+
+> Omit this section if no Low findings.
+
+#### SCN-[NNN] — [Short title]
+
+| Field | Value |
+|-------|-------|
+| **File** | `[path/to/file:line]` |
+| **OWASP** | [OWASP reference] |
+
+[Description paragraph.]
+
+**Remediation:** [Fix.]
+
+---
+
+## Risk Matrix
+
+```
+              LIKELIHOOD
+              Low          Medium       High
+         +------------+------------+------------+
+  High   |            |            |            |
+         |            |            |            |
+IMPACT   +------------+------------+------------+
+  Med    |            |            |            |
+         |            |            |            |
+         +------------+------------+------------+
+  Low    |            |            |            |
+         |            |            |            |
+         +------------+------------+------------+
+```
+
+Place each `Cat [N]` label in the cell matching its assessed likelihood and impact.
+Categories with Critical findings belong in High/High.
+Categories with PASS status typically appear in Low/Low.
+
+---
+
+## Prioritized Action Plan
+
+Sorted by risk. IMMEDIATE items must be resolved before the next deployment.
+
+| # | Priority | Action | Finding | Effort | Risk if deferred |
+|---|----------|--------|---------|--------|------------------|
+| 1 | **IMMEDIATE** | [Specific action] | SCN-[NNN] | [Low / Med / High] | [Risk description] |
+| 2 | **IMMEDIATE** | [Specific action] | SCN-[NNN] | [Low / Med / High] | [Risk description] |
+| 3 | **HIGH** | [Specific action] | SCN-[NNN] | [Low / Med / High] | [Risk description] |
+| 4 | **HIGH** | [Specific action] | Posture | [Low / Med / High] | [Risk description] |
+| 5 | **MEDIUM** | [Specific action] | SCN-[NNN] | [Low / Med / High] | [Risk description] |
+| 6 | **LOW** | [Specific action] | Posture | [Low / Med / High] | [Risk description] |
+
+---
+
+## Positive Findings
+
+The following security controls are in place and working correctly:
+
+- **[Control name]** — [Brief description of what is working and where it was confirmed]
+- **[Control name]** — [Description]
+- **[Control name]** — [Description]
+
+*(Remove any bullet that does not apply. Add as many as warranted by the evidence.)*
+
+---
+
+## Methodology
+
+This audit was performed by automated assessment agents:
+
+1. **posture-assessor-agent** — Evaluated 9 security categories against the Claude Code Security Baseline v1.0, collecting file-level evidence and assigning PASS/PARTIAL/FAIL status per category.
+
+2. **skill-scanner-agent** — Scanned all skills, commands, agents, hooks, source code, and configs for 7 threat categories derived from ToxicSkills/ClawHavoc research, OWASP LLM Top 10 (2025), and OWASP Agentic AI Top 10.
+
+[Add or remove agents as applicable. Include mcp-scanner-agent if MCP servers were analyzed.]
+
+Both agents operated in read-only mode. No files were modified during this assessment.
+
+**Limitations:**
+- Static analysis only — no runtime behavior observed
+- Source code spot-checked, not exhaustively reviewed
+- [Add project-specific limitations, e.g. "Extension dependencies not audited for known CVEs"]
+- Third-party MCP servers and marketplace content not analyzed beyond declared configs
+
+---
+
+*Report generated [ISO 8601 timestamp] by llm-security v[X.X]*
+*Baseline: Claude Code Security Baseline v1.0*
+*OWASP references: LLM Top 10 2025, Agentic AI Top 10*
+*Next recommended audit: [e.g. Before next major release or within 30 days]*
+
+---
+
+<!--
+GRADING LOGIC (for agents filling in this template)
+
+Count categories with status PASS (excluding N/A from denominator):
+  Applicable = total categories - N/A count
+  Pass rate  = PASS count / Applicable count
+  Percentage = PASS count / Applicable count * 100 (round to 1 decimal)
+
+Grade table:
+  A : Pass rate >= 0.89 AND zero Critical findings AND zero High findings
+  B : Pass rate >= 0.78 AND zero Critical findings
+  C : Pass rate >= 0.56 AND at most 1 Critical finding
+  D : Pass rate >= 0.33
+  F : Pass rate < 0.33 OR 3+ Critical findings
+
+STATUS DEFINITIONS
+  PASS    : Fully implemented, no gaps found
+  PARTIAL : Partially implemented — describe what is missing
+  FAIL    : Not implemented or actively misconfigured
+  N/A     : Category does not apply to this project type (explain why)
+
+PROGRESS BAR FORMULA
+  Bar length = 10 characters
+  Filled = round(PASS_count / applicable_count * 10)
+  Example: 6 PASS out of 9 → filled=7 → [=======---]  6.0 / 9.0
+  Use PARTIAL as 0.5 towards the score: score = PASS + (PARTIAL * 0.5)
+  Example: 3 PASS + 6 PARTIAL = 3 + 3 = 6.0 → [======----]
+
+SCAN FINDING SEVERITY CRITERIA
+  Critical : Exploit is direct and unauthenticated, or blast radius is system-wide (e.g. RCE, credential exfil, unauthenticated remote access)
+  High     : Exploit requires some conditions but risk is significant (e.g. injection with attacker-controlled input, auth bypass under specific config)
+  Medium   : Indirect risk, defense-in-depth gap, or bad practice likely to become exploitable (e.g. example docs showing unsafe patterns, non-root install missing)
+  Low      : Informational hygiene issue with low exploitability on its own (e.g. EXPOSE for unused ports, missing generic gitignore entry)
+
+FINDING ID FORMAT
+  SCN-[NNN] — three-digit zero-padded integer, sequential per report
+  Agents: Do NOT reuse IDs across reports. Start at SCN-001 for every new audit.
+
+OWASP REFERENCE FORMAT
+  Use: LLM0N:2025 [Full Category Name]
+  Example: LLM06:2025 Excessive Agency
+  Reference: knowledge/owasp-llm-top10.md for full category list
+-->