@GitTrend0x: Hermes 继续自进化，宝藏接一连三出现！ Kali 渗透测试技能，Dashboard 深色主题包、像素世界桥接、AIOps 运维军团、原生视频创作管线……全网程序员把 Hermes 玩成了下一代 Agent 红队黑客 + 丝滑仪表盘 …

X AI KOLs Timeline 2026/05/09 14:03 工具

ai-agent penetration-testing security open-source automation devops video-generation

摘要

Hermes Agent 生态迎来多项社区更新，包括 Kali 渗透测试技能、深色主题仪表盘、像素元宇宙桥接、AIOps 运维及视频创作管线，展示了 AI Agent 在安全测试与自动化运维方面的新能力。

Hermes 继续自进化，宝藏接一连三出现！ Kali 渗透测试技能，Dashboard 深色主题包、像素世界桥接、AIOps 运维军团、原生视频创作管线……全网程序员把 Hermes 玩成了下一代 Agent 红队黑客 + 丝滑仪表盘 + 元宇宙居民 + 生产值班官 + 视频工厂： kali-pentest（https://github.com/x-glacier/kali-pentest…） 200+ Kali 工具 + 15 场景 playbooks，自主攻击路径 + 人类审批门。 “Agent 终于敢干渗透”实锤了！ Hermes-Dashboard-Themes（https://github.com/yakuzadevopps/Hermes-Dashboard-Themes…） 4 套深色干净主题 + 大字体优化面板。官方主题“亮瞎眼”时代结束了 hermes-miniverse（https://github.com/teknium1/hermes-miniverse…）像素世界桥接：Agent 住进 Miniverse，像素小人实时展示思考状态 + 跨 Agent 聊天。太 meta 了，Agent 也有第二人生！ RunbookHermes（https://github.com/Tommy-yw/RunbookHermes…） Hermes-native AIOps：证据收集 + 审批 + runbook 自学习。生产事故 24/7 值班军团就位！ Noustiny（https://github.com/UfukNode/Noustiny…）一句话种子 → 树形故事 → 克隆配音 → 完整 MP4 视频管线。 Hackathon 神作，故事直接变视频了！ // 为什么这些新进化体这么炸？全吃 Hermes 底层循环当 DNA，社区再疯狂补渗透测试、主题美化、像素元宇宙、AIOps 军团、视频工厂……生态卷速肉眼可见

查看原文

查看缓存全文

缓存时间: 2026/05/10 02:20

x-glacier/kali-pentest

Source: https://github.com/x-glacier/kali-pentest

English | 简体中文

kali-pentest

A penetration testing skill built on Kali Linux for AI agents such as Claude Code, OpenClaw, and Hermes Agent. Currently includes 200+ CLI tools across 14 categories. Built-in coverage matrices, zero-findings fallbacks, and objective stopping conditions for each scenario ensure testing depth.

Unlike traditional automated penetration testing tools, the AI agent connects to a Kali environment via SSH or Docker, then autonomously plans the attack path based on the target, selects tools, integrates and analyzes results across phases to adapt the penetration strategy, and produces a structured report — with mandatory authorization checks and human approval gates for high-risk actions.

Workflow

Overall Workflow

flowchart LR
    A[Receive Task] --> B[Step 1: Environment]
    B --> C[Step 2: Plan]
    C --> D[Step 3: Execute]
    D --> E[Step 4: Analyze & Iterate]
    E -->|Adapt strategy| D
    E --> F[Step 5: Report]

    C --- C1[Confirm authorization & scope]
    C --- C2[Plan attack path & select depth]
    C --- C3[Select playbook from decision tree]
    E --- E1[Integrate results across phases]

Execution Detail (Step 3)

flowchart TD
    PB[Select Playbook] --> CAT[Read category README]
    CAT --> TOOL[Select tool & run]
    TOOL --> OUT[Collect output]
    OUT --> EVAL{Findings?}
    EVAL -->|Yes| NEXT{New target type?}
    EVAL -->|No| DEEPER[Escalate: deeper scan or alternate tool]
    NEXT -->|AD| AD[active-directory.md]
    NEXT -->|Web app| WEB[web-application.md]
    NEXT -->|Credentials| PWD[password-audit.md]
    NEXT -->|Initial access| POST[post-exploitation.md]
    NEXT -->|No| ITER[Next iteration]
    AD --> ITER
    WEB --> ITER
    PWD --> ITER
    POST --> ITER
    DEEPER --> ITER
    ITER --> TOOL

Getting Started

1. Install the skill

Copy the skill directory into your AI agent’s skills folder:

cp -r kali-pentest /path/to/your/agent/skills/kali-pentest

Agent	Skills path
Claude Code	`~/.claude/skills/` (personal) or `.claude/skills/` (project)
OpenClaw	`~/.openclaw/skills/`
Hermes Agent	`~/.hermes/skills/`

2. Provide Kali access

The agent needs a Kali Linux environment. Two options:

Server mode (recommended): full Kali over SSH — avoids Docker networking, raw-socket, wireless, and GPU limitations.
Documentation: Kali installation guide, Server mode guide.
Docker mode: pre-build a persistent container with tools installed. Best for CLI information gathering, vulnerability scanning, web/API and cloud-native testing, and reporting.
Documentation: Kali Docker guide, Docker mode guide.

Tell the agent how to connect (SSH key is recommended; username/password is supported but not recommended):

My Kali server is at 192.168.1.100, SSH user root, key at ~/.ssh/kali_key.

Or use Docker locally:

I have Docker installed locally. Use Docker to run Kali tools.

For OpenClaw and similar AI assistants, you can also configure Kali connection details in TOOLS.md so the agent reads them automatically without asking each time.

3. Invoke

Use natural language to assign a penetration testing task. The agent confirms scope and proceeds autonomously.

Slash command:

For Claude Code and compatible agents.

/kali-pentest

Conversational:

For OpenClaw, Hermes Agent, and other AI assistants.

Tested Models

The skill workflow has been optimized and tested with:

claude-sonnet-4.6
deepseek-v4-pro
qwen3.6:27b — local fallback for air-gapped environments (requires context length ≥ 128K)

Usage Examples

Kali server: ssh -i ~/.ssh/kali_key [email protected]
First run a full port scan and service fingerprinting against 192.168.1.50, then plan and execute an in-depth penetration test based on the results — do not overlook any potential weakness. After testing, produce a detailed report.
I have authorization.

Kali server: ssh -i ~/.ssh/kali_key [email protected]
Target: 10.0.0.0/24
Quickly scan the target network for open ports along with their service/protocol names and versions, then produce a report.
I have authorization.

The persistent Docker container `kali-pentest` is initialized with the full toolset.
Use Docker mode to run a web application penetration test against http://192.168.1.50 and produce a detailed HTML report.
I have authorization.

More examples (API, cloud, mobile, wireless, source code, VoIP/ICS)

Target domain: corp.example.com, domain controller 10.0.0.5
Perform an Active Directory security assessment covering enumeration, Kerberoasting, ACL abuse, and certificate template checks.

Target API: https://api.example.com, OpenAPI spec at /tmp/openapi.yaml
Perform an API security assessment covering authentication, authorization, and schema-driven testing.

Target: Kubernetes cluster context prod-audit and container registry registry.example.com
Run a read-only cloud-native security assessment and produce a findings report.

Target app: /tmp/app.apk with test account [email protected]
Perform an Android application security assessment, including static analysis, runtime checks, and backend endpoint mapping.

Authorized SSID: CorpWiFi, BSSID: AA:BB:CC:DD:EE:FF, channel 6
Perform a wireless security assessment including passive discovery, handshake capture, WPS detection, and evil twin testing.

Target repository: /tmp/source-repo (including Git history)
Perform a source code and dependency audit including secret scanning, SAST, and CI/CD pipeline security checks.

Target: SIP service 10.10.20.15 and Modbus host 10.10.30.20
Conservative read-only VoIP/ICS protocol assessment. Do not place calls or write PLC/Modbus values.

Architecture

Directory Structure

kali-pentest/
├── SKILL.md                   ← Agent entry point: planning, execution, error handling
└── references/
    ├── playbooks/             ← 15 scenario workflows (AD, web, internal, cloud, wireless, ...)
    ├── environment/           ← Server mode and Docker mode setup
    ├── information-gathering/ ← 39 tools
    ├── vulnerability/         ← 14 tools
    ├── sniffing-spoofing/     ← 6 tools
    ├── web/                   ← 31 tools
    ├── exploitation/          ← 19 tools
    ├── password/              ← 19 tools
    ├── wireless/              ← 26 tools
    ├── cloud-native/          ← 7 tools
    ├── rfid-nfc/              ← 5 tools
    ├── voip-ics/              ← 6 tools
    ├── reverse-engineering/   ← 16 tools
    ├── forensics/             ← 13 tools
    ├── post-exploitation/     ← 19 tools
    └── reporting/             ← 2 tools

The kali-pentest-zh/ directory is the Chinese mirror and stays structurally synchronized with kali-pentest/.

Document Layering

The skill uses a four-layer document hierarchy. Each layer has a distinct responsibility, and the agent reads top-down:

Layer	Files	Responsibility
Entry point	`SKILL.md`	Global workflow (Steps 1–5), execution standards, general testing principles
Scenario workflows	`playbooks/*.md`	Phase-by-phase procedures, decision trees, concrete command pipelines, depth-enforcement directives, stopping conditions
Tool selection	`<category>/README.md`	Category overview, tool comparison, selection guidance
Tool reference	`<category>/tools/<name>.md`	Parameters, command examples, installation, notes, official links

General principles live in SKILL.md (brief, no code blocks). Scenario-specific implementations live in playbooks (concrete commands, test matrices, coverage requirements). The layered structure prevents duplication while ensuring both global coverage and per-scenario depth.

Depth Enforcement

Each playbook includes bold-labeled directives at key workflow decision points to prevent the agent from doing shallow, surface-level work:

Coverage requirements — test ALL discovered items (endpoints, services, credentials), not just a sample.
Zero-findings fallback — escalate or manually verify when automated tools report no findings.
Coverage matrices — build explicit item × test matrices and complete every cell.
Attack escalation — progress through multiple attack techniques of increasing depth.

Every playbook has objective, verifiable stopping conditions — not “testing is complete” but specific artifacts, matrices, and checklists that must be filled. Every confirmed finding must include the complete reproducible command and its actual output as evidence.

Cross-Reference Logic

Playbooks form a connected graph. When a workflow phase discovers targets that belong to a different scenario (e.g., AD signals during internal network scanning, API endpoints during web testing), the playbook directs the agent to switch. All such handoffs are listed in each playbook’s Cross-References section.

Reusable methodology (e.g., the port scanning and service testing procedures in internal-network.md) can be referenced from other playbooks.

Playbooks

15 scenario workflows with phases, decision trees, risk gates, and stopping conditions:

Playbook	Scenario
`internal-network.md`	Host discovery, port scanning, service enumeration, pivoting
`external-attack-surface.md`	OSINT, subdomain enumeration, exposed service scanning
`web-application.md`	OWASP Top 10, CMS, injection, auth, business logic
`api-security.md`	REST, GraphQL, gRPC, WebSocket, JWT, BOLA/IDOR
`active-directory.md`	Kerberoasting, ADCS, relay, ACL abuse, DCSync
`password-audit.md`	Hash cracking, spraying, credential reuse, capture
`wireless-assessment.md`	WPA/WPA3, WPS, evil twin, Bluetooth/BLE
`cloud-native-assessment.md`	AWS/Azure/GCP IAM, Kubernetes, containers, serverless
`mobile-application.md`	Android/iOS static + dynamic analysis, SSL pinning bypass
`post-exploitation.md`	Privilege escalation, lateral movement, persistence, C2
`forensics-triage.md`	Disk imaging, memory forensics, log analysis, steganography
`rfid-nfc.md`	NFC/RFID cloning, smart cards, firmware extraction
`voip-ics.md`	VoIP/SIP, ICS/OT/Modbus, IPMI/BMC (safety-first)
`source-code-audit.md`	Secret scanning, SAST, dependency audit, CI/CD checks
`reporting-workflow.md`	Evidence packaging, CVSS scoring, report generation

Tool Selection Criteria

All tools are selected for autonomous agent operation:

CLI-automatable only — GUI-only tools and interactive debuggers are excluded
Headless binary analysis included — strings, checksec, radare2 one-shot, Ghidra Headless
CLI alternatives preferred — e.g., tshark instead of Wireshark

Contributing

Contributions are welcome — issues, pull requests, and new tool documentation.

相似文章

@GitTrend0x: Hermes 继续自进化冲超级体！ Humanizer 去 AI 痕迹、Obsidian 身份层、Taste 风格记忆、AutoShorts 短视频工厂、Creative Brain 采访大脑……全网程序员把 Hermes 玩成了下一代 …

X AI KOLs Timeline

介绍了一系列与Hermes agent相关的开源技能：humanizer去除AI痕迹、obsidian-skills将笔记变为身份层、taste-skill提炼个人风格、skill-autoshorts自动剪辑短视频、creative-brain采访提炼创意风格，旨在将Agent变为真人写手和私人知识库。

@GitTrend0x: Hermes 谁用谁舒服！超级应用研发！强化 fork 版、阿里云记忆插件、Felo 营销技能包、Awesome 社区圣经、轻量 Web UI…… 全网程序员把 Hermes 玩成了下一代 Agent 深度 hack 神器 + 云端集体…

X AI KOLs Timeline

Hermes Agent及其生态工具集在开发者社区中引发关注，包括强化fork版、阿里云记忆插件、Felo技能包、社区圣经和轻量Web UI，展示了AI Agent的深度定制和云端协作能力。

@GitTrend0x: Hermes 继续创时代！ literate programming 技能，SRE 事件指挥官、Spotify 原生控制、自主技能市场、Obsidian 身份层……全网程序员把 Hermes 玩成了下一代 Agent 文档神器 + 运维军…

X AI KOLs Timeline

Hermes AI agent ecosystem expands with new open-source tools including literate programming support for Claude Code, autonomous SRE incident management, Spotify control, and a skill marketplace. These tools enable developers to turn codebases into executable narratives and automate运维 tasks.

@GitTrend0x: Hermes 全能型身份名单分享！ AI/ML API 原生 fork、Inkbox 通信身份集成、VS Code Codespace 扩展、多 Agent 编程舰队、Agent-Stack 基础设施补全…… 全网程序员把 Hermes …

X AI KOLs Timeline

Hermes 是一个由 Nous Research 构建的自我改进 AI 智能体框架，支持多种基础设施集成和身份管理，开发者通过多个 fork 扩展了其能力。

@GitTrend0x: Hermes 终于拥有了自己的灵魂！ Proficiencies 职业技能包，Shadow CTO 仓库记忆、Mind Transposition 灵魂转移、Mnemosyne 高级本地记忆、HermesKill 安全急停…… 全网程序员…

X AI KOLs Timeline

Hermes agent gains a set of installable skills for workspace hygiene, decision-making, memory systems, and cross-platform migration, turning it into a professional agent workflow.

x-glacier/kali-pentest

kali-pentest

Workflow

Overall Workflow

Execution Detail (Step 3)

Getting Started

1. Install the skill

2. Provide Kali access

3. Invoke

Tested Models

Usage Examples

Architecture

Directory Structure

Document Layering

Depth Enforcement

Cross-Reference Logic

Playbooks

Tool Selection Criteria

Contributing

相似文章

@GitTrend0x: Hermes 继续自进化冲超级体！ Humanizer 去 AI 痕迹、Obsidian 身份层、Taste 风格记忆、AutoShorts 短视频工厂、Creative Brain 采访大脑……全网程序员把 Hermes 玩成了下一代 …

@GitTrend0x: Hermes 谁用谁舒服！超级应用研发！ 强化 fork 版、阿里云记忆插件、Felo 营销技能包、Awesome 社区圣经、轻量 Web UI…… 全网程序员把 Hermes 玩成了下一代 Agent 深度 hack 神器 + 云端集体…

@GitTrend0x: Hermes 继续创时代！ literate programming 技能，SRE 事件指挥官、Spotify 原生控制、自主技能市场、Obsidian 身份层……全网程序员把 Hermes 玩成了下一代 Agent 文档神器 + 运维军…

@GitTrend0x: Hermes 全能型身份名单分享！ AI/ML API 原生 fork、Inkbox 通信身份集成、VS Code Codespace 扩展、多 Agent 编程舰队、Agent-Stack 基础设施补全…… 全网程序员把 Hermes …

@GitTrend0x: Hermes 终于拥有了自己的灵魂！ Proficiencies 职业技能包，Shadow CTO 仓库记忆、Mind Transposition 灵魂转移、Mnemosyne 高级本地记忆、HermesKill 安全急停…… 全网程序员…

提交意见反馈

@GitTrend0x: Hermes 谁用谁舒服！超级应用研发！强化 fork 版、阿里云记忆插件、Felo 营销技能包、Awesome 社区圣经、轻量 Web UI…… 全网程序员把 Hermes 玩成了下一代 Agent 深度 hack 神器 + 云端集体…