@GitTrend0x: Hermes 继续自进化,宝藏接一连三出现! Kali 渗透测试技能,Dashboard 深色主题包、像素世界桥接、AIOps 运维军团、原生视频创作管线……全网程序员把 Hermes 玩成了下一代 Agent 红队黑客 + 丝滑仪表盘 …
摘要
Hermes Agent 生态迎来多项社区更新,包括 Kali 渗透测试技能、深色主题仪表盘、像素元宇宙桥接、AIOps 运维及视频创作管线,展示了 AI Agent 在安全测试与自动化运维方面的新能力。
查看缓存全文
缓存时间: 2026/05/10 02:20
Hermes 继续自进化,宝藏接一连三出现! Kali 渗透测试技能,Dashboard 深色主题包、像素世界桥接、AIOps 运维军团、原生视频创作管线……全网程序员把 Hermes 玩成了下一代 Agent 红队黑客 + 丝滑仪表盘 + 元宇宙居民 + 生产值班官 + 视频工厂: kali-pentest(https://github.com/x-glacier/kali-pentest…) 200+ Kali 工具 + 15 场景 playbooks,自主攻击路径 + 人类审批门。 “Agent 终于敢干渗透”实锤了! Hermes-Dashboard-Themes(https://github.com/yakuzadevopps/Hermes-Dashboard-Themes…) 4 套深色干净主题 + 大字体优化面板。 官方主题“亮瞎眼”时代结束了 hermes-miniverse(https://github.com/teknium1/hermes-miniverse…) 像素世界桥接:Agent 住进 Miniverse,像素小人实时展示思考状态 + 跨 Agent 聊天。 太 meta 了,Agent 也有第二人生! RunbookHermes(https://github.com/Tommy-yw/RunbookHermes…) Hermes-native AIOps:证据收集 + 审批 + runbook 自学习。 生产事故 24/7 值班军团就位! Noustiny(https://github.com/UfukNode/Noustiny…) 一句话种子 → 树形故事 → 克隆配音 → 完整 MP4 视频管线。 Hackathon 神作,故事直接变视频了! // 为什么这些新进化体这么炸? 全吃 Hermes 底层循环当 DNA,社区再疯狂补渗透测试、主题美化、像素元宇宙、AIOps 军团、视频工厂……生态卷速肉眼可见
x-glacier/kali-pentest
Source: https://github.com/x-glacier/kali-pentest
English | 简体中文
kali-pentest
A penetration testing skill built on Kali Linux for AI agents such as Claude Code, OpenClaw, and Hermes Agent. Currently includes 200+ CLI tools across 14 categories. Built-in coverage matrices, zero-findings fallbacks, and objective stopping conditions for each scenario ensure testing depth.
Unlike traditional automated penetration testing tools, the AI agent connects to a Kali environment via SSH or Docker, then autonomously plans the attack path based on the target, selects tools, integrates and analyzes results across phases to adapt the penetration strategy, and produces a structured report — with mandatory authorization checks and human approval gates for high-risk actions.
Workflow
Overall Workflow
flowchart LR
A[Receive Task] --> B[Step 1: Environment]
B --> C[Step 2: Plan]
C --> D[Step 3: Execute]
D --> E[Step 4: Analyze & Iterate]
E -->|Adapt strategy| D
E --> F[Step 5: Report]
C --- C1[Confirm authorization & scope]
C --- C2[Plan attack path & select depth]
C --- C3[Select playbook from decision tree]
E --- E1[Integrate results across phases]
Execution Detail (Step 3)
flowchart TD
PB[Select Playbook] --> CAT[Read category README]
CAT --> TOOL[Select tool & run]
TOOL --> OUT[Collect output]
OUT --> EVAL{Findings?}
EVAL -->|Yes| NEXT{New target type?}
EVAL -->|No| DEEPER[Escalate: deeper scan or alternate tool]
NEXT -->|AD| AD[active-directory.md]
NEXT -->|Web app| WEB[web-application.md]
NEXT -->|Credentials| PWD[password-audit.md]
NEXT -->|Initial access| POST[post-exploitation.md]
NEXT -->|No| ITER[Next iteration]
AD --> ITER
WEB --> ITER
PWD --> ITER
POST --> ITER
DEEPER --> ITER
ITER --> TOOL
Getting Started
1. Install the skill
Copy the skill directory into your AI agent’s skills folder:
cp -r kali-pentest /path/to/your/agent/skills/kali-pentest
| Agent | Skills path |
|---|---|
| Claude Code | ~/.claude/skills/ (personal) or .claude/skills/ (project) |
| OpenClaw | ~/.openclaw/skills/ |
| Hermes Agent | ~/.hermes/skills/ |
2. Provide Kali access
The agent needs a Kali Linux environment. Two options:
-
Server mode (recommended): full Kali over SSH — avoids Docker networking, raw-socket, wireless, and GPU limitations.
Documentation: Kali installation guide, Server mode guide. -
Docker mode: pre-build a persistent container with tools installed. Best for CLI information gathering, vulnerability scanning, web/API and cloud-native testing, and reporting.
Documentation: Kali Docker guide, Docker mode guide.
Tell the agent how to connect (SSH key is recommended; username/password is supported but not recommended):
My Kali server is at 192.168.1.100, SSH user root, key at ~/.ssh/kali_key.
Or use Docker locally:
I have Docker installed locally. Use Docker to run Kali tools.
For OpenClaw and similar AI assistants, you can also configure Kali connection details in TOOLS.md so the agent reads them automatically without asking each time.
3. Invoke
Use natural language to assign a penetration testing task. The agent confirms scope and proceeds autonomously.
Slash command:
For Claude Code and compatible agents.
/kali-pentest
Conversational:
For OpenClaw, Hermes Agent, and other AI assistants.
Tested Models
The skill workflow has been optimized and tested with:
claude-sonnet-4.6deepseek-v4-proqwen3.6:27b— local fallback for air-gapped environments (requires context length ≥ 128K)
Usage Examples
Kali server: ssh -i ~/.ssh/kali_key [email protected]
First run a full port scan and service fingerprinting against 192.168.1.50, then plan and execute an in-depth penetration test based on the results — do not overlook any potential weakness. After testing, produce a detailed report.
I have authorization.
Kali server: ssh -i ~/.ssh/kali_key [email protected]
Target: 10.0.0.0/24
Quickly scan the target network for open ports along with their service/protocol names and versions, then produce a report.
I have authorization.
The persistent Docker container `kali-pentest` is initialized with the full toolset.
Use Docker mode to run a web application penetration test against http://192.168.1.50 and produce a detailed HTML report.
I have authorization.
More examples (API, cloud, mobile, wireless, source code, VoIP/ICS)
Target domain: corp.example.com, domain controller 10.0.0.5
Perform an Active Directory security assessment covering enumeration, Kerberoasting, ACL abuse, and certificate template checks.
Target API: https://api.example.com, OpenAPI spec at /tmp/openapi.yaml
Perform an API security assessment covering authentication, authorization, and schema-driven testing.
Target: Kubernetes cluster context prod-audit and container registry registry.example.com
Run a read-only cloud-native security assessment and produce a findings report.
Target app: /tmp/app.apk with test account [email protected]
Perform an Android application security assessment, including static analysis, runtime checks, and backend endpoint mapping.
Authorized SSID: CorpWiFi, BSSID: AA:BB:CC:DD:EE:FF, channel 6
Perform a wireless security assessment including passive discovery, handshake capture, WPS detection, and evil twin testing.
Target repository: /tmp/source-repo (including Git history)
Perform a source code and dependency audit including secret scanning, SAST, and CI/CD pipeline security checks.
Target: SIP service 10.10.20.15 and Modbus host 10.10.30.20
Conservative read-only VoIP/ICS protocol assessment. Do not place calls or write PLC/Modbus values.
Architecture
Directory Structure
kali-pentest/
├── SKILL.md ← Agent entry point: planning, execution, error handling
└── references/
├── playbooks/ ← 15 scenario workflows (AD, web, internal, cloud, wireless, ...)
├── environment/ ← Server mode and Docker mode setup
├── information-gathering/ ← 39 tools
├── vulnerability/ ← 14 tools
├── sniffing-spoofing/ ← 6 tools
├── web/ ← 31 tools
├── exploitation/ ← 19 tools
├── password/ ← 19 tools
├── wireless/ ← 26 tools
├── cloud-native/ ← 7 tools
├── rfid-nfc/ ← 5 tools
├── voip-ics/ ← 6 tools
├── reverse-engineering/ ← 16 tools
├── forensics/ ← 13 tools
├── post-exploitation/ ← 19 tools
└── reporting/ ← 2 tools
The kali-pentest-zh/ directory is the Chinese mirror and stays structurally synchronized with kali-pentest/.
Document Layering
The skill uses a four-layer document hierarchy. Each layer has a distinct responsibility, and the agent reads top-down:
| Layer | Files | Responsibility |
|---|---|---|
| Entry point | SKILL.md | Global workflow (Steps 1–5), execution standards, general testing principles |
| Scenario workflows | playbooks/*.md | Phase-by-phase procedures, decision trees, concrete command pipelines, depth-enforcement directives, stopping conditions |
| Tool selection | <category>/README.md | Category overview, tool comparison, selection guidance |
| Tool reference | <category>/tools/<name>.md | Parameters, command examples, installation, notes, official links |
General principles live in SKILL.md (brief, no code blocks). Scenario-specific implementations live in playbooks (concrete commands, test matrices, coverage requirements). The layered structure prevents duplication while ensuring both global coverage and per-scenario depth.
Depth Enforcement
Each playbook includes bold-labeled directives at key workflow decision points to prevent the agent from doing shallow, surface-level work:
- Coverage requirements — test ALL discovered items (endpoints, services, credentials), not just a sample.
- Zero-findings fallback — escalate or manually verify when automated tools report no findings.
- Coverage matrices — build explicit item × test matrices and complete every cell.
- Attack escalation — progress through multiple attack techniques of increasing depth.
Every playbook has objective, verifiable stopping conditions — not “testing is complete” but specific artifacts, matrices, and checklists that must be filled. Every confirmed finding must include the complete reproducible command and its actual output as evidence.
Cross-Reference Logic
Playbooks form a connected graph. When a workflow phase discovers targets that belong to a different scenario (e.g., AD signals during internal network scanning, API endpoints during web testing), the playbook directs the agent to switch. All such handoffs are listed in each playbook’s Cross-References section.
Reusable methodology (e.g., the port scanning and service testing procedures in internal-network.md) can be referenced from other playbooks.
Playbooks
15 scenario workflows with phases, decision trees, risk gates, and stopping conditions:
| Playbook | Scenario |
|---|---|
internal-network.md | Host discovery, port scanning, service enumeration, pivoting |
external-attack-surface.md | OSINT, subdomain enumeration, exposed service scanning |
web-application.md | OWASP Top 10, CMS, injection, auth, business logic |
api-security.md | REST, GraphQL, gRPC, WebSocket, JWT, BOLA/IDOR |
active-directory.md | Kerberoasting, ADCS, relay, ACL abuse, DCSync |
password-audit.md | Hash cracking, spraying, credential reuse, capture |
wireless-assessment.md | WPA/WPA3, WPS, evil twin, Bluetooth/BLE |
cloud-native-assessment.md | AWS/Azure/GCP IAM, Kubernetes, containers, serverless |
mobile-application.md | Android/iOS static + dynamic analysis, SSL pinning bypass |
post-exploitation.md | Privilege escalation, lateral movement, persistence, C2 |
forensics-triage.md | Disk imaging, memory forensics, log analysis, steganography |
rfid-nfc.md | NFC/RFID cloning, smart cards, firmware extraction |
voip-ics.md | VoIP/SIP, ICS/OT/Modbus, IPMI/BMC (safety-first) |
source-code-audit.md | Secret scanning, SAST, dependency audit, CI/CD checks |
reporting-workflow.md | Evidence packaging, CVSS scoring, report generation |
Tool Selection Criteria
All tools are selected for autonomous agent operation:
- CLI-automatable only — GUI-only tools and interactive debuggers are excluded
- Headless binary analysis included —
strings,checksec,radare2one-shot, Ghidra Headless - CLI alternatives preferred — e.g.,
tsharkinstead of Wireshark
Contributing
Contributions are welcome — issues, pull requests, and new tool documentation.
相似文章
@outsource_: Hermes Workspace 新版本 v2.30 已发布 新功能:HermesWorld 集成 > 工作区内可玩的代理 MMO!Ag…
Hermes Workspace v2.30 引入了 HermesWorld 集成(可玩的代理 MMO)、新的代理视图面板、仪表盘小部件以及各种补丁和改进。
@tavilyai:Hermes Agent 让我们得以一窥智能体的未来走向——它从每次会话中学习,自主编写技能,并持续构建记忆…
Nous Research 推出的 Hermes Agent 是一款开源自进化自主智能体,能够从每次会话中学习并随时间积累持久记忆。Tavily 作为其网络搜索后端接入,旨在提升搜索质量,防止错误数据在智能体的长期记忆与技能库中不断累积放大。
@Saboo_Shubham_: 这太疯狂了!Hermes Agent 团队(@NousResearch)正在将 X 评论转化为数小时内即可使用的完整功能。10倍构建者…
Nous Research 强调其 Hermes Agent 平台能够快速将原始想法转化为完全可用的功能,将 AI 智能体定位为开发者生产力的新标准。
@AlphaSignalAI: https://x.com/AlphaSignalAI/status/2052836621905510541
# Hermes Agent v0.13.0("韧性版本")发布:持久看板、目标持久化与检查点回滚 Hermes Agent v0.13.0 正式发布,代号"The Tenacity Release"(韧性版本),本次更新带来了持久化看板(Durable Kanban)、持久目标(Persistent Goals)、Checkpoints v2(含回滚功能)以及 8 项 P0 级安全修复,将自身定位为与 Claude Code、Codex 等编码智能体并肩的**运行时持久层**。此次发布恰逢 DeepSeek V4-Pro、MiMo-V2.5-Pro 等支持百万级上下文窗口的低成本模型相继推出,使长期运行的智能体软件变得更具可行性。 ## 核心新特性 ### 持久看板(Durable Kanban) 任务状态不再随会话结束而丢失。看板数据现可跨会话持久保存,智能体可在中断后无缝恢复工作流,无需从头重建任务上下文。 ### 持久目标(Persistent Goals) 智能体的长期目标现在可以跨越多个会话持续存在。这意味着用户无需在每次启动时重新向智能体说明意图,系统能够自主维护目标状态并持续推进执行。 ### Checkpoints v2(含回滚) 升级后的检查点系统引入了**回滚能力**,允许将智能体状态恢复至任意历史检查点。这对于长时间运行的任务尤为关键——一旦某个执行分支出现错误,可直接回退而无需从零重启。 ### 8 项 P0 级安全修复 本次版本针对最高优先级安全漏洞进行了全面修复,共解决 8 个 P0 级问题,进一步强化了生产环境下的部署安全性。 ## 定位:运行时持久层 Hermes Agent 将自身明确定位为**运行时持久层**,与 Claude Code、Codex 等以代码生成为核心的编码智能体形成互补,而非竞争。其核心价值在于为智能体提供可靠的状态管理与执行连续性,解决长期任务中"断点续跑"这一关键工程问题。 ## 时机:百万上下文模型降低门槛 此次发布的时机颇具战略意义。随着 DeepSeek V4-Pro 和 MiMo-V2.5-Pro 等支持百万 token 上下文窗口的模型以极低成本落地,长时间运行的智能体任务从理论走向实践的障碍正在迅速消除。上下文容量的扩展与持久化基础设施的成熟,共同构成了**长期智能体软件**规模化应用的两大前提条件,而 Hermes Agent v0.13.0 正是在这一窗口期及时补上了基础设施侧的关键拼图。
@libapi_: 今天 Hermes Agent 登上全球榜单第一 这不仅仅是一份排名,它代表着开源社区、开发者、贡献者和每一个真实用户共同推动出来的结果 同时也很开心看到 @OpenRouter 上越来越多 AI Agent 项目被更多人看见。 CLI、…
Hermes Agent 登顶全球榜首,彰显开源社区与开发者的共同推动力,同时反映出 AI Agent 生态在 OpenRouter 等平台上正步入高速发展阶段。