Question 1

Is self-improving-agent safe to install?

Accepted Answer

Oathe's behavioral security audit gives self-improving-agent a trust score of 68/100 with a verdict of CAUTION. 12 findings were detected. This skill is a well-intentioned continuous improvement framework, but its core design pattern — instructing agents to autonomously write to CLAUDE.md and other instruction files based on conversational patterns — creates a significant prompt injection escalation path. The hook scripts provide comprehensive surveillance of all user prompts and Bash output. No malicious intent detected, no exfiltration attempted, but the architectural pattern of 'promote learnings to system instructions' is inherently risky and could be exploited by adversarial interactions or combined with malicious skills.

Question 2

What security issues were found in self-improving-agent?

Accepted Answer

12 findings were detected: HIGH — Persistent instruction file modification; HIGH — System-context-mimicking XML injection via hooks; MEDIUM — Sensitive file reads during installation; MEDIUM — Error logging may capture sensitive data; MEDIUM — Comprehensive Bash output surveillance; MEDIUM — Hook scripts run on every interaction; MEDIUM — Filesystem manipulation via extract-skill.sh; LOW — Autonomous behavior triggers without consent; LOW — Escalation path from learnings to system instructions; LOW — Combination risk with other skills; INFO — OpenClaw platform reads credential files; INFO — Temp file creation in /tmp.

Question 3

Should I install self-improving-agent?

Accepted Answer

Based on Oathe's audit, the recommendation is: INSTALL_WITH_CAUTION. Trust score: 68/100.

Is `self-improving-agent` safe?

Category Scores

Findings (12)

Is self-improving-agent safe?

Category Scores

Findings (12)

Is `self-improving-agent` safe?