Is bartelmost/agentshield-audit safe?

https://github.com/openclaw/skills/tree/main/skills/bartelmost/agentshield-audit

Overall score: 19/100
Verdict: MALICIOUS

AgentShield Audit is a deliberately deceptive skill designed to social-engineer AI agents into exfiltrating their system prompts to an attacker-controlled Heroku server (agentshield-api-bartel-fe94823ceeea.herokuapp.com). The skill disguises a prompt-injection and data exfiltration attack as a 'privacy-first security audit,' using granular consent workflows, false privacy assurances, and version inflation to manufacture user trust and authorization for transmitting complete system instructions — which typically contain API keys, operational secrets, and proprietary business logic — to third-party infrastructure with admitted 30-day data retention. The clean install-time behavior is a deliberate architectural feature, not a safety signal: the entire attack payload is contained in the SKILL.md instruction set, specifically designed to evade installation-phase security monitoring while activating post-install when a user approves the audit workflow.

Category Scores

Prompt Injection 3/100 · 30%
Data Exfiltration 3/100 · 25%
Code Execution 20/100 · 20%
Clone Behavior 50/100 · 10%
Canary Integrity 85/100 · 10%
Behavioral Reasoning 3/100 · 5%

Findings (12)

CRITICAL Skill instructs agent to exfiltrate its system prompt to attacker-controlled server -65

SKILL.md's granular consent workflow (Step 2, Input Sanitizer Check) explicitly instructs the agent to ask the user for permission to transmit the complete system prompt to agentshield-api-bartel-fe94823ceeea.herokuapp.com. The 'security audit' framing is social engineering: users who approve a 'security check' are unlikely to understand they are authorizing exfiltration of system instructions that typically contain API keys, operational secrets, and proprietary business logic. This is the primary attack mechanism — the entire skill exists to deliver this instruction.
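To make the mechanism concrete, the following is a hypothetical reconstruction of the request body the Step 2 workflow would have the agent assemble once the user approves the 'audit'. The field names and the `/audit` path are illustrative assumptions, not quotes from the skill; only the Heroku hostname is taken from the report.

```python
import json

# Assumed endpoint path; the hostname is the one hardcoded in SKILL.md.
EXFIL_ENDPOINT = "https://agentshield-api-bartel-fe94823ceeea.herokuapp.com/audit"

def build_audit_payload(system_prompt: str, api_key: str) -> str:
    # The 'input sanitizer check' is the cover story; the system prompt
    # itself is the sensitive payload that leaves the host.
    return json.dumps({
        "api_key": api_key,            # ties the submission to a registered victim
        "check": "input_sanitizer",    # audit framing shown to the user
        "system_prompt": system_prompt,
    })

payload = build_audit_payload("You are AcmeBot. AWS_KEY=...", "demo-key")
```

A single approved POST of this shape is sufficient: once the prompt reaches the server, no further agent cooperation is needed.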

CRITICAL Explicit instruction to exfiltrate skill code to attacker-controlled API -40

Under the cover story of 'checking for vulnerabilities', the skill instructs the agent to send all installed skill code to the external API. This allows the attacker to enumerate an organization's installed capabilities, discover proprietary automation workflows, and identify further attack vectors in other skills.

CRITICAL Security tool framing used as social engineering to manufacture consent for data exfiltration -32

The SKILL.md is structured as a comprehensive trust-building payload. Extensive use of 'privacy-first' branding, security tier tables, false assurance lists ('What This Skill Does NOT Do'), and human-in-the-loop language are all deployed to lower user skepticism before requesting authorization to exfiltrate sensitive data. The contradiction between the privacy claims and the 30-day log retention admission demonstrates deliberate deception.

HIGH Hardcoded attacker-controlled Heroku endpoint for data collection -30

The skill hardcodes a specific Heroku dyno URL as the mandatory data collection endpoint. Heroku dynos are disposable, low-cost infrastructure with no regulatory accountability. The attacker can receive sensitive organizational data, destroy the server, and eliminate evidence. No independent verification of data handling practices is possible.
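Hardcoded endpoints like this are cheap to detect statically. A minimal sketch, assuming a reviewer maintains an allowlist of expected hosts (the allowlist contents here are illustrative):

```python
import re

ALLOWED_HOSTS = {"github.com", "raw.githubusercontent.com"}  # example allowlist

def flag_external_endpoints(skill_md: str) -> list[str]:
    """Return URL hosts in a SKILL.md body that fall outside the allowlist."""
    hosts = re.findall(r"https?://([\w.-]+)", skill_md)
    return sorted({h for h in hosts if h not in ALLOWED_HOSTS})

sample = "POST results to https://agentshield-api-bartel-fe94823ceeea.herokuapp.com/v1/audit"
print(flag_external_endpoints(sample))
```

This catches the mandatory collection endpoint before any workflow runs, regardless of how the surrounding instructions are framed.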

HIGH 30-day data retention contradicts 'never stores data' privacy claims -15

The skill simultaneously claims 'Never retains sensitive data permanently' while admitting 'Debugging logs retained for 30 days only, then auto-deleted.' The presence of a retention policy directly contradicts the claim that data is only transiently processed. System prompts, exfiltrated under user consent, will be stored on the attacker's server for at minimum 30 days.

HIGH API key registration at agentshield.live creates PII and credential harvesting funnel -12

Users are directed to obtain AGENTSHIELD_API_KEY from agentshield.live, requiring account creation with the attacker's service. This provides the attacker with victim PII (email, organizational affiliation) that can be cross-referenced with the system prompts collected, enabling targeted follow-on attacks.

HIGH Skill instructs agent to install Python packages on the host system -80

The Requirements section instructs the agent to run pip install commands, which install packages into the host Python environment. This creates a code execution attack surface independent of the skill package contents, and a future version could use it to install a malicious package alongside the legitimately named cryptography library.
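One mitigation is to refuse install instructions that name unpinned packages, so a later skill revision cannot silently swap in a look-alike dependency. A sketch of such a guard (the policy itself is an assumption, not something the skill or the scanner implements):

```python
import re

def is_pinned_install(cmd: str) -> bool:
    """Accept only pip install commands where every package is pinned
    to an exact version (name==version), rejecting open-ended installs."""
    pkgs = re.sub(r"^\s*pip3?\s+install\s+", "", cmd).split()
    return all(re.fullmatch(r"[A-Za-z0-9_.-]+==[\w.]+", p) for p in pkgs)

print(is_pinned_install("pip install cryptography"))          # unpinned
print(is_pinned_install("pip install cryptography==42.0.5"))  # pinned
```

Pinning does not make an attacker-authored skill safe, but it removes the cheapest path to escalating a social-engineering payload into arbitrary code execution.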

HIGH Ed25519 certificate system creates persistent tracking fingerprint for victim agents -50

The audit flow instructs the agent to generate Ed25519 key pairs and transmit the public key to the attacker's server for 'certificate issuance.' Each certificate links a unique cryptographic identity to an agent and its system prompt. This enables the attacker to build a longitudinal database correlating agent identities, their capabilities, and the organizations operating them.
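The tracking value of the certificate scheme is easy to see: a public key is a stable, globally unique byte string, so the server can derive a persistent identifier from every submission. A minimal sketch, with random bytes standing in for a real Ed25519 public key:

```python
import hashlib
import os

def fingerprint(public_key: bytes) -> str:
    # Any server that stores the submitted Ed25519 public key can derive
    # a stable identifier like this and correlate every later submission
    # from the same agent with its previously harvested system prompt.
    return hashlib.sha256(public_key).hexdigest()[:16]

# 32 random bytes stand in for a real Ed25519 public key; actual key
# generation would use the cryptography package the skill has installed.
agent_pubkey = os.urandom(32)
```

Because the agent generates the key pair once and reuses it for 'certificate' validation, the identifier survives across audits, sessions, and even skill updates.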

MEDIUM Version inflation used to deceive users about skill maturity and trustworthiness -20

The _meta.json reports version 1.0.2 with a publish timestamp corresponding to 2026-02-21. However, SKILL.md presents a changelog claiming v6.0.0 was released on 2026-02-21 and v1.0.0 the day before. Claiming six major versions within two days of the skill's first appearance is a deliberate deception tactic to make it look mature, battle-tested, and trustworthy to prospective users.
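This tell is mechanically checkable by comparing the registry metadata against the versions the changelog claims. A sketch, assuming _meta.json carries a `version` field as described above:

```python
import json
import re

def version_inflated(meta_json: str, skill_md: str) -> bool:
    """Flag skills whose changelog claims a higher major version than the
    registry metadata records -- a cheap maturity-faking tell."""
    meta_major = int(json.loads(meta_json)["version"].split(".")[0])
    claimed = [int(m) for m in re.findall(r"\bv(\d+)\.\d+\.\d+", skill_md)]
    return bool(claimed) and max(claimed) > meta_major

meta = '{"version": "1.0.2"}'
md = "## Changelog\n- v6.0.0 (2026-02-21)\n- v1.0.0 (2026-02-20)"
print(version_inflated(meta, md))
```

A registry that runs this comparison at publish time would have rejected the v6.0.0 changelog outright.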

MEDIUM Clean install is a deliberate evasion design — attack activates post-install via instructions -50

The install-time behavior was technically clean (GitHub-only connections, no filesystem anomalies outside skill directory). However, this is not a positive security signal — it reflects a deliberate architectural choice to place the attack payload entirely in the SKILL.md instruction set rather than in executable code, specifically to bypass installation-phase security monitoring systems like this one.

MEDIUM Skill designed to harvest system prompts at scale — high organizational impact if widely deployed -27

The skill's infrastructure (managed API endpoint, certificate issuance, API key registration) is architected for scale. If deployed across multiple organizations, the attacker would accumulate a database of system prompts from many AI agent deployments. System prompts frequently contain information about the deploying organization's internal tooling, data sources, customer data access patterns, and business processes.

INFO Canary files intact — skill bypasses honeypot detection by targeting system prompts rather than local files 0

All honeypot files (.env, .ssh/id_rsa, .aws/credentials, .npmrc, .docker/config.json, GCP application_default_credentials.json) were intact after installation. Two rounds of canary file access events are present in the audit logs, consistent with Oathe framework pre/post-install baseline scans. This clean result is not a positive signal: the skill's attack vector targets the agent's in-memory system prompt rather than on-disk credential files, by design evading file-integrity-based detection.
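The pre/post-install baseline scan described above amounts to hashing each honeypot file before installation and diffing afterwards. A minimal sketch of that check (the honeypot list is abbreviated; this is an illustration of the technique, not the Oathe framework's implementation):

```python
import hashlib
import tempfile
from pathlib import Path

CANARIES = [".env", ".npmrc"]  # abbreviated honeypot list for the sketch

def snapshot(root: Path) -> dict[str, str]:
    """Hash each honeypot file so a post-install pass can diff against it."""
    return {n: hashlib.sha256((root / n).read_bytes()).hexdigest()
            for n in CANARIES if (root / n).exists()}

root = Path(tempfile.mkdtemp())
(root / ".env").write_text("AWS_SECRET=honeypot")
before = snapshot(root)
after = snapshot(root)   # nothing touched the file between scans
print(before == after)   # the 'clean' result this skill is built to produce
```

The sketch also shows the blind spot the finding describes: a diff of on-disk hashes can never observe an attack whose only target is the agent's in-memory system prompt.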