← Back to Blog

We Re-Scanned the Top 50 ClawHub Skills — Things Have Changed

~3× download growth, first CRITICAL findings, 2 top skills delisted, and 34% flagged Suspicious. The ecosystem is growing fast — but trust signals are diverging.

We Re-Scanned the Top 50 ClawHub Skills — Things Have Changed

One week after our initial scan, we ran the numbers again. The ClawHub ecosystem has changed — fast.

Total downloads across the Top 50 grew from 1.25M to over 3.5M in one week. The #1 skill now has 311K downloads. But alongside the growth, new patterns have emerged that weren’t there before.

The headline: for the first time, we found CRITICAL security patterns in the Top 50. Two skills received Grade D. Two of the top 10 were delisted. And a third of the Top 50 carry a “Suspicious” flag.


Grade Distribution

GradeCount%Change
A3978%↓ from 88%
B48%=
C36%↑ from 4%
D24%NEW
DELISTED24%NEW

The Grade A share dropped 10 points. Two skills hit Grade D for the first time — both are “evolver” variants that execute system commands and modify code by design.


What’s New Since Last Week

CRITICAL findings exist now

The previous scan found zero CRITICAL patterns across all 50 skills. This time:

These findings are consistent with the skills’ stated purpose — but the security surface is extreme: 844 combined findings across 25,000+ lines of code.

Top skills are disappearing

The #1 most-downloaded skill (311K downloads) and #3 (170.9K) have been removed from ClawHub’s download API. Both were flagged “Suspicious.” When the most popular tool in an ecosystem gets delisted, that’s a signal worth paying attention to.

A third of the Top 50 are “Suspicious”

topclawhubskills.com now shows a Suspicious/OK indicator based on OpenClaw’s behavioral analysis. 17 of 50 skills (34%) carry the Suspicious flag.

Interestingly, one Grade D skill is marked OK despite having eval() in its code — and some Grade A skills are marked Suspicious. The two trust dimensions measure different things. Neither alone tells the full story.


Most Skills Are Still Pure Prompt

CategoryCount%
With code files1837%
Pure prompt (SKILL.md only)3063%

Similar to last week (34/66). The majority of popular skills contain no executable code — just instructions for the AI agent. These are safe from code-level attacks but raise separate questions about prompt injection and claim verification.


Risk Pattern Frequency

RuleHitsSeverityDescription
S-05405HIGHEnvironment variable access
S-07325MEDIUMFile system operations
S-02115CRITICALSystem command execution
S-0443HIGHExternal HTTP communication
S-011CRITICALDynamic code execution (eval)

Environment variable access (S-05) overtook file I/O (S-07) as the most common pattern. The 116 CRITICAL hits are entirely from the two Grade D skills.


Skills with Findings

SkillGradeFindingsDownloadsStatus
self-improving-agentDELISTED311KSuspicious
agent-browserDELISTED170.9KSuspicious
nano-banana-proB167.7KOK
openclaw-tavily-searchB158.2KSuspicious
polymarket-tradeC1947.6KSuspicious
brave-searchC341.3KSuspicious
elite-longterm-memoryB838.9KSuspicious
stock-analysisC638.4KSuspicious
evolverD65338.0KSuspicious
feishu-evolver-wrapperD19132.9KOK
imap-smtp-emailB729.9KOK

Author Concentration

One author (@steipete) maintains 18 of the Top 50 — all graded A or B. This is both a quality signal (consistent security hygiene) and a structural risk (36% of popular tools depend on one maintainer).


What This Means

Three things stand out:

  1. The clean core is shrinking. Grade A dropped from 88% to 78%. The first CRITICAL findings and delistings mark a phase transition — the ecosystem is no longer uniformly safe at the top.

  2. Trust requires multiple layers. V(g) catches code patterns. OpenClaw’s scanner catches behavioral inconsistencies. VirusTotal catches known malware. Each misses what the others find. A skill can be Grade D (V(g)) and OK (OpenClaw) simultaneously — or Grade A and Suspicious.

  3. Growth amplifies risk. ~3× download growth in one week means more users are exposed to skills of unknown quality. The 311K-download #1 skill being delisted after the fact means hundreds of thousands of installs occurred before the problem was caught.

V(g) is one trust layer. The ecosystem needs them all working together.


Try It

Scan any skill or Gene with one command:

Terminal window
npx @rotifer/playground vg <path>

Badge your repo: rotifer.ai/badge

Full scanner docs: rotifer.dev/docs/cli/vg


Report by Rotifer Protocol. Data, methodology, and scanner are open source. Full JSON data available in the report repository.