GitHub Stars Don't Predict AI Agent Trust. I Scored 192 to Prove It.

May 31, 2026 · 6 min read · HVTracker Research

Every "best AI agent" list ranks the same way: by GitHub stars. Stars are easy to count and easy to game. They measure attention, not trustworthiness — and for software you're about to give access to your terminal, your CI, and your codebase, attention is the wrong metric.

So I built HVTracker to score AI agents on signals you can actually verify: OSSF Scorecard, build provenance, signed commits, license type, maintenance, and adoption — each weighted by how hard it is to fake. It's a curated and ever-growing registry of the most notable agents (the ones people actually use), refreshed every two hours — not an index of every repo on GitHub.

Then I checked the 30 most-starred agents for the one signal that should be table stakes: build provenance — cryptographic proof that the package you install was built from the source you can read.

24 of the 30 don't publish it.

The list

The 24 most-starred AI agents shipping without build provenance, with their HVTrust rank:

AgentStarsProvenanceHVTrust Rank
OpenClaw376kNone#43
AutoGPT185kNone#53
opencode168kNone#120
Langflow149kNone#47
Dify143kNone#36
LangChain138kNone#26
Claude Code128kNone#82
Firecrawl126kNone#38
Gemini CLI105kNone#34
Browser Use96kNone#49
MCP (servers)87kNone#76
RAGFlow82kNone#62
OpenHands75kNone#33
Daytona73kNone#125
DeerFlow70kNone#51
MetaGPT68kNone#124
Crawl4AI67kNone#85
Open Interpreter64kNone#77
AnythingLLM61kNone#118
PrivateGPT57kNone#122
OpenManus56kNone#164
Flowise53kNone#45
CrewAI52kNone#23
LlamaIndex50kNone#117

That's not an accusation against any one of them. It's a snapshot of an ecosystem that has collectively decided provenance is optional.

Why provenance matters

When you pip install or npm install an agent, you're trusting that the published artifact matches the public source. Provenance attestation (via SLSA / Sigstore) is the cryptographic receipt that proves it. Without it, a compromised build pipeline or a hijacked publish token can ship malicious code under a trusted name — and you'd have no way to tell.

This would matter for any dependency. It matters more for AI agents because they execute code, call APIs, access tools, and increasingly spend money on your behalf. The blast radius of a compromised logging library is data exfiltration. The blast radius of a compromised agent with tool access is "whatever it was authorized to do."

The six that get it right

Credit where it's due. Among the 30 most-starred, these publish provenance — and it's no coincidence they also score near the top:

AgentStarsProvenanceHVTrust Rank
Codex87kVerified#2
n8n190kVerified#6
Cline63kVerified#10
Mem057kVerified#16
Open WebUI139kVerified#19
AutoGen59kVerified#25

Verifiable practices and trust scores move together. Projects that publish provenance also tend to care about signed commits, OSSF Scorecard, and disclosure policies. Trust signals cluster.

Stars vs. trust, head to head

The clearest illustration:

Fewer stars, far higher trust. This isn't a quality judgment on Claude Code — it's an excellent tool. It's proprietary, with no public OSSF Scorecard and no published provenance, so there's simply less that's externally verifiable. That's the point: popularity and verifiability are different axes, and stars only measure the first one.

Not a safety verdict. Missing provenance doesn't mean a project is unsafe. It means an independent observer can't verify the build pipeline from public data. That's a gap, not an accusation.

How we score (and how to argue with it)

Every score is built from public signals, weighted by how hard they are to fake, then scaled by an evidence-confidence factor so a tool with little verifiable evidence can't bluff its way to the top. The full methodology is public, and the entire dataset is free to use under CC BY 4.0.

If you think a weight is wrong, tell us — the methodology is meant to be argued with, in the open.

See where your stack ranks

Provenance, OSSF Scorecard, signed commits, license type, and more — across hundreds of notable AI agents. Refreshed every 2 hours.

Browse the trust registry

Data from HVTracker signals as of May 31, 2026. Provenance is checked via npm registry attestations and PyPI PEP 740 metadata. Full methodology.