Data-driven insights from analyzing 128,000+ repositories and 3.27 billion lines of code. Vulnerability trends, code quality patterns, and actionable intelligence for engineering teams.
RSS FeedWhen we ask an LLM to describe Opus 4.7 repos in its own words, the same 30 patterns come up over and over. This is …
70% of Opus 4.7 repos span multiple languages. Monolingual codebases are the minority.
A deeper look at the test gap in the Opus 4.7 corpus — and why the 11% that do write tests write them well.
59% of Opus 4.7 repos share names with community-scanned repos. Closer look reveals they're the same repos, harvested twice.
Opus 4.7 writes thorough README files but almost never generates a LICENSE file. 81% of repos default to "all rights reserved.
75% of Opus 4.7 repos keep their directory structure under 5 levels deep. The opposite of the "over-engineered enterprise" stereotype.
Nearly half of Opus 4.7 repos exceed 10,000 lines of code. This isn't a demo-project corpus.
Static analysis of 683K files reveals the real stack Claude Opus 4.7 reaches for — with numbers.
The top LLM-flagged risks across Opus 4.7 repos paint a consistent picture — the model writes tests but doesn't wire them up.
1.3 million functions later, the verdict is in: Claude Opus 4.7 favors small, composable units.
How one component library became Claude Opus 4.7's default UI kit.
When we ask an LLM to grade every Opus 4.7 repo, 3 in 4 come back as "high reuse potential.
Our research is based on continuous analysis of 128,000+ repositories and 3.27 billion lines of code using Repobility's proprietary scanning engine.
All data is aggregated and anonymized. No individual repository names or source code is disclosed.
Access our proprietary datasets for your own research, product development, or competitive intelligence.
Browse DatasetsGet our latest research and intelligence reports delivered to your inbox.
No spam. Unsubscribe anytime.