RANK ↑ 30D
+847%
case studies

only 5 domains link to every major AI agent framework (and the shelf is still being built)

we pulled the backlinks of 8 AI agent frameworks from common crawl. only 5 domains link to all of them, and they are not tech media. the niche's link graph is dev platforms plus a brand-new crop of agent directories you can still get into.

pete the seo wizard
July 2, 2026 · 8 min read · 1,500 words
sharexlinkedin

same experiment, third niche: take a category's biggest players, pull every domain that links to them from common crawl, and keep the ones that link to the whole set. SEO tools gave 37 universal linkers, nearly all media. CRMs gave 2, both integration hubs. AI agent frameworks just gave us 5, and they are a different animal again: two developer publishing platforms, one data science publisher, and two product sites. the niche that everyone is writing about has almost no press in its link graph.

how we pulled this

eight frameworks: langchain, llamaindex, crewai, dify, flowise, langflow, autogpt, and superagi. for each we pulled its referring domains ranked by cg_authority from the common crawl webgraph, then intersected the eight lists. the per-domain numbers:

referring domainscg authority
langchain.com5,94964
superagi.com2,04046
llamaindex.ai1,99949
crewai.com1,48946
dify.ai1,33146
flowiseai.com68529
langflow.org45028
agpt.co (autogpt)45028

one number up front: langchain has 3 to 13 times the referring domains of everything else in the category. 77 percent of all linkers link to exactly one framework, and most of the time that one framework is langchain. in a niche this young, editorial attention has already monopolized.

the overlap pyramid

across the eight frameworks there were 7,508 unique linking domains. the distribution:

linking domainsnon-platform
link to all 855
link to 71817
link to 63937
link to 57068
link to 4144141
link to 3384383
link to 21,0611,059
link to just 15,7875,782

among domains linking to four or more frameworks, 97 percent are real editorial or product sites. the platform noise you would expect from a crawl-scale dataset is almost absent once you filter the obvious CDNs.

the five, and what they tell you

the five domains that link to all eight are github.com, substack.com, analyticsvidhya.com, justcall.io, and klavis.ai. github and substack are where this niche publishes: the frameworks are open source, and the people who cover them write newsletters, not magazine features. analyticsvidhya is the one classic publisher, a data science education site. widen to the seven-of-eight set and the picture sharpens:

text
domains linking to 7 of the 8 frameworks, by cg authority
(github, substack, analyticsvidhya, justcall and klavis link to all 8):

dev.to (7) 62               clickup.com (7) 58
n8n.io (7) 57               akka.io (7) 57
luma.com (7) 57             arize.com (7) 43
aiagentsdirectory.com (7) 39  zenml.io (7) 38
thetoolnerd.com (7) 34      productschool.com (7) 30
latenode.com (7) 21         agenthunter.io (7) 21
everydev.ai (7) 20          potpie.ai (7) 14

read the names. dev.to, qiita and zenn (both in the six-linker set) are developer publishing platforms. n8n and latenode are automation platforms. arize, zenml and langfuse are ML tooling that integrates with everything. and then there is the interesting layer: aiagentsdirectory.com, agenthunter.io, everydev.ai, findmyagentai.com, bestaiagents.ai, aiagentslive.com. a whole crop of AI agent directories that did not exist two years ago, most with single-digit-to-low authority scores.

the shape of the niche is the strategy

three niches, three shapes, three different plays:

link to all 8who they are
SEO tools37trade media + roundups
CRMs2integration hubs
AI agent frameworks5dev platforms + young directories

for SEO tools the play is pitching writers, because the category shelf is media. for CRMs it is building integrations, because the shelf is automation hubs. for AI agents the shelf is developer platforms plus directories that are still forming. the play is to publish where developers already read (github, substack, dev.to, qiita) and to get listed in the agent directories while they are young. a directory with authority 20 today is not impressive. a directory that ends up being the geekflare of AI agents in three years is, and you got in when it took an email.

livetry it on your own site

run this query against your domain - free

first 5 backlinks free. no signup required.

https://

the newcomer proof

langflow and autogpt each have 450 referring domains, a fraction of langchain's 5,949. yet both are reached by the same directories and dev platforms that link to the whole category. you do not out-publish langchain at this point. you get onto the shelf that the category linkers are building, and the shelf treats you the same as the giant.

how to do this for your own niche

the method is category-agnostic. pick your two or three closest competitors, pull their referring domains, and keep the ones that link to several of them but not to you, sorted by overlap and then authority. that is exactly what a backlink gap analysis does, free, on the same common crawl data we used here. it tells you which of the three shapes your niche is, and hands you the list of who to approach.

the full data

the 59 domains that link to six or more of these frameworks are free to download as CSV or JSON. see also the SEO-tool link leaderboard and the CRM study for the contrasting shapes.

ahrefs · backlinkslocked
upgrade required · $129/mo
crawlgraph · live $99 once
G
github.io92
C
css-tricks.com88
L
lobste.rs86
A
algolia.com84
W
web.dev80
same data · one-time
$99$129/moonce
unlock the data →
stripe checkout · instant access
sharexlinkedin
pete the seo wizard
author

writes the queries we run internally. ships one tactical post a week.

keep reading
RANK ↑ 30D
+847%
case studies
case studies8 min read

only 2 domains link to every major CRM (and that is the lesson)

we pulled the backlinks of the 8 biggest CRMs from common crawl. only 2 domains link to all of them, both integration hubs. compare that to SEO tools, where 37 did. the shape of a niche's link graph tells you the strategy.

pete the seo wizardJul 1
RANK ↑ 30D
+847%
case studies
case studies9 min read

the 37 domains that link to every major SEO tool

we pulled the backlinks of 8 major seo tools from common crawl. 37 domains link to all of them, and the list is the seo trade press plus the 'best seo tools' roundups. here is the playbook that falls out of it.

pete the seo wizardJun 30
RANK ↑ 30D
+847%
case studies
case studies8 min read

only 6 domains link to every major email marketing tool

we ran the same backlink-overlap study on the 8 biggest email tools. the overlap almost vanished - only 6 domains link to all of them. that collapse is itself the most useful finding.

pete the seo wizardJun 5
the dispatch
one post a week.

+ a free domain audit when you sign up.