139 checks across 8 categories — plus 28 schema types and 15 AI crawlers. Every check is backed by a real source — Google's own documentation, not our opinion — and re-checked every 90 days so the advice never goes stale. Most tools won't tell you why they flag something. We show you, every time.
We test your robots.txt against 15 named AI crawlers. Most sites accidentally block the ones they most want reading them.
Validate (or generate) your llms.txt
The emerging standard that tells AI assistants what is citable. We flag what is missing — and generate a ready-to-ship file if you do not have one.
Catch deprecated schema
Some schema types have lost rich-result eligibility or are sector-restricted. We flag the ones that no longer earn results — most validators only check that the JSON parses.
Validate the schema graph
We validate the JSON-LD entity graph, not just individual fields — catching structural defects a field-by-field validator never sees.
Measure real Core Web Vitals
Real-user Core Web Vitals — the numbers Google actually ranks on — not just lab estimates.
Benchmark against your sector
Your score against the median and top quartile of real sites in your sector — not a number in a vacuum.
Audit a whole site, not one page
Deep Audit samples across your whole site and flags "fix once, lift every page" template bugs.
Run a real accessibility audit
A real WCAG accessibility audit — critical, serious and moderate issues — on every rendered scan.
Check security + TLS
TLS, security headers, mixed content and stack disclosure — the hygiene AI crawlers and users both notice.
Detect agent-auth readiness
We check /.well-known/agent-auth — the emerging standard for authenticating AI agents that act on your behalf. Adoption is early; declaring it puts you ahead of 99% of the web.
Get a priority-grouped fix roadmap
Every issue is bucketed by priority — today / this week / this quarter — with copy-paste fix code. You see exactly what to do first.
See the working, not just the score
Every weight carries a cited source, labelled by strength of evidence and re-reviewed every 90 days. Answer "why does this matter?" in seconds.
Twelve headlines, 139 individual checks. Each category below shows its size and severity mix — the per-check findings and fixes run in your report.
/ SCHEMA MARKUP
Schema markup30 checks
29 types audited with sector-aware deprecation rules — some retired, some restricted to certain sectors, some time-boxed. We validate nested required fields, not just presence.
Severity mix
1Critical1High8Medium18Low2Advisory
Each check's finding + copy-paste fix runs in your report.
/ HEAD + META SIGNALS
HEAD + meta signals11 checks
The 15+ HEAD-level signals Google ranks on and AI assistants read first.
Severity mix
4High4Medium3Low
Each check's finding + copy-paste fix runs in your report.
/ OPEN GRAPH + SOCIAL
Open Graph + social8 checks
Card metadata, image reachability, dimension hints, brand-suffix-tolerant title consistency.
Severity mix
1Medium7Low
Each check's finding + copy-paste fix runs in your report.
/ AI CRAWLER ACCESS + LLMS.TXT
AI crawler access + llms.txt9 checks
robots.txt access for 15 named AI crawlers (GPTBot, ClaudeBot, PerplexityBot, etc.), llms.txt validity, llms-full.txt detection, and crawler-blocking severity tiered by 'live citation' vs 'training only'.
Severity mix
1Critical1High6Low1Advisory
Each check's finding + copy-paste fix runs in your report.
/ CONTENT STRUCTURE + E-E-A-T
Content structure + E-E-A-T33 checks
Main-content extraction via Mozilla Readability so we don't count nav chrome. Heading hierarchy, H1 quality, alt-text, reading level, internal links, semantic HTML5, author + date for E-E-A-T.
Severity mix
1Critical3High10Medium19Low
Each check's finding + copy-paste fix runs in your report.
/ PERFORMANCE + CORE WEB VITALS
Performance + Core Web Vitals14 checks
CrUX field data via PageSpeed Insights (the metric Google actually ranks on). Synthetic checks for render-blocking, third-party scripts, preconnect hints. Lighthouse category scores.
Severity mix
2High4Medium7Low1Advisory
Each check's finding + copy-paste fix runs in your report.
Each check's finding + copy-paste fix runs in your report.
/ SCHEMA TYPES
Schema types we recognise28 types
Beyond presence detection, each type has nested required-field validation and sector-aware deprecation rules — we track which still earn rich results. Most validators only check that the JSON parses. Per-type status (active / deprecated / restricted) is in your report.
AggregateRating
Article
Book
BreadcrumbList
ClaimReview
Course
Dataset
EmployerAggregateRating
Event
FAQPage
FinancialProduct
HowTo
ImageObject
JobPosting
LocalBusiness
Movie
Organization
Person
PodcastEpisode
PodcastSeries
Product
QAPage
Recipe
Review
Service
SoftwareApplication
SpecialAnnouncement
VideoObject
/ AI CRAWLERS
AI crawlers we check robots.txt access for15 bots
We test each User-Agent against your robots.txt using a longest-match-wins parser that handles wildcards, Allow rules, and per-group Crawl-delay. We tier them by whether they fetch live for citations or only for training, and weight a block accordingly — which bot sits in which tier is in your report.
Amazonbot
Applebot-Extended
bingbot
Bytespider
CCBot
ChatGPT-User
Claude-Web
ClaudeBot
Google-Extended
GPTBot
Meta-ExternalAgent
OAI-SearchBot
Perplexity-User
PerplexityBot
xAI-Bot
/ TIERS
What's in each tier
Same scanner under every tier. The differences are scope (one page vs whole site) and cadence (one-time vs ongoing). Studio adds programmatic access.
FEATURE
£5 Single
£39 Deep Audit
£5/mo Watcher
£199/mo Studio
139 signal checks per page
Priority-grouped roadmap (today / week / quarter)
Copy-paste fix code per issue
Peer benchmarks vs sector
Pages audited
1
up to 10
1 weekly
unlimited
Sitemap discovery + clustering
Site-wide common-issue detection
via API
Internal-link graph + orphan detection
via API
Hreflang return-link verification
via API
Sitemap freshness + URL health audit
via API
Per-page drill-down report
via API
Weekly re-scan
via API
Drift alerts (≥3pt or new critical only)
12-month score history dashboard
Free re-scans (7-day window)
always
always
Public REST API (/api/v1/scan)
API key with per-key audit trail
Rate-limit headers (X-RateLimit-*)
Render-mode opt-in (Playwright for SPAs)
always
opt-in
White-label PDFs
soon
Magic-link auth (no password)
/ METHODOLOGY
Three things nobody else does
Know whether AI search cites your brand — and fix it when it doesn't. Every weight cited and re-reviewed every 90 days. Every artefact below is a real file path in our open repo — click through, read it, fork it.
Cited weights
Every signal weight carries a source URL and EVIDENCE / HEURISTIC / CORPUS-CALIBRATED rationale. When a client asks "why is this -8?" you can answer in seconds. Nobody else shows their work.
Empirical calibration
A growing corpus of 50+ labelled real sites tells us which rules actually predict citation outcomes. Weights tune to the data over time — run `npm run corpus:stats` to reproduce the per-rule correlations.
90-day audit cycle
Every rule has a lastReviewed date. `npm run audit:stale` fails CI when anything crosses 90 days unverified. Google quietly deprecates schemas; bot operators rename UAs — without an audit cycle, your tool rots. Ours can't.
/ READY?
Run a free scan on your site
See the verdict in 30 seconds. No signup. Full report £5 — or check out Watcher / Deep Audit / Studio if you want more.
/ OPEN GRAPH + SOCIAL
Open Graph + social8 checks
Card metadata, image reachability, dimension hints, brand-suffix-tolerant title consistency.
Severity mix
Each check's finding + copy-paste fix runs in your report.