/ FEATURES

Every signal, every category,
every source.

139 checks across 8 categories — plus 28 schema types and 15 AI crawlers. Every check is backed by a real source — Google's own documentation, not our opinion — and re-checked every 90 days so the advice never goes stale. Most tools won't tell you why they flag something. We show you, every time.

Run a free scan
Public methodology repo lands in v1.1

/ WHAT IT DOES

The audit, in twelve headlines.

Find blocked AI crawlers	We test your robots.txt against 15 named AI crawlers. Most sites accidentally block the ones they most want reading them.
Validate (or generate) your llms.txt	The emerging standard that tells AI assistants what is citable. We flag what is missing — and generate a ready-to-ship file if you do not have one.
Catch deprecated schema	Some schema types have lost rich-result eligibility or are sector-restricted. We flag the ones that no longer earn results — most validators only check that the JSON parses.
Validate the schema graph	We validate the JSON-LD entity graph, not just individual fields — catching structural defects a field-by-field validator never sees.
Measure real Core Web Vitals	Real-user Core Web Vitals — the numbers Google actually ranks on — not just lab estimates.
Benchmark against your sector	Your score against the median and top quartile of real sites in your sector — not a number in a vacuum.
Audit a whole site, not one page	Deep Audit samples across your whole site and flags "fix once, lift every page" template bugs.
Run a real accessibility audit	A real WCAG accessibility audit — critical, serious and moderate issues — on every rendered scan.
Check security + TLS	TLS, security headers, mixed content and stack disclosure — the hygiene AI crawlers and users both notice.
Detect agent-auth readiness	We check /.well-known/agent-auth, an emerging standard for authenticating AI agents that act on your behalf. Adoption is early, so declaring it puts you ahead of most of the web.
Get a priority-grouped fix roadmap	Every issue is bucketed by priority — today / this week / this quarter — with copy-paste fix code. You see exactly what to do first.
See the working, not just the score	Every weight carries a cited source, labelled by strength of evidence and re-reviewed every 90 days. Answer "why does this matter?" in seconds.

Twelve headlines, 139 individual checks. Each category below shows its size and severity mix — the per-check findings and fixes run in your report.

/ SCHEMA MARKUP

Schema markup (30 checks)

28 types audited with sector-aware deprecation rules: some retired, some restricted to certain sectors, some time-boxed. We validate nested required fields, not just presence.
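To illustrate what graph-level validation can catch, here is a minimal Python sketch (not the scanner's actual implementation). It collects every `@id` defined in a JSON-LD `@graph` and reports references that point at a node nobody defines. The function name and sample data are hypothetical.

```python
import json

def find_dangling_refs(jsonld_text):
    """Return sorted @id values that are referenced but never defined."""
    doc = json.loads(jsonld_text)
    nodes = doc.get("@graph", [doc]) if isinstance(doc, dict) else doc
    defined, referenced = set(), set()

    def walk(value, is_top_level):
        if isinstance(value, dict):
            node_id = value.get("@id")
            if node_id is not None:
                # A nested dict holding only "@id" is a pointer to another node.
                if len(value) == 1 and not is_top_level:
                    referenced.add(node_id)
                else:
                    defined.add(node_id)
            for child in value.values():
                walk(child, False)
        elif isinstance(value, list):
            for item in value:
                walk(item, False)

    for node in nodes:
        walk(node, True)
    return sorted(referenced - defined)

sample = json.dumps({
    "@context": "https://schema.org",
    "@graph": [
        {"@type": "Article", "@id": "https://example.com/#article",
         "author": {"@id": "https://example.com/#person"},
         "publisher": {"@id": "https://example.com/#org"}},
        {"@type": "Person", "@id": "https://example.com/#person"},
    ],
})
print(find_dangling_refs(sample))  # ['https://example.com/#org']
```

Every field in the sample is individually valid, so a field-by-field validator would pass it. Only a walk over the whole graph shows that the publisher reference resolves to nothing.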

Severity mix

1 Critical, 1 High, 8 Medium, 18 Low, 2 Advisory

Each check's finding + copy-paste fix runs in your report.

/ HEAD + META SIGNALS

HEAD + meta signals (11 checks)

The 15+ HEAD-level signals Google ranks on and AI assistants read first.

Severity mix

4 High, 4 Medium, 3 Low


/ AI CRAWLER ACCESS + LLMS.TXT

AI crawler access + llms.txt (9 checks)

robots.txt access for 15 named AI crawlers (GPTBot, ClaudeBot, PerplexityBot, etc.), llms.txt validity, llms-full.txt detection, and crawler-blocking severity tiered by 'live citation' vs 'training only'.

Severity mix

1 Critical, 1 High, 6 Low, 1 Advisory


/ CONTENT STRUCTURE + E-E-A-T

Content structure + E-E-A-T (33 checks)

Main-content extraction via Mozilla Readability so we don't count nav chrome. Heading hierarchy, H1 quality, alt-text, reading level, internal links, semantic HTML5, author + date for E-E-A-T.

Severity mix

1 Critical, 3 High, 10 Medium, 19 Low


/ PERFORMANCE + CORE WEB VITALS

Performance + Core Web Vitals (14 checks)

CrUX field data via PageSpeed Insights (the metric Google actually ranks on). Synthetic checks for render-blocking, third-party scripts, preconnect hints. Lighthouse category scores.
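As a hedged sketch of reading field data, the Python below pulls real-user Core Web Vitals out of an already-fetched PageSpeed Insights v5 response. The key names follow the public PSI response format as we understand it, and dividing the CLS percentile by 100 is likewise an assumption about how PSI reports it. Verify both against Google's documentation. The function name and sample values are hypothetical.

```python
def field_vitals(psi_response):
    """Extract (value, category) pairs for CrUX field metrics, if present."""
    metrics = psi_response.get("loadingExperience", {}).get("metrics", {})
    wanted = {
        "LCP_ms": "LARGEST_CONTENTFUL_PAINT_MS",
        "INP_ms": "INTERACTION_TO_NEXT_PAINT",
        "CLS": "CUMULATIVE_LAYOUT_SHIFT_SCORE",
    }
    result = {}
    for label, key in wanted.items():
        entry = metrics.get(key)
        if entry is None:
            continue  # no field data for this metric (low-traffic page)
        value = entry["percentile"]
        if label == "CLS":
            value = value / 100  # assumption: PSI reports CLS multiplied by 100
        result[label] = (value, entry.get("category"))
    return result

sample_response = {
    "loadingExperience": {
        "metrics": {
            "LARGEST_CONTENTFUL_PAINT_MS": {"percentile": 2100, "category": "FAST"},
            "CUMULATIVE_LAYOUT_SHIFT_SCORE": {"percentile": 12, "category": "AVERAGE"},
        }
    }
}
print(field_vitals(sample_response))
```

Metrics missing from the response are skipped rather than guessed, so a page with too little traffic for field data yields an empty result instead of a lab estimate.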

Severity mix

2 High, 4 Medium, 7 Low, 1 Advisory


/ SECURITY + INFRASTRUCTURE

Security + infrastructure (20 checks)

TLS cert expiry + protocol version, security response headers, mixed http:// content, caching strategy, theme-color, PWA manifest, hreflang return-link symmetry.
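Return-link symmetry means that if page A declares page B as an hreflang alternate, B must declare A back. A minimal Python sketch of that check follows, assuming the hreflang annotations have already been crawled into a dictionary. The function name and input shape are hypothetical, not the scanner's own.

```python
def missing_return_links(hreflang_map):
    """hreflang_map: page URL -> set of alternate URLs that page declares.
    Returns sorted (source, target) pairs where target does not link back."""
    missing = []
    for source, alternates in hreflang_map.items():
        for target in alternates:
            if target == source:
                continue  # a self-reference is expected and fine
            if source not in hreflang_map.get(target, set()):
                missing.append((source, target))
    return sorted(missing)

pages = {
    "https://example.com/en/": {"https://example.com/en/", "https://example.com/de/"},
    "https://example.com/de/": {"https://example.com/de/"},
}
print(missing_return_links(pages))
# [('https://example.com/en/', 'https://example.com/de/')]
```

Here the English page points at the German one, but the German page never points back, so the pair is reported.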

Severity mix

2 Critical, 1 High, 2 Medium, 15 Low


/ URL HYGIENE + ERRORS

URL hygiene + errors (14 checks)

Soft-404 probes, canonical round-trip validation, redirect chains, pagination markup, AMP awareness, tracking-param canonicals, mailto/tel hygiene, accessibility (skip links).
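A redirect-chain check can be sketched as below, assuming the redirects have already been observed and stored as a URL-to-target mapping. This is an illustration only. The function name, the `max_hops` limit, and the status labels are hypothetical.

```python
def redirect_chain(start_url, redirects, max_hops=10):
    """Follow a url -> target mapping. Returns (chain, status)."""
    chain = [start_url]
    seen = {start_url}
    current = start_url
    while current in redirects:
        current = redirects[current]
        chain.append(current)
        if current in seen:
            return chain, "loop"  # we came back to a URL already visited
        seen.add(current)
        if len(chain) - 1 > max_hops:
            return chain, "too_long"
    return chain, "ok"

observed = {
    "http://example.com/": "https://example.com/",
    "https://example.com/": "https://www.example.com/",
}
chain, status = redirect_chain("http://example.com/", observed)
print(len(chain) - 1, "hops,", status)  # 2 hops, ok
```

A two-hop chain like this one still resolves, but each extra hop costs a round trip, which is why chains are worth flagging even when they end on a valid page.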

Severity mix

3 High, 3 Medium, 7 Low, 1 Advisory


/ SCHEMA TYPES

Schema types we recognise (28 types)

Beyond presence detection, each type has nested required-field validation and sector-aware deprecation rules — we track which still earn rich results. Most validators only check that the JSON parses. Per-type status (active / deprecated / restricted) is in your report.

AggregateRating
Article
Book
BreadcrumbList
ClaimReview
Course
Dataset
EmployerAggregateRating
Event
FAQPage
FinancialProduct
HowTo
ImageObject
JobPosting
LocalBusiness
Movie
Organization
Person
PodcastEpisode
PodcastSeries
Product
QAPage
Recipe
Review
Service
SoftwareApplication
SpecialAnnouncement
VideoObject

/ AI CRAWLERS

AI crawlers we check robots.txt access for (15 bots)

We test each User-Agent against your robots.txt using a longest-match-wins parser that handles wildcards, Allow rules, and per-group Crawl-delay. We tier them by whether they fetch live for citations or only for training, and weight a block accordingly — which bot sits in which tier is in your report.

Amazonbot
Applebot-Extended
bingbot
Bytespider
CCBot
ChatGPT-User
Claude-Web
ClaudeBot
Google-Extended
GPTBot
Meta-ExternalAgent
OAI-SearchBot
Perplexity-User
PerplexityBot
xAI-Bot
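The longest-match-wins rule described above can be sketched in Python as follows. This is a simplified illustration, not the scanner's parser: it matches the User-Agent by exact name (real crawlers match on a product token), ignores Crawl-delay, and all function names are hypothetical.

```python
import re

def parse_groups(robots_txt):
    """Parse robots.txt into {user_agent_lower: [(kind, path), ...]}."""
    groups, agents, in_rules = {}, [], False
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments
        if ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            if in_rules:  # a rule line closed the previous group
                agents, in_rules = [], False
            agents.append(value.lower())
            groups.setdefault(value.lower(), [])
        elif field in ("allow", "disallow"):
            in_rules = True
            if value:  # an empty Disallow matches nothing
                for agent in agents:
                    groups[agent].append((field, value))
    return groups

def _pattern_to_regex(path):
    """Translate a robots.txt path ('*' wildcard, trailing '$' anchor)."""
    anchored = path.endswith("$")
    body = path[:-1] if anchored else path
    regex = ".*".join(re.escape(piece) for piece in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))

def is_allowed(robots_txt, user_agent, url_path):
    groups = parse_groups(robots_txt)
    # Use the agent's own group if it exists, otherwise the '*' group.
    rules = groups.get(user_agent.lower(), groups.get("*", []))
    best_len, allowed = -1, True  # no matching rule means allowed
    for kind, path in rules:
        if _pattern_to_regex(path).match(url_path):
            is_allow = kind == "allow"
            # Longest pattern wins; on a tie, Allow beats Disallow.
            if len(path) > best_len or (len(path) == best_len and is_allow):
                best_len, allowed = len(path), is_allow
    return allowed

robots = """
User-agent: *
Disallow: /

User-agent: GPTBot
Disallow: /private/
Allow: /private/press/
Disallow: /*.pdf$
"""
print(is_allowed(robots, "GPTBot", "/private/press/launch.html"))  # True
print(is_allowed(robots, "GPTBot", "/docs/guide.pdf"))             # False
print(is_allowed(robots, "ClaudeBot", "/"))                        # False
```

In the sample, ClaudeBot has no group of its own and so inherits the blanket `Disallow: /` from the `*` group, which is the kind of accidental block a site owner might not notice.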

/ TIERS

What's in each tier

Same scanner under every tier. The differences are scope (one page vs whole site) and cadence (one-time vs ongoing). Studio adds programmatic access.

FEATURE	£5 Single	£39 Deep Audit	£5/mo Watcher	£199/mo Studio
139 signal checks per page
Priority-grouped roadmap (today / week / quarter)
Copy-paste fix code per issue
Peer benchmarks vs sector
Pages audited	1	up to 10	1 weekly	unlimited
Sitemap discovery + clustering
Site-wide common-issue detection				via API
Internal-link graph + orphan detection				via API
Hreflang return-link verification				via API
Sitemap freshness + URL health audit				via API
Per-page drill-down report				via API
Weekly re-scan				via API
Drift alerts (≥3pt or new critical only)
12-month score history dashboard
Free re-scans (7-day window)			always	always
Public REST API (/api/v1/scan)
API key with per-key audit trail
Rate-limit headers (X-RateLimit-*)
Render-mode opt-in (Playwright for SPAs)		always		opt-in
White-label PDFs				soon
Magic-link auth (no password)
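The drift-alert row in the table above ("≥3pt or new critical only") can be read as a simple predicate. A minimal Python sketch follows, assuming that "≥3pt" means a score change of at least three points in either direction and that critical issues are compared by identifier. Both readings, the function name, and the issue ids are hypothetical.

```python
def should_alert(prev_score, new_score, prev_critical_ids, new_critical_ids,
                 threshold=3):
    """Alert when the score moved by >= threshold points (assumed: either
    direction) or when a critical issue appears that was absent before."""
    score_moved = abs(new_score - prev_score) >= threshold
    new_critical = bool(set(new_critical_ids) - set(prev_critical_ids))
    return score_moved or new_critical

print(should_alert(82, 78, [], []))                            # True
print(should_alert(82, 81, ["tls-expired"], ["tls-expired"]))  # False
print(should_alert(82, 81, [], ["robots-blocks-all"]))         # True
```

Filtering on these two conditions keeps weekly re-scans quiet unless something meaningful changed.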

/ METHODOLOGY

Three things nobody else does

Know whether AI search cites your brand — and fix it when it doesn't. Every weight cited and re-reviewed every 90 days. Every artefact below is a real file path in our open repo — click through, read it, fork it.

Cited weights

Every signal weight carries a source URL and EVIDENCE / HEURISTIC / CORPUS-CALIBRATED rationale. When a client asks "why is this -8?" you can answer in seconds. Nobody else shows their work.

Empirical calibration

A growing corpus of 50+ labelled real sites tells us which rules actually predict citation outcomes. Weights tune to the data over time — run `npm run corpus:stats` to reproduce the per-rule correlations.

90-day audit cycle

Every rule has a lastReviewed date. `npm run audit:stale` fails CI when anything crosses 90 days unverified. Google quietly deprecates schemas; bot operators rename UAs — without an audit cycle, your tool rots. Ours can't.
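A staleness gate of this kind can be sketched in a few lines of Python. This is an illustration, not the code behind `npm run audit:stale`. Only the `lastReviewed` field and the three rationale labels come from the text above. The other field names, the rule ids, the URLs, and the reading of "crosses 90 days" as strictly more than 90 days are assumptions.

```python
from datetime import date, timedelta

MAX_AGE = timedelta(days=90)

def stale_rules(rules, today):
    """Return ids of rules whose lastReviewed date is more than 90 days old."""
    return [rule["id"] for rule in rules
            if today - date.fromisoformat(rule["lastReviewed"]) > MAX_AGE]

# Hypothetical rule records: a weight, a cited source, a rationale label.
rules = [
    {"id": "rule-a", "weight": -8, "rationale": "EVIDENCE",
     "source": "https://example.com/docs/a", "lastReviewed": "2025-04-15"},
    {"id": "rule-b", "weight": -3, "rationale": "HEURISTIC",
     "source": "https://example.com/docs/b", "lastReviewed": "2025-01-10"},
]

stale = stale_rules(rules, today=date(2025, 6, 1))
print(stale)  # ['rule-b']
# In CI, a non-empty list would translate to a non-zero exit code.
```

Passing `today` in explicitly keeps the check deterministic, so the same rule file always gives the same result for a given date.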

/ READY?

Run a free scan on your site

See the verdict in 30 seconds. No signup. Full report £5 — or check out Watcher / Deep Audit / Studio if you want more.