Technical SEO AI Search Signal vs. Noise

The Fundamental Flaw of LLMs.txt

Publishers are generating LLMs.txt files to get their sites surfaced by AI. Google's John Mueller explains why that was never what the file does — and why it can't.

Source: Google Search Off The Record, Ep. 111 — Mueller & Splitt via Search Engine Journal · Jun 2026

01 Where ranking actually begins

Search runs on a 5-stage pipeline. LLMs.txt touches none of it.

Every page that ranks passes through the same sequence. It starts with Discovery — the engine learning the URL exists. Skip that first box and the page is invisible, no matter what else you optimize.

STAGE 01

Discovery

Engine learns the URL exists

→

Crawling

Download & parse

→

Indexing

Structure & store

→

Ranking

Order by relevance

→

Serving

Show in results

✕ NO VALID CONNECTION TO THE PIPELINE

LLMs.txt Discovery is not part of the proposed LLMs.txt standard. The file plugs into none of the five stages — so it can't be the on-ramp people are treating it as.

Reality → Without Discovery, a page never gets queued for crawling, indexing, or ranking. The effort spent on LLMs.txt for visibility is aimed at a door that doesn't open onto the pipeline.

02 What it was actually built for

The file does have a purpose — just not the one it's being used for.

Mueller spoke with one of the proposal's creators. The intent was narrow, and it was never about getting found.

The myth

A shortcut to AI discovery & ranking

Site owners publish LLMs.txt hoping AI systems will find, trust, and surface their content. This use case conflicts with the file's design — it was never meant to drive discovery.

The reality

A map for systems already on your site

If an AI already knows your site and wants to see what else is there, LLMs.txt can point the way. It's an internal index for the known — not a beacon for the unknown.

03 The trust problem

It's self-reported — so an AI can't lean on it to compare sites.

LLMs.txt is the site owner describing their own content, which may not match the actual HTML. Every site claims to be the best, so the signal carries no weight for differentiation.

“

By design, an LLM can't trust a file in which a site simply declares it has the best pages and you must buy its products.

PARAPHRASED — John Mueller, Google Search Relations

04 The better fit for agents

LLMs.txt vs. WebMCP

If AI agents become how users interact with sites, the useful standard is one that lets them act — not one that merely describes. Neither, however, gets you discovered.

CAPABILITY

LLMs.txtdescribes content

WebMCPenables action

Helps AI discover your site

✕ No

Lets an agent search & filter products

✕ No

✓ Yes

Lets an agent compare & add to cart

✕ No

✓ Yes

Natural fit for ecommerce

✕ No

✓ Yes

05 Get started with WebMCP

Test the agentic layer today — and prep your site to be callable.

WebMCP is a proposed web standard (built by Google's Chrome team, incubated through the W3C, with the goal of any agentic browser implementing it). It lets a page expose its actions as structured tools an AI agent can call — turning your site into an API agents use without you maintaining a separate one. It supports tool discovery, JSON Schema for inputs/outputs, and shared page state. It's in origin trial from Chrome 149, and available behind a local flag for development.

What you need

01Chrome with WebMCP — the Chrome 149+ origin trial for live users, or the local-dev flag for testing.
02Local dev: set chrome://flags/#enable-webmcp-testing to Enabled, then relaunch.
03Model Context Tool Inspector extension to register, call, and verify a page's tools.
04An origin-isolated document — WebMCP is disabled if document.domain / Origin-Agent-Cluster: ?0 is set.
05The tools permissions policy — defaults to self; cross-origin iframes need allow="tools".
06An open tab — no headless support; a visible browsing context is required to call tools.

Try it locally in 4 steps

Open Chrome

Use a recent Chrome build for local testing.

Enable the flag

Set the WebMCP testing flag to Enabled.

Relaunch

Restart Chrome so the flag takes effect.

Inspect tools

Add the Tool Inspector; prompt it in natural language.

Shipping to real users? Register for the Chrome 149 origin trial instead of the flag.

Two ways to make your site agent-ready

Declarative API · HTML

Tag your existing forms

Add toolname and tooldescription attributes to a clean HTML form. The browser auto-generates the schema and fills the fields when an agent calls the tool.

Effort: Low — clean forms are ~80% of the work

Imperative API · JavaScript

Use navigator.modelContext to register a tool with a name, description, input schema, and execute function. Tools appear and disappear with page state (e.g. checkout only when the cart has items).

Effort: Higher — for dynamic, complex flows

Heads up Origin-trial stage — the spec is under active discussion and subject to change. Two limits to plan around: there's no headless support (a tab must be open), and tool discoverability means an agent has to visit your site directly to find its tools. So WebMCP still doesn't get you discovered — agents reach your brand through HTML and AI visibility first. The form & markup hygiene you build now carries forward regardless of how the standard settles.

06 The bottom line

Discovery and ranking are still bound to HTML.

/ 01

Don't ship LLMs.txt for visibility. It plays no role in how AI or search finds your pages.

/ 02

Keep optimizing the HTML. Discovery, crawling, and ranking all happen at the page level.

/ 03

Watch WebMCP for ecommerce. Agentic standards are still unsettled — none has won yet.

Your move → build for the HTML layer that AI actually reads, not the file it ignores.

Sources: Search Engine Journal (LLMs.txt) · Chrome for Developers (WebMCP docs) · Semrush (WebMCP guide) Riman Agency

The Fundamental Flaw of LLMs.txt

Search runs on a 5-stage pipeline. LLMs.txt touches none of it.

The file does have a purpose — just not the one it's being used for.

It's self-reported — so an AI can't lean on it to compare sites.

LLMs.txt vs. WebMCP

Test the agentic layer today — and prep your site to be callable.

Discovery and ranking are still bound to HTML.

Services & Calculators

Main Pages

Top Services

Top Marketing Calculators

SaaS, Ads & Finance

SaaS & Subscription

Paid Ads & Email

Finance & Pricing

Content, Social & SEO

Content & Copy

Schema & SEO

Popular Articles