The Fundamental Flaw of LLMs.txt

Technical SEO AI Search Signal vs. Noise

The Fundamental Flaw of LLMs.txt

Publishers are generating LLMs.txt files to get their sites surfaced by AI. Google's John Mueller explains why that was never what the file does — and why it can't.

Source: Google Search Off The Record, Ep. 111 — Mueller & Splitt via Search Engine Journal · Jun 2026
01 Where ranking actually begins

Search runs on a 5-stage pipeline. LLMs.txt touches none of it.

Every page that ranks passes through the same sequence. It starts with Discovery — the engine learning the URL exists. Skip that first box and the page is invisible, no matter what else you optimize.

STAGE 01
Discovery
Engine learns the URL exists
02
Crawling
Download & parse
03
Indexing
Structure & store
04
Ranking
Order by relevance
05
Serving
Show in results
NO VALID CONNECTION TO THE PIPELINE
LLMs.txt Discovery is not part of the proposed LLMs.txt standard. The file plugs into none of the five stages — so it can't be the on-ramp people are treating it as.
Reality → Without Discovery, a page never gets queued for crawling, indexing, or ranking. The effort spent on LLMs.txt for visibility is aimed at a door that doesn't open onto the pipeline.
02 What it was actually built for

The file does have a purpose — just not the one it's being used for.

Mueller spoke with one of the proposal's creators. The intent was narrow, and it was never about getting found.

The myth
A shortcut to AI discovery & ranking

Site owners publish LLMs.txt hoping AI systems will find, trust, and surface their content. This use case conflicts with the file's design — it was never meant to drive discovery.

The reality
A map for systems already on your site

If an AI already knows your site and wants to see what else is there, LLMs.txt can point the way. It's an internal index for the known — not a beacon for the unknown.

03 The trust problem

It's self-reported — so an AI can't lean on it to compare sites.

LLMs.txt is the site owner describing their own content, which may not match the actual HTML. Every site claims to be the best, so the signal carries no weight for differentiation.

By design, an LLM can't trust a file in which a site simply declares it has the best pages and you must buy its products.
PARAPHRASED — John Mueller, Google Search Relations
04 The better fit for agents

LLMs.txt vs. WebMCP

If AI agents become how users interact with sites, the useful standard is one that lets them act — not one that merely describes. Neither, however, gets you discovered.

CAPABILITY
LLMs.txtdescribes content
WebMCPenables action
Helps AI discover your site
No
No
Lets an agent search & filter products
No
Yes
Lets an agent compare & add to cart
No
Yes
Natural fit for ecommerce
No
Yes
05 Get started with WebMCP

Test the agentic layer today — and prep your site to be callable.

WebMCP is a proposed web standard (built by Google's Chrome team, incubated through the W3C, with the goal of any agentic browser implementing it). It lets a page expose its actions as structured tools an AI agent can call — turning your site into an API agents use without you maintaining a separate one. It supports tool discovery, JSON Schema for inputs/outputs, and shared page state. It's in origin trial from Chrome 149, and available behind a local flag for development.

What you need
  • 01Chrome with WebMCP — the Chrome 149+ origin trial for live users, or the local-dev flag for testing.
  • 02Local dev: set chrome://flags/#enable-webmcp-testing to Enabled, then relaunch.
  • 03Model Context Tool Inspector extension to register, call, and verify a page's tools.
  • 04An origin-isolated document — WebMCP is disabled if document.domain / Origin-Agent-Cluster: ?0 is set.
  • 05The tools permissions policy — defaults to self; cross-origin iframes need allow="tools".
  • 06An open tab — no headless support; a visible browsing context is required to call tools.
Try it locally in 4 steps
1
Open Chrome

Use a recent Chrome build for local testing.

2
Enable the flag

Set the WebMCP testing flag to Enabled.

3
Relaunch

Restart Chrome so the flag takes effect.

4
Inspect tools

Add the Tool Inspector; prompt it in natural language.

Shipping to real users? Register for the Chrome 149 origin trial instead of the flag.
Two ways to make your site agent-ready
Declarative API · HTML
Tag your existing forms

Add toolname and tooldescription attributes to a clean HTML form. The browser auto-generates the schema and fills the fields when an agent calls the tool.

Effort: Low — clean forms are ~80% of the work
Imperative API · JavaScript
Register tools in code

Use navigator.modelContext to register a tool with a name, description, input schema, and execute function. Tools appear and disappear with page state (e.g. checkout only when the cart has items).

Effort: Higher — for dynamic, complex flows
Heads up Origin-trial stage — the spec is under active discussion and subject to change. Two limits to plan around: there's no headless support (a tab must be open), and tool discoverability means an agent has to visit your site directly to find its tools. So WebMCP still doesn't get you discovered — agents reach your brand through HTML and AI visibility first. The form & markup hygiene you build now carries forward regardless of how the standard settles.
06 The bottom line

Discovery and ranking are still bound to HTML.

/ 01

Don't ship LLMs.txt for visibility. It plays no role in how AI or search finds your pages.

/ 02

Keep optimizing the HTML. Discovery, crawling, and ranking all happen at the page level.

/ 03

Watch WebMCP for ecommerce. Agentic standards are still unsettled — none has won yet.

Your move build for the HTML layer that AI actually reads, not the file it ignores.
Sources: Search Engine Journal (LLMs.txt) · Chrome for Developers (WebMCP docs) · Semrush (WebMCP guide) Riman Agency
Skip to Accessibility Panel