QBiz Leads AI

llms.txt Explained: What It Is, and Whether a Small Business Actually Needs One

You have probably seen the term in a newsletter or a LinkedIn post, usually wrapped in a warning: add an llms.txt file to your website or AI search will leave you behind. One vendor guide goes as far as calling it "the existential risk of ignoring llms.txt".

If you run an electrical firm, a small law practice or a removals company, that lands somewhere between confusing and alarming. You have a website. Now there is apparently a file you are missing, and missing it might make you "functionally invisible" to ChatGPT.

This guide gives you the straight version. We will explain what llms.txt actually is in plain terms, look at what the independent research says about whether it does anything, separate the genuine reasons to add one from the marketing fear, and finish with a clear verdict for a small business. No hype, no "you must act now". Just the evidence and a sensible decision.

The short answer, if you only read one line: llms.txt is harmless, quick to make, and almost certainly not the thing standing between you and an AI recommendation. The work that actually gets you cited sits elsewhere, and we will point you to it.

What llms.txt actually is

llms.txt is a single plain-text file that sits at the root of your website, at an address like yourbusiness.co.uk/llms.txt. It is written in Markdown, the same simple formatting used for plenty of online documents, and it gives an AI system a short, tidy map of your site: who you are, what you offer, and links to your most useful pages with a line describing each one.

The idea was proposed by Jeremy Howard, co-founder of Answer.AI and fast.ai, in September 2024 (via answer.ai, 3 September 2024). The reasoning behind it is genuinely sensible. As the proposal puts it, language models "face a critical limitation: context windows are too small to handle most websites in their entirety", and "converting complex HTML pages with navigation, ads, and JavaScript into LLM-friendly plain text is both difficult and imprecise" (llmstxt.org). In other words, a normal web page is cluttered with menus, cookie banners and scripts that an AI has to wade through to find the actual content. llms.txt hands it a clean summary instead.

It helps to know what the original proposal was really aimed at. The first examples were software documentation: the FastHTML coding library used it so AI coding assistants could find programming references cleanly (llmstxt.org). That origin matters, because a developer-tools file and a plumber's five-page website are very different problems, and a lot of the breathless advice quietly forgets the difference.

How it differs from files you may already know

If you have ever dealt with a website, you may have heard of two older files. A quick comparison clears up where llms.txt fits.

File What it does Age
robots.txt Tells search crawlers which pages they may or may not access. It controls permission. A web standard since 1994
sitemap.xml Lists all your pages so search engines can find them. It controls discovery. Long-established
llms.txt Offers AI tools a curated summary of your best content. It suggests what is worth reading. Proposed September 2024

The honest framing: robots.txt is a lock on the door, a sitemap is a list of every room, and llms.txt is a one-page note saying "here are the three rooms worth your time". The first two are near-universal and respected by every major engine. The third is new, optional, and, as we are about to see, not yet doing much.

What the evidence actually says

This is where the honest version parts company with the scare stories. There is real research on whether llms.txt changes anything, and it is worth more than any confident assertion in a sales email.

The large independent study: no measurable effect

The most useful study comes from SE Ranking, an SEO platform with no llms.txt product to sell. In November 2025 they analysed nearly 300,000 domains. Two findings stand out.

First, adoption is thin. "Out of the nearly 300,000 domains we analyzed, only 10.13% had an LLMs.txt file in place," they reported, which means "almost 9 out of 10 sites haven't adopted it yet" (SE Ranking, November 2025).

Second, and more important, they tested whether having the file made a site more likely to be cited by AI. They built a machine-learning model and checked. Their conclusion was blunt: "when we removed the LLMs.txt factor, the model's predictions actually improved. It became more accurate and reliable on test data." Having the file "didn't make a domain more likely to be cited by AI models. In fact, the model performed better without it" (SE Ranking, November 2025).

Read that twice, because it is the heart of the matter. An independent analysis of 300,000 sites found no positive link between having llms.txt and being cited by AI. The file was statistical noise, not a signal.

What the major AI platforms actually do

The platforms back this up. Google published its own guidance on succeeding in AI search in May 2025. It lists what matters: "unique, non-commodity content", a good page experience, making sure crawlers "can find them, crawl them, index them", and structured data that matches your visible content (Google Search Central, May 2025). The whole document is the same advice Google has given for years. It does not mention llms.txt at all.

SE Ranking's research notes the broader picture across platforms: there is "no clear evidence that major AI platforms are actively using LLMs.txt in their data pipelines", and while "GPTBot sometimes fetches LLMs.txt files", "this doesn't happen often" (SE Ranking, November 2025). One real exception is worth naming: Anthropic recommends llms.txt for sites that want to be cited cleanly by its Claude model. So it is not useless everywhere. It is just nowhere near the universal must-have it is sold as.

The vendor study that says it helps (read the small print)

In fairness, there is research pointing the other way, and an honest guide includes it. CiterLabs, a firm that runs "GEO" (generative engine optimisation) services, tracked 1,000 well-known B2B SaaS sites. It found that sites adopting a well-structured llms.txt see "15-40% higher citation rates within 60 days" (CiterLabs, April 2026).

That sounds impressive, so apply two filters. First, CiterLabs sells the service that this finding promotes, so treat it as vendor research with a commercial interest, not neutral fact. Second, and more useful, look at the wording: the lift came from a "well-structured" file, and the sites studied were "B2B SaaS" companies, mostly "developer-focused brands" (CiterLabs, April 2026). That is the exact audience llms.txt was invented for: software companies with sprawling documentation. It is the structure and the quality of the content doing the work, not the mere existence of the file. A tidy file pointing at genuinely useful pages helps an AI; an empty file pointing at thin pages does nothing.

So the two studies do not really contradict each other. The honest synthesis is this: for a typical small local business, an llms.txt file on its own has no demonstrated effect on whether AI recommends you. Where it helps at all, it is because it points cleanly at content that was already worth citing.

Why the hype exists (and why you can ignore the fear)

If the evidence is this measured, why does so much of the advice sound like a fire alarm?

Because some of the loudest voices are selling something. The "existential risk" framing comes from a vendor whose guide warns that without llms.txt "you're functionally invisible to the growing ecosystem of AI answer engines", that "your content will be misrepresented", and that your "brand's credibility" is "burning" (Profound, llms.txt guide). Profound sells AI-visibility software, so its job is partly to make you feel behind. Some of the underlying points are reasonable (clean, well-structured content does help AI read you). The "existential risk" wrapper is marketing.

There is a simple tell for this kind of advice. If a piece tells you a 30-minute file is the difference between thriving and disappearing, but never mentions the independent study showing no correlation, it is selling, not informing. The reality is calmer: llms.txt is a small, sensible, low-risk addition that the major engines mostly are not using yet, on a standard that may or may not catch on.

Caring about being found in AI answers is right. Believing one file fixes it is not.

So should your business add one? An honest verdict

Here is a straight decision you can act on today, based on the kind of business you run.

If your website is small (a handful of pages): do not bother yet, or add a basic one and forget about it. A removals firm or a barber with five pages does not have a "context window" problem for an AI to solve. The file would point at the same pages the AI can already read in seconds. Your time is far better spent on the work that genuinely makes the difference (covered below). There is no harm in having one; there is just very little point.

If your website is larger (many service pages, a resource library, location pages, guides): a hand-written llms.txt is a reasonable, low-cost tidy-up. Think of an estate agency with dozens of area pages, or an accountancy practice with guides on VAT, payroll, limited companies and self-assessment. A short file flagging your strongest pages takes under an hour, carries no technical risk, and means that the engines which do read it (Claude today, possibly more later) get a clean map. Treat it as good housekeeping, not a growth lever.

Either way, write it by hand, do not auto-generate it. Several plugins will spit out an llms.txt for you, but most produce a bare list of URLs with no descriptions, which is the version the research found does nothing. If you make one, write a real one-line description for each page in plain language. A file that says "AI visibility audits for UK service businesses" is worth ten that just dump a link.

One firm rule for every business: llms.txt only helps if it points at pages worth reading. Point it at thin or vague pages and you have simply indexed the problem. Fix the content first.

What actually gets a local business cited by AI

This is the part the file-focused advice skips, and it is the part that matters. If you want AI tools to name your business, the evidence points consistently to a handful of fundamentals. None of them is a file.

That list is the real work. For the full picture of how engines choose who to name, see how AI engines pick sources, and for the complete local playbook, start with our pillar guide, Answer Engine Optimisation for local businesses.

If you want a deeper, step-by-step walkthrough of writing an llms.txt file itself (the exact Markdown structure, what to include, how to keep it current), we keep that on our resource page on llms.txt rather than repeating it here.

Common misconceptions about llms.txt

Frequently asked questions

What is llms.txt in one sentence?

It is a short Markdown file at your website's root that gives AI tools a clean summary of your most important pages, proposed in September 2024 and still optional.

Do I need llms.txt for my small business?

Probably not as a priority. Independent research across nearly 300,000 sites found no link between having the file and being cited by AI (SE Ranking, November 2025). If your site is large and you want to tidy it for the engines that do read it, a hand-written file is a reasonable hour's work. If your site is small, your time is better spent on content, reviews and crawlability.

Will llms.txt hurt my site if I add one?

No. It carries no technical risk and exposes nothing sensitive. The only downside is spending time on it instead of on the work that actually drives citations.

Does Google or ChatGPT read it?

Google's AI-search guidance does not mention it. ChatGPT's crawler has been seen fetching llms.txt occasionally but not often. Anthropic's Claude is the clearest case of a platform that recommends it. Treat broad reliance as unproven for now (SE Ranking, November 2025).

Should I let a plugin generate it automatically?

Better to write it by hand. Auto-generated files are usually bare URL lists with no descriptions, which is the version that does nothing. A short, hand-written file with a real description per page is worth far more.

If llms.txt does so little, why does everyone talk about it?

Partly genuine curiosity about a sensible idea, and partly because some firms selling AI-visibility services use the fear of missing out to win business. The honest position is that it is a minor, optional tidy-up, not an emergency.

Where to start

If you take one thing from this: llms.txt is real, it is harmless, and it is nowhere near the most important thing you can do to show up in AI answers. Add a hand-written one if your site is big enough to benefit. Skip it for now if your site is small. Either way, put your real effort into clear content, a site AI can read, consistent business details and genuine reviews. That is what gets you named.

If you would rather know exactly where your business stands first (which pages are clear and crawlable, which need work, and whether AI tools currently mention you at all), that is what a QBiz AI Visibility check does. We look at your real setup, tell you plainly what is ready to be cited and what is holding you back, and hand you a prioritised list. No guaranteed-ranking promises, because nobody controls the platforms. Just an honest read before you spend time or money on anything, including a file you may not need.

Get your free AI Visibility check →

Sources

  • llms.txt proposal, llmstxt.org: https://llmstxt.org/ (primary; the original specification by Jeremy Howard, September 2024; states the context-window and HTML-clutter problem llms.txt is meant to solve; original examples are software documentation such as FastHTML)
  • SE Ranking, "Does LLMs.txt impact your AI visibility and citations? No, according to research," 7 November 2025: https://seranking.com/blog/llms-txt/ (independent; analysed ~300,000 domains; only 10.13% had an llms.txt file; machine-learning model found no correlation with AI citations and improved when the variable was removed; "no clear evidence that major AI platforms are actively using LLMs.txt"; GPTBot fetches it but "this doesn't happen often")
  • Google Search Central, "Top ways to ensure your content performs well in Google's AI experiences on Search," 21 May 2025: https://developers.google.com/search/blog/2025/05/succeeding-in-ai-search (primary; Google's own AI-search guidance lists unique content, page experience, crawlability and matching structured data; makes no mention of llms.txt)
  • CiterLabs, "State of llms.txt Adoption 2026," 29 April 2026: https://citerlabs.com/state-of/llms-txt-adoption-2026/ (VENDOR: CiterLabs sells GEO services; "well-structured llms.txt" linked to 15-40% higher citation rates within 60 days, but only across developer-focused B2B SaaS sites; ~12% of top-1000 B2B SaaS sites have adopted it; Anthropic recommends it for Claude)
  • Profound, "The role and functionality of llms.txt in LLM-driven web interactions," llms.txt guide: https://www.tryprofound.com/resources/articles/what-is-llms-txt-guide (VENDOR: Profound sells AI-visibility tools; source of the "existential risk of ignoring llms.txt" and "functionally invisible" framing this article weighs against; cited as an example of vendor hype, not as evidence of effect)

Leave a comment

Thoughts on this post? Leave a comment below. Comments are moderated before they appear, so yours will not show on the page straight away.

Your email is used only to contact you about your comment if needed — it is never published.

Comments

No comments yet. Be the first to leave one above.