The Best Of

Brent Rangen
Dec 27, 2024, 7:36 AM
Forwarded from another channel:
Interesting idea: anyone think it will get more traction?
A proposal to standardise on using an /llms.txt file to provide information to help LLMs use a website at inference time.
llms-txt: The /llms.txt file – llms-txt
Forwarded thread from another channel:
John Mueller
Dec 27, 2024, 8:20 AM
This seems like a kludge. Just make a normal, crawlable site?
Dave Smart
Dec 27, 2024, 8:39 AM
Feels like a really good way to feed LLMs one thing and users another, IMHO. I'd also worry about keeping the two in sync: update, change, or add to your content, and it's easy to forget the LLM version (although I guess you could automate that to some extent).
Brent Rangen
Dec 27, 2024, 10:04 AM
Definitely not setting the internet ablaze.
MIchael Buckbee
Dec 30, 2024, 6:02 AM
Couple different things:
• We have quite sophisticated tools for taking HTML+Images and making it plain text of all sorts, so I'm not sure it helps anything there.
• It's built on the assumption that LLMs just want text, but the newest models are quite multimodal and support images, video, etc., which makes this feel outdated.
• One larger strategic change is that you want more `Brand Statement` content as part of your overall content strategy than you did pre-AI, and that might not fit cleanly into how you present your marketing, so it might be helpful for that.
Emina Demiri-Watson
Jan 4, 2025, 2:47 AM
Cue consultants blaming its absence for lower results.
MIchael Buckbee
Jan 4, 2025, 9:47 AM
@emina well, I wonder how you even adopt it? Like do you re-render all your pages as markdown...that seems excessive.
Emina Demiri-Watson
Jan 5, 2025, 2:29 AM
@mike647 surely not! I just think it's overkill, but then maybe I'm not understanding something here.
John Mueller
Jan 5, 2025, 3:26 AM
Random data point - I crawled all our documentation and converted it into markdown to feed into NotebookLM. It's not super hard (I used Colab). Anyone training an AI model can do this if they're already crawling the web or ingesting HTML content. (That said, it was pretty cool to have all the content in a single folder :-))
Emina Demiri-Watson
Jan 5, 2025, 3:51 AM
@softplus plus now you can listen to the world's longest podcast!
MIchael Buckbee
Jan 6, 2025, 7:11 AM
I can definitely see there being "Packages" of some kind of documentation like this built for ingestion into your AI model of choice.
Andrea Volpini
Jan 15, 2025, 1:38 PM
If you work with LLMs, markdown helps - this is why llms.txt exists and services like Tavily are growing fast.
The best model to turn HTML to markdown in my personal experience is Reader-LM (ReaderLM-v2 is available from today).
A proposal to standardise on using an /llms.txt file to provide information to help LLMs use a website at inference time.
llms-txt: The /llms.txt file – llms-txt
Tavily is a search engine tailored for AI agents, delivering real-time, accurate results, intelligent query suggestions, and in-depth research capabilities.
Tavily
Reader-LM-0.5B and Reader-LM-1.5B are two novel small language models inspired by Jina Reader, designed to convert raw, noisy HTML from the open web into clean markdown.
Written by Jina AI. Reading time: 13 mins.
Reader-LM: Small Language Models for Cleaning and Converting HTML to Markdown
Andrea Volpini
Feb 4, 2025, 11:24 AM
We did a quick update here -
Adding MarkItDown by Microsoft #SEO #LLMs
Provides brief background information and guidance for your website to large language models, by creating your /llms.txt file.
Est. reading time: 3 minutes
AI-Powered SEO • WordLift: Generate llms.txt
Andrea Volpini
Feb 5, 2025, 3:04 AM
(and a blog post with all the links to the different solutions - there is also a WordPress plugin called AEOmatic that could be interesting)
Enhance your website's SEO by adopting the llms.txt standard, ensuring improved content accessibility and richer semantic connectivity for AI search engines.
Written by Andrea Volpini. Est. reading time: 6 minutes.
WordLift Blog: Mastering llms.txt: Your Essential Guide to GenAI-Optimized Website Indexing
Optimize your WordPress website for AI tools with AEOmatic. Our plugin automates Answer Engine Optimization (AEO), making your content discoverable by ChatGPT, Gemini, and other AI tools. The next evolution of SEO is here.
AEOmatic | WordPress LLMs.txt Generator Plugin
MIchael Buckbee
Feb 5, 2025, 8:45 AM
Has any AI search provider said that they're ingesting llms.txt?
John Mueller
Feb 5, 2025, 1:23 PM
It would be easy to see in server logs.
MIchael Buckbee
Feb 6, 2025, 6:32 AM
@softplus yeah - I'm working on some tracking of that, `robots.txt`, and `sitemap.xml`, as I'm curious.
Anne Berlin
Feb 24, 2025, 1:24 PM
I've looked at the logfiles for one of the larger sites I manage, and llms.txt has only been requested 5 times in the last 30 days, and the only user agent is BuiltWith. At the same time, this site has been visited by both of ChatGPT's user agents almost a million times. This is not a content publisher, it's a directory site, so YMMV.
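
For anyone who wants to run the same kind of check, here is a minimal sketch of counting `/llms.txt` requests per user agent. It assumes access logs in the Apache/nginx "combined" format, which may not match your server's configuration. The function name `count_llms_txt_requests` and the sample log lines are made up for illustration and are not taken from anyone's setup in this thread.

```python
import re
from collections import Counter

# Assumed layout (combined log format):
# ip ident user [time] "METHOD path proto" status size "referer" "user-agent"
LOG_LINE = re.compile(
    r'^\S+ \S+ \S+ \[[^\]]+\] "(?P<method>\S+) (?P<path>\S+)[^"]*" '
    r'\d{3} \S+ "[^"]*" "(?P<agent>[^"]*)"'
)


def count_llms_txt_requests(lines, target="/llms.txt"):
    """Count requests for `target`, grouped by user-agent string."""
    hits = Counter()
    for line in lines:
        match = LOG_LINE.match(line)
        if match is None:
            continue  # skip lines that are not in the assumed format
        path = match.group("path").split("?", 1)[0]  # ignore query strings
        if path == target:
            hits[match.group("agent")] += 1
    return hits


# Made-up sample lines for illustration only, not real log data.
SAMPLE_LOG = [
    '203.0.113.5 - - [24/Feb/2025:10:00:00 +0000] "GET /llms.txt HTTP/1.1" 200 512 "-" "ExampleBot/1.0"',
    '203.0.113.6 - - [24/Feb/2025:10:00:05 +0000] "GET /index.html HTTP/1.1" 200 2048 "-" "ExampleBrowser/2.0"',
    '203.0.113.5 - - [24/Feb/2025:10:01:00 +0000] "GET /llms.txt?x=1 HTTP/1.1" 200 512 "-" "ExampleBot/1.0"',
]

if __name__ == "__main__":
    for agent, count in count_llms_txt_requests(SAMPLE_LOG).most_common():
        print(f"{count}\t{agent}")
```

To use it on a real file, pass an open file handle instead of the sample list, for example `count_llms_txt_requests(open("access.log"))`, where `access.log` is a placeholder path. Passing `target="/robots.txt"` gives a baseline to compare against.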