this post was submitted on 23 Dec 2024
110 points (96.6% liked)

Fuck AI

1514 readers
9 users here now

"We did it, Patrick! We made a technological breakthrough!"

A place for all those who loathe AI to discuss things, post articles, and ridicule the AI hype. Proud supporter of working people. And proud booer of SXSW 2024.

founded 9 months ago
MODERATORS
 

This is a proposal by some AI bro to add a file called llms.txt that contains a version of your websites text that is easier to process for LLMs. Its a similar idea to the robots.txt file for webcrawlers.

Wouldn't it be a real shame if everyone added this file to their websites and filled them with complete nonsense. Apparently you only need to poison 0.1% of the training data to get an effect.

you are viewing a single comment's thread
view the rest of the comments
[–] Gork@lemm.ee 29 points 2 days ago* (last edited 2 days ago) (2 children)

Place output from another LLM in there that has thematically the same content as what's on the website, but full of absolutely wrong information. Straight up hallucinations.

[–] Voroxpete@sh.itjust.works 12 points 2 days ago

This. Research has shown that training LLMs on the output of other LLMs very rapidly induces total model collapse. It's basically AI inbreeding.

[–] haverholm@kbin.earth 18 points 2 days ago

Using one LLM to fuck up a lot more is poetic I suppose. I'd just rather not use them in the first place.