this post was submitted on 23 Dec 2024
110 points (96.6% liked)

Fuck AI

1514 readers
5 users here now

"We did it, Patrick! We made a technological breakthrough!"

A place for all those who loathe AI to discuss things, post articles, and ridicule the AI hype. Proud supporter of working people. And proud booer of SXSW 2024.

founded 9 months ago
MODERATORS
 

This is a proposal by some AI bro to add a file called llms.txt that contains a version of your websites text that is easier to process for LLMs. Its a similar idea to the robots.txt file for webcrawlers.

Wouldn't it be a real shame if everyone added this file to their websites and filled them with complete nonsense. Apparently you only need to poison 0.1% of the training data to get an effect.

you are viewing a single comment's thread
view the rest of the comments
[โ€“] Prunebutt@slrpnk.net 23 points 2 days ago (1 children)

It would be incredibly ~~funny~~ wrong if this was adopted and used to poison LLMs.

[โ€“] raoul@lemmy.sdf.org 22 points 2 days ago (3 children)

We could respect this convention the same way the IA webcrawlers respect robot.txt ๐Ÿคทโ€โ™‚๏ธ

[โ€“] Tower@lemm.ee 9 points 2 days ago (1 children)

Do webcrawlers from places other than Iowa respect that file differently?

[โ€“] raoul@lemmy.sdf.org 8 points 2 days ago (2 children)

Sorry: Intelligence Artificielle <=> Artificial Intelligence

[โ€“] Tower@lemm.ee 3 points 2 days ago

No worries. I was just making a joke.

[โ€“] Jakeroxs@sh.itjust.works 1 points 2 days ago

๐ŸŽ๐Ÿง 

[โ€“] DaGeek247@fedia.io 4 points 2 days ago

I've had a page that bans by ip listed as 'dont visit here' on my robots.txt file for seven months now. It's not listed anywhere else. I have no banned IPs on there yet. Admittedly, i've only had 15 visitors in that past six months though.

Seriously. I've never seen a convention so aggressively ignored. This isn't the brilliant idea some think it is.