Theoretically speaking, what level of nonsense are we talking about in order to really mess up the training model?
a) Something that doesn't represent the actual contents of the website (like posting "The Odyssey" to the llms.txt
of a software documentation site),
b) a randomly generated wall of real words out of context, or
c) just straight lorem ipsum filler?