Asklemmy

43898 readers

1175 users here now

A loosely moderated place to ask open-ended questions

Search asklemmy 🔍

If your post meets the following criteria, it's welcome here!

Open-ended question
Not offensive: at this point, we do not have the bandwidth to moderate overtly political discussions. Assume best intent and be excellent to each other.
Not regarding using or support for Lemmy: context, see the list of support communities and tools for finding communities below
Not ad nauseam inducing: please make sure it is a question that would be new to most members
An actual topic of discussion

Looking for support?

Looking for a community?

Lemmyverse: community search
sub.rehab: maps old subreddits to fediverse options, marks official as such
!lemmy411@lemmy.ca: a community for finding communities

~Icon~ ~by~ ~@Double_A@discuss.tchncs.de~

founded 5 years ago

MODERATORS

Will we try to prevent google (and other) scrapers? (lemmy.ml)

submitted 1 year ago by Screak42@lemmy.ml to c/asklemmy@lemmy.ml

8 comments fedilink hide all child comments

Will we try to prevent google (and other) scrapers?

The headline is pretty much a summary. "Google Says It will Scrape Everything You Post Online for AI" https://www.gizmodo.com.au/2023/07/google-says-it-will-scrape-everything-you-post-online-for-ai/

The first question is obviously; do we as a community on Lemmy even want to try and stop them from scraping our content here? If no; well. ok then.

If yes; how? I'm not sure if "preventing access" to unregistered users would really prevent this. Pretty sure google has enough money and manpower to figure out a way to make it their mission to get around "can only accessed by members" content.

you are viewing a single comment's thread
view the rest of the comments

[–] nottheengineer@feddit.de 3 points 1 year ago

The reddit API thing started because reddit thought they owned the content and could lock it behind a paywall for people who want training data. But that fundamentally isn't the case, so that whole thing backfired.

If someone wants to own the content and restrict access, they have to distribute it on their own instead of using a public platform. Lemmy is the wrong tool for that.