Bluesky says it is up to outside orgs to respect user consent, after a Hugging Face employee published a dataset of 1M posts from Bluesky's API for ML research (Samantha Cole/404 Media)
https://www.404media.co/someone-made-a-dataset-of-one-million-bluesky-posts-for-machine-learning-research/
http://www.techmeme.com/241127/p1#a241127p1
LB Thank goodness for Bluesky, attracting all the scraper scumlords we kept blocking here on the Fedi.
@futzle
Don't expect it to last. In fact, it's probably already been done; it's just that most scrapers have no reason to share on Hugging Face.
@notsoloud @futzle I dimly remember that there was an incident a few years back where someone scraped data from the fediverse. It was before the AI dystopia, can't remember now what it was for, exactly. Full-text search? But I'm sure that AI scrapers run over the fediverse just like they hammer the rest of the internet. That's (yet another) sad reality: if you don't want to be in AI datasets, don't post on the public internet. #FuckAI
@Techmeme On Imgur I've been trying to point out for weeks that Bluesky is just Twitter all over again, and the reaction I've gotten is absolutely saddening. How DARE I suggest their new favorite corporate overlord is a bad guy!
Of course, mentioning Masto brings the same response every time...it's too haaaaaard. Which, as I've said elsewhere, is horsesh*t, and a myth we Masto users need to stop reinforcing.
So many people are so effing desperate to be enslaved.
@Techmeme I posted this same article there tonight, and it's sitting in Downvote Hell. How did we become so stupid?
@Techmeme And I just removed the Imgur post after losing over six thousand karma points and getting over a hundred nasty PMs. What a world we live in.
@Techmeme I give it a year before the investors lock down the API and start charging for it
@Techmeme If you're surprised at this, you're not paying attention to risks in social media.
IOW, Bluesky doesn't care and won't lift a finger to protect user privacy.