Turn on the lights for internet... if you can afford cloudflare:
https://ugpl.net/blog/post/turn-on-the-lights-for-internet-if-you-can-afford-cloudflare.html

Turn on the lights for internet... if you can afford cloudflare:
https://ugpl.net/blog/post/turn-on-the-lights-for-internet-if-you-can-afford-cloudflare.html
Really interesting project Anubis to protect against #LLM scraping bots : https://anubis.techaro.lol/ #Scraping #bots
Le #scraping #payant : vers un changement radical du modèle économique de l’ #IA #AI #générative ?
#Cloudflare lässt KI-Crawler auflaufen, wenn nicht für #Scraping bezahlt wird | heise online https://www.heise.de/news/Cloudflare-laesst-KI-Crawler-auflaufen-wenn-nicht-fuer-Scraping-bezahlt-wird-10467015.html #PayPerCrawl #ArtificialIntelligence #copyright #Urheberrecht
Civil Society: Cloudflare’s latest change {blocks, unblocks} network use by {people, software} that we {hate, love} – {yay, boo} this is {great, terrible}!
https://alecmuffett.com/article/113629
#ai #censorship #cloudflare #scraping
@akamran @davidtoddmccarty If you search Google for #Mastodon hashtag scraping, you find software and programs that help AI for doing that. It exists.
Fact is that from today, the main instances mastodon.social and mastodon.online prohibit #scraping officially: https://techcrunch.com/2025/06/17/mastodon-updates-its-terms-to-prohibit-ai-model-training/
Problem of decentralisation: admins/users of other instances must get aware of the problem and change their terms, too.
It may be funny but it's no joke.
#Hinweis auf #Nutzbarkeit von #Data #Analytics / #Data #Science #Methoden #Scraping, #Pattern #Recognition, #Machine #Learning oder #Text #Mining für #soziologische #Forschung.
#Sutter / #Maasen - #Neuerfindung #Soziologie S.76 f. 2020 DOI: 10.5771/9783845295008-73
Turn out the lights, the internet is over:
https://ugpl.net/blog/post/turn-out-the-lights-the-internet-is-over.html
5 Best JavaScript Web Scraping Libraries in 2025, by @apify.bsky.social:
https://blog.apify.com/best-javascript-web-scraping-libraries/
@anirvan @404mediaco the only way to deal with this is the same as with any other #malware and #DDoS:
I do maintain a #blocklist of those and will happily accept suggestions and pull requests...
https://github.com/greyhat-academy/lists.d/blob/main/scrapers.ipv4.block.list.tsv
#AI #scraping #GLAM #CulturalHeritage
'AI bots that scrape the internet for training data are hammering the servers of libraries, archives, museums, and galleries, and are in some cases knocking their collections offline, according to a new survey published today.'
https://www.404media.co/ai-scraping-bots-are-breaking-open-libraries-archives-and-museums/
My anti society
collision course
I charted to address
my many anxieties
is rapidly
approaching the end of the line
The impending
gravity induced crash
should be quite the event
given the acceleration
of my descent
The "I told you so"
chorus of the status quo
will be very pleased
#scraping up my bloody carcass
to mount on their Warning Wall
I sure would hate
to give these smug bastards
the confirmation validation
they so desperately need
Deceased me
playing the lead
in their future forewarning
history stories
to unborn rebellious
non conformist generations
about the folly
of living life brazenly
outside the rigid boundaries
constructed with bricks of bullshit
I guess it's time to confess
my internal trepidation
Those pointless
could and should ofs
as if somehow
we are the captains
of our destinies
For you see my soon to be
grieving comrades
the die was cast
for ill fated us
the day we were born
Playing the roles
fate precisely defines
scribed in the unwavering stars
#vss365
A Thought on JavaScript “Proof of Work” Anti-Scraper Systems, by @cks:
https://utcc.utoronto.ca/~cks/space/blog/web/JavaScriptScraperObstacles
Oh look, another #Python library to help you scrape the #web like a pro... or at least until every CAPTCHA on the planet realizes you're a faking it.
Because, clearly, the world needed an "async" way to prove that you’re not a human.
https://github.com/autoscrape-labs/pydoll #Scraping #Async #Libraries #CAPTCHAs #Automation #HackerNews #ngated
PyDoll – Async Python scraping engine with native CAPTCHA bypass
Get your #Chat4Data guide book in one click!
1. Install in Chrome Web Store
2. Start your chat and #scraping journey
and more...!
Explore more herehttps://chat4data.notion.site/Chat4Data-Knowledge-Base-201f2a5316eb807b8b62f22b224ffb61?pvs=143
#chromeextension
Scrapy, el framework open source que se ha convertido en el terror silencioso de millones de sitios web https://blog.elhacker.net/2025/06/scrapy-el-framework-open-source-scraping.html #scraping #scraper
Stop making stupid ai-people famous! #ai #aiart #scraping #hyperreality #dr
¿La IA podría arruinar la comunidad de los fan fiction? | #RollingStoneES
https://es.rollingstone.com/la-ia-podria-arruinar-la-comunidad-de-los-fan-fiction/
A thought on JavaScript "proof of work" anti-scraper systems
https://utcc.utoronto.ca/~cks/space/blog/web/JavaScriptScraperObstacles
#HackerNews #JavaScript #proof #of #work #anti-scraper #systems #web #scraping #technology #security