Subject: AI scraper bots driving scientific websites to a crawl
From: fungus (at) *nospam* amongus.com.invalid (Retrograde)
Newsgroups: sci.misc
Date: 03. Jun 2025, 04:24:05
Message-ID : <683e6ad5$2$13$882e4bbb@reader.netnews.com>
From the «kill it with fire» department:
Title: Web-Scraping AI Bots Cause Disruption For Scientific Databases and Journals
Author:
feedback@slashdot.org
Date: Mon, 02 Jun 2025 17:25:00 +0000
Link:
https://science.slashdot.org/story/25/06/02/172202/web-scraping-ai-bots-cause-disruption-for-scientific-databases-and-journals?utm_source=rss1.0mainlinkanon&utm_medium=feed

Automated web-scraping bots seeking training data for AI models are flooding
scientific databases and academic journals with traffic volumes that render
many sites unusable. The online image repository DiscoverLife, which contains
nearly 3 million species photographs, started receiving millions of daily hits
in February this year that slowed the site to the point that it no longer
loaded, Nature reported Monday. The surge has intensified since the release of
DeepSeek, a Chinese large language model that demonstrated effective AI could
be built with fewer computational resources than previously thought. This
revelation triggered what industry observers describe as an "explosion of bots
seeking to scrape the data needed to train this type of model." The
Confederation of Open Access Repositories reported that more than 90% of 66
surveyed members experienced AI bot scraping, with roughly two-thirds suffering
service disruptions. Medical journal publisher BMJ has seen bot traffic surpass
legitimate user activity, overloading servers and interrupting customer
services.
Read more of this story[5] at Slashdot.
Links:
[5]:
https://science.slashdot.org/story/25/06/02/172202/web-scraping-ai-bots-cause-disruption-for-scientific-databases-and-journals?utm_source=rss1.0moreanon&utm_medium=feed (link)