AI farms can just up and die, please. Add overscraping to the long list of public resources being ruined by commercial greed -- to join overfishing, overlogging, and overgrazing.
And no, it's not easy to "just throttle them" when it's thousands of IPs all coming from public cloud subnets with user-agents matching common modern browsers.
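
To illustrate why per-IP throttling struggles here, the following is a minimal sketch, not anything the poster describes running. It assumes IPv4 clients and an arbitrary /24 grouping; the function name `requests_per_subnet` and the sample addresses (taken from the reserved documentation ranges) are hypothetical. It shows that each address can stay under a per-IP limit while the enclosing subnet as a whole is hammering the server.

```python
from collections import Counter
import ipaddress


def requests_per_subnet(client_ips, prefix_len=24):
    """Count requests per enclosing network (assumed: IPv4, /24 by default)."""
    counts = Counter()
    for ip in client_ips:
        # strict=False masks off the host bits, yielding the enclosing network.
        net = ipaddress.ip_network(f"{ip}/{prefix_len}", strict=False)
        counts[net] += 1
    return counts


# Hypothetical log: 50 distinct addresses in one subnet, one request each,
# plus a single ordinary client making three requests.
sample = [f"203.0.113.{i}" for i in range(1, 51)] + ["198.51.100.7"] * 3

per_ip = Counter(sample)
per_subnet = requests_per_subnet(sample)

# No single address exceeds three requests, yet one /24 accounts for fifty.
print(max(per_ip.values()))
print(per_subnet.most_common(1))
```

Grouping by subnet makes the pattern visible, but acting on it is the hard part: the same cloud ranges may also carry legitimate traffic, which is the difficulty the post points at.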

@monsieuricon I think Drew DeVault has a project meant to keep track and fight against abuse: https://git.sr.ht/~sircmpwn/abused

I'm not sure if it's appropriate for your use cases though. I remember they (the sourcehut people) started working on it after they had issues with a large DDoS earlier in the year.

@monsieuricon I activated fail2ban just three days ago, and now I fear for the iptables config size.

@monsieuricon Figuring out a snappy way to say that so much of "AI" is excessive, wasteful, or bad, but that it could maybe be useful if done responsibly, would be really good, I think. "Overscraping" might not be it, but "overfishing", for example, doesn't demonize fishing; it frames it through a sustainability lens, and that is a crucial piece missing from the criticisms of AI.

@monsieuricon I just don't think the broader public can connect with "scraping" mentally. I could be wrong, but I think it is a fairly niche concept.

@monsieuricon Maybe we need sustainably sourced, locally grown AI, not factory-farmed AI?

@monsieuricon Is blocking all public cloud subnets an option?