@smxi@corbet we're kinda trying to tell you that a single IP will hit 2-3 times an hour or so. You can't do behavioural analysis over 3 hits. They request 2-3 specific URLs with generic browser client strings and then aren't seen again. But multiply this by tens of thousands of IPs all coming from different subnets and you have a problem.
A followup for folks who are curious about the whole AI botswarm problem...
Some of these bots are clearly running on a bunch of machines on the same net. I have been able to reduce the traffic significantly by treating everything as a class-C net and doing subnet-level throttling. That and simply blocking a couple of them.
But that leaves a lot of traffic with an interesting characteristic: there are millions of obvious bot hits (following a pattern through the site, for example) that all come from a different IP. An access log with 9M lines as over 1M IP addresses, and few of them appear more than about three times.
So these things are running on widely distributed botnets, likely on compromised computers, and they are doing their best to evade any sort of recognition or throttling. I don't think that any sort of throttling or database of known-bot IPs is going to help here...not quite sure what to do about it.
@corbet@adelie@LWN I have been using pyasn to block entire subnets. It's effective, but only in the same way carpet bombing is. I'm sure I've blocked legitimate systems, but c'est la vie.
@major I'm going to be a voice of dissent and will say that, from my experience, this isn't very effective as an immersion strategy and just leads to subconsciously associating the language you're learning with frustration. I found reading children's books aimed at younger readers to be the best way to build vocabulary and get familiar with sentence structures and common turns of phrase. But, YMMV. :)
Γ nei, jeg fikk endelig boken jeg bestilte fra Norge, bare for Γ₯ oppdage at den er pΓ₯ nynorsk. Jeg kan lese den, men det kommer til Γ₯ ΓΈdelegge rettskrivningen min. :/
Please do not use this service in violation of the Linux
Kernel Code of Conduct. Doing so will result in your account suspension with the referral of the matter
to the CoC committee.
"Repeating"/"boosting" someone else's status on this platform will be treated as endorsement and will fall under
rule #1.
You are encouraged to use this platform to promote your work on the Linux Kernel, but there is no restriction
on permitted topics (with the exception of anything covered by #1 above).
There is no requirement to post in English, but it should be considered the primary language of communication
on this platform.
Privacy notice
The admins of this service have access to all posted statuses. They aren't looking, but if it's something they
shouldn't know about, then you should not post it on this platform.