social.kernel.org

Conversation

bert hubert 🇺🇦🇪🇺🇺🇦

Edited 4 months ago

So after the @lwn post on being hammered by scrapers today, I ran an analysis on what I thought was a recent phenomenon: a query from what tries to pass as a browser from an IP address that does exactly *1* query in a 24 hour period. You can't filter an IP address that makes just one visit. Turns out this happens a lot, sometimes 250k unique single use addresses/day!

~n

nblr@chaos.social

4 months ago

Reply to @bert_hubert@eupolicy.social

Edited 4 months ago

@bert_hubert @lwn
Is there any pattern to the addresses?

I heard some rogue crawlers use cheaply made "free to play" mobile game apps that mainly serve as bot platform to query from hard to block residential ip space.

bert hubert 🇺🇦🇪🇺🇺🇦

bert_hubert@eupolicy.social

4 months ago

Reply to @nblr@chaos.social

@nblr @lwn The countries doing single use queries today:

Jonathan Corbet

corbet

4 months ago

Reply to @bert_hubert@eupolicy.social

@bert_hubert @lwn Today's attack on LWN was a good 250K addresses. Gotta download all those articles from 2010, just in case they changed somehow...

Something has to be done about this, but I sure don't know what. They are using other people's devices, so they don't really care about burning some CPU time on Anubis challenges - and they have evidently learned to do that.

Sometimes I think we need to just toss the net and start over.

Solarpunk Davy

SolarDavy@climatejustice.social

4 months ago

Reply to @bert_hubert@eupolicy.social

@bert_hubert maybe a lot of self-hosted rss readers? /j

bert hubert 🇺🇦🇪🇺🇺🇦

bert_hubert@eupolicy.social

4 months ago

Reply to @SolarDavy@climatejustice.social

@SolarDavy nope, nothing like that. They also check far more than once a day!

Solarpunk Davy

SolarDavy@climatejustice.social

4 months ago

Reply to @bert_hubert@eupolicy.social

@bert_hubert yeah, it was a very bad joke (I added the joke modifier).

I mean, self-hosted rss is soo niche, I would love it if it wasn't 😅

bert hubert 🇺🇦🇪🇺🇺🇦

bert_hubert@eupolicy.social

4 months ago

Reply to @SolarDavy@climatejustice.social

@SolarDavy it is not as niche as you might think! I get a shitload of RSS queries!

Solarpunk Davy

SolarDavy@climatejustice.social

4 months ago

Reply to

@jwildeboer @bert_hubert do you know if they're from self-hosted rss clients (for example miniflux)? Or more stuff like Feedly?

About social.kernel.org

Terms of service

Please do not use this service in violation of the Linux Kernel Code of Conduct. Doing so will result in your account suspension with the referral of the matter to the CoC committee.
"Repeating"/"boosting" someone else's status on this platform will be treated as endorsement and will fall under rule #1.
You are encouraged to use this platform to promote your work on the Linux Kernel, but there is no restriction on permitted topics (with the exception of anything covered by #1 above).
There is no requirement to post in English, but it should be considered the primary language of communication on this platform.

Privacy notice

The admins of this service have access to all posted statuses. They aren't looking, but if it's something they shouldn't know about, then you should not post it on this platform.

Please see the Linux Foundation Privacy Policy, which applies to this platform as well.

Getting your own account

If you would like an account on this instance, please check that the following applies to you:

You are listed in MAINTAINERS or CREDITS
OR: You have a kernel.org account or email address
OR: You have a long and established history of involvement with the Linux Kernel

If the above is true and you agree with the Terms of Service and Privacy Notice listed above, please use these instructions to request an account:

How to request an account on social.kernel.org