social.kernel.org

Conversation

K. Ryabitsev-Prime 🍁

x.x.x.x - - [10/Nov/2024:00:02:37 +0000] "GET / HTTP/1.1" 301 162 "-" "okhttp/4.9.0"

You know what’s interesting about this log line? It repeats 56,686,963 times in www.kernel.org logs for yesterday, across 4 nodes. That’s about 700 times a second, and this has been going on for months.

These requests aren’t intentionally malicious – they issue a simple GET /, receive their 301 redirect, and terminate the connection. From what I can tell, this is some kind of appliance or software installed on mobile clients that uses “can I reach www.kernel.org” as a network test.

This wouldn’t be that big of a deal – a single plaintext “GET /“ that triggers an immediate 301 is very cheap for us to generate, but the number of these requests has been steadily growing.

If you have any idea what this is and how to make it stop, please reach out?

36

437

296

K. Ryabitsev-Prime 🍁

Reply to @monsieuricon

I do have a solution in mind if it gets bad -- we already have cdn.kernel.org going through Fastly, so I will just point www.kernel.org to go through there, too. I am mostly perplexed and unamused that someone's quick thoughtless hack is starting to cause us problems.

2

4

29

Raphael

rami@chaos.social

Reply to @monsieuricon

@monsieuricon How to make it stop? Let any requests with this user agent just time out, then the connectivity check becomes useless 😈

3

0

0

Daniel

djh@chaos.social

Reply to @monsieuricon

@monsieuricon I wonder what happens if you add an iptables reject rule matching on that specific user agent for a week or so 🦧💡

1

0

1

K. Ryabitsev-Prime 🍁

Reply to

@artandtechnic It doesn't look like that -- the only records I have from the IPs that do that is that one GET /. They may or may not come back a few times a day, but nothing definitive.

0

0

7

K. Ryabitsev-Prime 🍁

Reply to @djh@chaos.social

@djh I thought about that, but my firewall match table would get huge and it would cause problems in itself. That's 12,000,000 unique IPs just on one single node.

2

0

9

Ben Cardoen

bencardoen@mstdn.science

Reply to @monsieuricon

@monsieuricon Perhaps OS fingerprinting with Wireshark can tell you if these are from the same device/OS or not, which could narrow things down?

1

0

0

K. Ryabitsev-Prime 🍁

Reply to @bencardoen@mstdn.science

@bencardoen Trouble is, I'm usually seeing the IP of the NAT routing endpoint, not the actual device, so this doesn't tell me very much.

0

0

8

Amber

puppygirlhornypost2@transfem.social

Reply to @monsieuricon

@monsieuricon id expect this for google.com but not kernel.org. Interesting.

1

0

0

Josh Anders

josh@joshanders.com

Reply to @monsieuricon

@monsieuricon Sounds like a OnePlus thing.

1

0

1

Robert Link

phaedral@mastodon.social

Reply to @monsieuricon

@monsieuricon I instantly think about microsoft and WSL, but that's just prejudice. I recall, however, working for Symantec in 2009, a QA pal being amazed at how many "needless" connections win spawned.

0

0

0

K. Ryabitsev-Prime 🍁

Reply to

@tony @bencardoen Yes, that's my current suspect as well -- some popular Android clone either on mobile or some set-top box. The number of unique IPs is in tens of millions, so it's probably a mobile app.

1

0

10

Raven667

raven667@hachyderm.io

Reply to

@R1Rail @monsieuricon my predecessor had to figure this kind of thing out a whole ago, got a paper out of it. https://pages.cs.wisc.edu/~plonka/lisa/lisa2003/lisa-netgear-sntp.pdf. happy hunting Mr.Icon

0

0

0

K. Ryabitsev-Prime 🍁

Reply to @josh@joshanders.com

@josh Good guess, I do see significantly more of this traffic on our Singapore and Amsterdam nodes.

0

0

9

Steven Reed

srtcd424@mas.to

Reply to @monsieuricon

@monsieuricon
If it's an app it's presumably capable of being updated.. I wonder if you can convince it the check has failed after you've seen the request headers? E.g. send an RST? Do that for 10% of requests (so as not to be a complete a-hole) and someone might notice and update...
@bencardoen @tony

1

0

0

K. Ryabitsev-Prime 🍁

Reply to @srtcd424@mas.to

@srtcd424 @bencardoen @tony this would be a lot of work for not obvious gain. My guess is that we're not the only host they check, so this may go completely unnoticed by the app makers, so we'll spend a lot of effort for minimal gain.

0

1

4

AntiComposite @ Wikimania

anticomposite@wikis.world

Reply to @monsieuricon

@monsieuricon People pick the weirdest things for their connectivity tests. https://phabricator.wikimedia.org/T273741 remains the weirdest I've seen though, where an unnamed app decided to use a random picture of a flower and ended up causing ~20% of the traffic to a Wikimedia Foundation datacenter.

0

1

0

Justin

Reply to @monsieuricon

@monsieuricon @djh you could possibly start returning 404 for that user agent.

2

0

0

AlisonW ♿🏳️‍🌈♾️

AlisonW@fedimon.uk

Reply to @monsieuricon

Edited 1 year ago

@monsieuricon
OkHttp appears to be a Curl alternative for Android so I'm guessing that someone used you in an example of how to use it in some guide or other. This is why example.com exists ffs!

0

0

0

Christophe B.

bladecoder@androiddev.social

Reply to @monsieuricon

@monsieuricon Not saying this is the culprit but this code seems to do the same thing:
https://github.com/TeamNewPipe/NewPlayer/blob/89d6f16872f656dd62e47320d9cfd904f087b601/test-app/src/main/java/net/newpipe/newplayer/testapp/TestMediaRepository.kt#L108

1

0

0

satmd

satmd@brettvormkopf.de

Reply to @monsieuricon

@monsieuricon
this could be a very bad attempt at checking internet connectivity for android apps. Some "clever" people decide to do such things with sites that are always up and stable to avoid false alerts for them. Reminds me of hardcoded ntp servers a few years ago.

It'll probably be fun to send 4xx with an error message back in order to get feedback. I assume the vendor doesn't care and the users don't know. I do think the source of that needs be stopped.

0

0

0

Palmer Dabbelt

palmer

Reply to @monsieuricon

@monsieuricon the ChangeLog claims v4.12.0 has "Fix: Don’t hang taking headers for HTTP 103 responses." Maybe that would slow them down?

1

0

8

Mark

weipah@chaos.social

Reply to @monsieuricon

@monsieuricon have you checked if those clients request any other assets from *.kernel.org
Can you tell when exactly this started maybe?

0

0

0

John Timaeus

johntimaeus@infosec.exchange

Reply to @rami@chaos.social

@rami @monsieuricon

I'd honestly just drop all responses to that user agent. Or delay them by ~30 seconds.

1

0

0

FunGuy2PlayWith

funguy2playwith@mastodon.online

Reply to @monsieuricon

@monsieuricon
Have you seen this? It mentions an okhttp 4.9.0 critical vulnerability? Might be related?

https://github.com/strimzi/strimzi-kafka-operator/issues/6934

0

0

0

John Kristoff

jtk@infosec.exchange

Reply to @monsieuricon

@monsieuricon We (Dataplane.org) see lots of the okhhtp agents fetching more than /, particularly from cloud/search companies like microsoft and google.

And github.com/square/okhttp you may have discovered seems to be some web client that "perseveres when the network is troublesome".

0

0

0

Luna Lactea

jackemled@furry.engineer

Reply to @monsieuricon

@monsieuricon Is there any way to automatically identify these requests & return a 403 instead?

0

0

0

penguin42

penguin42@mastodon.org.uk

Reply to @monsieuricon

@monsieuricon If it's that common, do you just need to find a friendly company/org with a large wifi setup and ask them to look at which devices are making kernel.org connections and see if they correspond to any start-of-MAC manufacturer codes?

1

0

0

FlyingMana

Flyingmana@phpc.social

Reply to @monsieuricon

@monsieuricon another option is to let them fail in certain timeframes, maybe every tuesday, and see who is going to start crying 👀

0

0

0

the vessel of morganna

astraleureka@treehouse.systems

Reply to @penguin42@mastodon.org.uk

@penguin42 @monsieuricon Most mobile devices default to randomised locally-assigned MACs these days, good for anti-fingerprinting in adversarial situations but makes diagnostics a right pain

1

0

0

penguin42

penguin42@mastodon.org.uk

Reply to @astraleureka@treehouse.systems

@astraleureka @monsieuricon Hmm, that's unfortunately very sensible. I guess then if you're lucky with a large org (or a conference??) you might be able to get back to particular users and ask nice ones. Much trickier though.

1

0

0

stux⚡️

stux@mstdn.social

Reply to @monsieuricon

@monsieuricon Ooof... that's not fun indeed!

Boosted and hopefully it'll get resolved soon

0

0

0

K. Ryabitsev-Prime 🍁

Reply to @palmer

@palmer lol, I'll try this. :)

0

0

3

K. Ryabitsev-Prime 🍁

Reply to @bladecoder@androiddev.social

@bladecoder not quite -- the requests are http, not https. Also, I doubt there are tens of millions of daily NewPipe users out there. :)

2

0

1

K. Ryabitsev-Prime 🍁

Reply to @justin@toot.io

@justin @djh it doesn't seem to make any difference what code we return -- my guess is that we're just one of the sites the check hits. If it gets an error, it just checks a different site.

0

0

4

Christian Huitema

huitema@social.secret-wg.org

Reply to @monsieuricon

@monsieuricon I would lie. Keep track of the IPs doing that, and if they keep at it reply 404. That way the devices will misbehave and whoever has shipped that code will get a bug report, snd eventually fix their code.

1

0

0

rastilin

rastilin@aus.social

Reply to @rami@chaos.social

@rami @monsieuricon

Absolutely. Just letting them time out is the correct answer.

0

0

0

UpLateGeek

UpLateGeek@bitbang.social

Reply to @monsieuricon

@monsieuricon might want to check if your upstream provider has DOS protection options available which would blackhole the traffic before they hit your network.

0

0

0

Princess Pixel Light

pixellight@pony.social

Reply to @feld@friedcheese.us

@feld @monsieuricon it's used by the vast majority of android apps, react native or not

0

0

0

Steve Purcell

sanityinc@hachyderm.io

Reply to @huitema@social.secret-wg.org

@huitema @monsieuricon yeah, and return a response body that explains why

0

0

0

Marsh Ray

marshray@infosec.exchange

Reply to @monsieuricon

@monsieuricon If you return html with an img tag, does it load it?

Does it run script?

1

0

0

Andrej Shadura

andrew_shadura@mastodon.social

Reply to @justin@toot.io

@justin, but okhttp is just a Java HTTP client library, in particular popular on Android, there's nothing wrong with it per se.

@monsieuricon @djh

1

0

0

Nik | Klampfradler 🎸🚲

nik@toot.teckids.org

Reply to @monsieuricon

@monsieuricon Drop the requests and see what starts burning.

0

0

0

Justin

Reply to @andrew_shadura@mastodon.social

@andrew_shadura Ah that's unfortunate. @monsieuricon @djh

0

0

0

UzakL

Reply to @monsieuricon

@monsieuricon Offline the server for half an hour and see who is complaining ?

0

0

0

chrysn

chrysn@chaos.social

Reply to @johntimaeus@infosec.exchange

@johntimaeus @rami @monsieuricon Delaying then by 30s at 700/s would add 20,000 TCP connections at any time, that may be way harder on the system than the GETs themselves. A 429 error with Retry-After would do something similar without that load.

3

0

0

Bastian 🦊 ooo-eeee-ooo

dasrecht@chaos.social

Reply to @monsieuricon

@monsieuricon just blocking it based on the user agent for a few hours as a brown out test. and then fully blocking it later on would be my approach.

similarly why google limits ICMP to 8.8.8.8

0

0

0

Raphael

rami@chaos.social

Reply to @chrysn@chaos.social

@chrysn @johntimaeus @monsieuricon haproxy has a "slient drop" feature for example 😎 https://www.haproxy.com/blog/use-haproxy-response-policies-to-stop-threats

0

0

0

datenwolf

datenwolf@chaos.social

Reply to @chrysn@chaos.social

@chrysn @johntimaeus @rami @monsieuricon

Unilaterally close the connection without sending a RST | FIN

0

0

0

tobigr

tobigr@floss.social

Reply to @monsieuricon

@monsieuricon @bladecoder NewPipe dev here: NewPlayer is a standalone lib which is currently under development. It is thought to be NewPipe's next media player framework, but has not been integrated in NewPipe yet. What you have linked here is the test app for the new player. It is not used except by <10 devs to test their changes. If you want me to, I can change the address to something else though.

1

0

0

Peter N. M. Hansteen

pitrh@mastodon.social

Reply to @monsieuricon

@monsieuricon This sounds eerily like another consumer device manufacturer thinking your site would be one of those that's always up and running, much like FreeBSD developer Poul-Henning Kamp discovered D-Link had done to his timeserver way back when - see eg https://www.theregister.com/2006/05/11/d-link_time_dispute_settlement/.

Happy bozo hunting!

1

0

0

The Penguin of Evil

etchedpixels@mastodon.social

Reply to @monsieuricon

@monsieuricon Have you talked to the okhttp team see if they have any ideas from their user data who it might be and if they can push a block into their library.

0

0

0

The Penguin of Evil

etchedpixels@mastodon.social

Reply to @chrysn@chaos.social

@chrysn @johntimaeus @rami @monsieuricon You only have to delay a small random subset of them by 60 seconds to create a random really annoying response lag in the app. Even more so if you take a small subset and serve them a few characters per second for 5 mins 8)

If it handled 429 with a retry then you could have also issued a 429 and blackholed them for a while. Alas not it seems.

0

0

1

4 "sparkling fire" censord 📞4236@emf

4censord@unfug.social

Reply to @monsieuricon

@monsieuricon first thing i'd check is if that just something the library does by default, okhttp is this: https://github.com/square/okhttp

0

0

0

Ross Burton

ross@hachyderm.io

Reply to @tobigr@floss.social

@tobigr @monsieuricon @bladecoder I'd recommend using a URL you control for testing purposes. You never know what will happen with something like kernel.org, from causing traffic to changes making your tests break.

1

0

0

Trouble

trouble@masto.ai

Reply to @ross@hachyderm.io

@ross @tobigr @monsieuricon @bladecoder Also, devs copy and paste code all the time, so even though YOUR codebase is only directly used by a few people, someone might copy it into a production app. There have been several network overload type issues over the years. The worst I know of is NTP on home gateways which took over a decade to resolve.

1

0

0

Trouble

trouble@masto.ai

Reply to @penguin42@mastodon.org.uk

@penguin42 @astraleureka @monsieuricon Despite random MAC addresses, device fingerprinting is still very easy, though slightly intrusive. Worse still, iOS "resets" the setting that turns static MAC addresses off with every upgrade, which screws up things like restaurant point-of-sale systems which rely on one iPad being "the master" at a static IP address (vs dynamically finding it)

0

0

0

Trouble

trouble@masto.ai

Reply to @pitrh@mastodon.social

@pitrh @monsieuricon Oh! I knew about the Linksys NTP bug way back when; I hadn't heard D-Link made a similar mistake! That's just so embarrassing.

0

0

0

Christophe B.

bladecoder@androiddev.social

Reply to @monsieuricon

@monsieuricon Indeed it's anecdotal and the only thing I could find on public GitHub.
I wouldn't be surprised to learn that it originates from shady Chinese phone firmwares.

0

0

0

tobigr

tobigr@floss.social

Reply to @trouble@masto.ai

@trouble @ross @monsieuricon @bladecoder As I already said, the repo linked is far from a state in which it could be used in production, let alone in a separate app.
Side note: we replaced the reference to kernel.org and now use our own domain

1

0

0

K. Ryabitsev-Prime 🍁

Reply to @tobigr@floss.social

@tobigr @trouble @ross @bladecoder Thank you, I do appreciate that! (And your tests are less likely to break this way, should we change something.)

0

0

2

Justin Derrick

JustinDerrick@mstdn.ca

Reply to @marshray@infosec.exchange

@marshray @monsieuricon Heh. Overnight bot army. Feed it a crypto mining script to help cover the costs.

0

0

0

x0

x0@dragonscave.space

Reply to @puppygirlhornypost2@transfem.social

@puppygirlhornypost2 @monsieuricon Or 1.1.1.1. You want a connectivity test? Ping the thing that boasts itself as one of the fastest DNS endpoints in the world, besides they did say loads of people were already doing exactly that.

0

0

0

Vizay Soni

vs4vijay@mastodon.zaclys.com

Reply to @monsieuricon

Few suggestions:
- we can check what's the HTTP Headers being sent, they might have some patterns
- we can look up into shodan.io search

0

0

0

Dmitry Borodaenko

dmitry@mastodon.circle.lt

Reply to @monsieuricon

@monsieuricon @djh The nice part of making these fail is that, if this isn't malicious, it might cause enough of a problem for the people who implemented it that they might be forced to fix it. This reasoning still applies if you do that in Fastly rather than on your server.

0

0

0

gim

Reply to @monsieuricon

@monsieuricon my guess is that if you'll start dropping such connections (via user agent or any other means) you'll quickly find out what it is 😈

0

0

0

mathew

mathew@universeodon.com

Reply to @rami@chaos.social

@rami @monsieuricon Or since it's likely to be mobile apps, see how much data you can send in the response.

0

0

0

Eckes

eckes@zusammenkunft.net

Reply to @monsieuricon

@monsieuricon let’s start a world wide infrastructure collapse by deadholing all those IPs :) (it’s Java btw)

0

0

0

K. Ryabitsev-Prime 🍁

Reply to

@deborahh @djh good suggestion, but with 20-odd million unique IPs, this is too much of a wumpus hunt to bother. :)

0

0

2

K. Ryabitsev-Prime 🍁

Reply to

@deborahh @djh eh, I made it not my problem by fronting that domain with Fastly. 😂

1

0

1

aismallard

aismallard@woem.space

Reply to @monsieuricon

@monsieuricon Easy way to contact them: replace the 301 redirect with a 4xx and a HTML body that says “please do not use this site as a health check”

0

0

0

Xdej

Reply to @monsieuricon

@monsieuricon
does fastly cost you something?
@djh @deborahh

0

0

0

please gently the rat

rolenthedeep@rattodon.nexus

Reply to @monsieuricon

@monsieuricon
Could you just block this type of request wholesale and wait until someone starts screaming?
@anthropy

0

0

0

Eva Mikkonen

evamik@uwu.mikkonen.com

Reply to @monsieuricon

@monsieuricon I would just make the matching user agent receive an error, that will make the message go through

0

0

0

ari@ak.ari.lt

Reply to @monsieuricon

@monsieuricon fail2ban lol

rate limit it all to 10 req a sec or so with ban time of 10 mins or so

0

0

0

Ken Whitesell

KenWhitesell@mastodon.social

Reply to @monsieuricon

@monsieuricon I find it hard to believe I'd be thinking of something you haven't already considered, but, in the off chance you haven't:

1) Do some analysis on the IP addresses causing this - are they coming from a particular geographic area? Are there any patterns, groupings, or clustering of addresses?

2) Dig deeper into the full request being made. Collect the full set of headers and not just the summary line. Does that show any identifying information?

2

0

0

Olivier Mengué

dolmen@mamot.fr

Reply to @KenWhitesell@mastodon.social

@KenWhitesell @monsieuricon https://social.kernel.org/objects/c92b8b24-95ae-4eb0-8d33-1ab9ca477e81

0

0

0

Ken Whitesell

KenWhitesell@mastodon.social

Reply to @KenWhitesell@mastodon.social

3) Are any sizable portion of these requests subsequently followed by more valid requests from the same address?

4) Is there any pattern to the timing of these requests by location?

0

0

0

About social.kernel.org

Terms of service

Please do not use this service in violation of the Linux Kernel Code of Conduct. Doing so will result in your account suspension with the referral of the matter to the CoC committee.
"Repeating"/"boosting" someone else's status on this platform will be treated as endorsement and will fall under rule #1.
You are encouraged to use this platform to promote your work on the Linux Kernel, but there is no restriction on permitted topics (with the exception of anything covered by #1 above).
There is no requirement to post in English, but it should be considered the primary language of communication on this platform.

Privacy notice

The admins of this service have access to all posted statuses. They aren't looking, but if it's something they shouldn't know about, then you should not post it on this platform.

Please see the Linux Foundation Privacy Policy, which applies to this platform as well.

Getting your own account

If you would like an account on this instance, please check that the following applies to you:

You are listed in MAINTAINERS or CREDITS
OR: You have a kernel.org account or email address
OR: You have a long and established history of involvement with the Linux Kernel

If the above is true and you agree with the Terms of Service and Privacy Notice listed above, please use these instructions to request an account:

How to request an account on social.kernel.org