social.kernel.org

K. Ryabitsev 🍁

@monsieuricon Moderator

Posts

1886

Following

223

Followers

2370

Director of Linux Foundation IT. Currently in charge of kernel.org infra.

This account is for Linux/Kernel/FOSS topics in general: #linux, #kernel, #foss, #git, #sysadmin, #infrastructure.

For my personal account, please follow @monsieuricon@castoranxieux.ca.

Montréal, Québec, Canada 🇨🇦🇺🇦

K. Ryabitsev 🍁

monsieuricon

1 month ago

I'm aware of Anubis and I'm afraid proof-of-work intermediaries are going to become the only way to deal with bots.

However, I don't like Anubis's general approach. I would prefer to have something built into varnish with some more logic that allows for more nuance. If there is a local cached page, allow the request. If there isn't, but the load/RAM usage is low, let the request through. If the load is high or if we're seeing lots of 503's, only then require proof-of-work.

K. Ryabitsev 🍁

repeated

nixCraft 🐧

nixCraft@mastodon.social

1 month ago

FOSS infrastructure is under attack by AI companies https://thelibre.news/foss-infrastructure-is-under-attack-by-ai-companies/ Please boost for awareness, reach and to public shame Microsoft, Meta, OpenAI, Perplexity and other such AI companies.

#opensource #programming #Developers

K. Ryabitsev 🍁

monsieuricon

1 month ago

In good news, I figured out what needed to happen so we don't share the same /64 with all other Linode systems in the same datacentre, which gets @spamhaus off our back.

K. Ryabitsev 🍁

monsieuricon

1 month ago

Reply to @monsieuricon

That's why I keep this server: for bitching about life and multilingual dad jokes that only 1-2 people following me would get.

K. Ryabitsev 🍁

monsieuricon

1 month ago

All Norwegian birds look fugl-y.

K. Ryabitsev 🍁

monsieuricon

1 month ago

Reply to @esgariot@mstdn.social

@esgariot @rails Yes, it would work, but would it be acceptable trade-off? That's not clear. Right now, I'm leaning towards setting up separate, authentication-required duplicates for some services that I can give to maintainers and developers, but that, again, is capitulating and admitting that the open web has failed.

K. Ryabitsev 🍁

monsieuricon

1 month ago

Reply to @rails@fosstodon.org

@rails There is not. There is, in fact, no reliable way to identify legitimate requests from bot traffic if you're only looking at logs or packets. The only way to reliably tell is by getting yourself into the page rendering client. E.g. this is what happens when you get CloudFlare's "prove you're not a bot" screen -- they use javascript to collect information about your browser and to watch the pointer behaviour to figure out if you're a bot or not (plus, massive amounts of data they have internally on your IP address).

K. Ryabitsev 🍁

monsieuricon

1 month ago

Reply to @mariusor@metalhead.club

Edited 1 month ago

@mariusor Everything that used to work no longer does. 🤷 First, we rate-limited by IP, but they switched to using public cloud farms. Next, we banned based on user-agent, but they started using a generic user-agent. Then, we started banning on "the same" user agent per number of requests, but that never really worked very well, and they switched to varied user-agents. Next, we started banning whole subnets and ASNs, but they switched to using residential IPs. This is where we are now -- bots descend on your public resource from tens of thousands of IPs from all over the world, with reasonably recent, varied user-agents, with any one IP sending no more than 1-2 requests. It's clearly all bot traffic, because there's clearly nobody who is going to be suddenly interested in random commits from 5 years ago, or in random conversations on linux-fsdevel from 9 years ago, but it's impossible to turn this logic into a reliable "no, you are a bot, go away" action without turning to fronting services or various anti-bot captchas.

K. Ryabitsev 🍁

monsieuricon

1 month ago

Reply to @algernon@come-from.mad-scientist.club

@algernon The gist of the problem is that it is impossible to identify "known bots." Yeah, there's a subset of requests that clearly identify themselves as "LLMWnatnotBot 1.x", but if you read Drew's article, the vast majority of traffic is one-two requests from random IPs with generic browser user-agents. There is no reliable way of telling them apart from legitimate requests. The only viable solution is to put everything behind CloudFlare or Fastly or Akamai and let them protect you against bot traffic, but *that is not a win*. That's capitulating and admitting that the open web has failed.

K. Ryabitsev 🍁

monsieuricon

1 month ago

Reply to @monsieuricon

FYI, Drew isn't making it up in this article. At any given time, if you check what I'm doing, chances are I'm trying to figure out ways to deal with bots.

https://drewdevault.com/2025/03/17/2025-03-17-Stop-externalizing-your-costs-on-me.html

K. Ryabitsev 🍁

monsieuricon

1 month ago

I know I haven't been able to work on b4 and other tooling as much as I was hoping, but between the Equinix exodus, having to continuously mitigate against LLM bot DDoS'ing our infra, and just general geopolitical sh*t that lives rent-free in my head... it's been difficult. But I have high hopes and lots of good ideas -- that's got to count for something, right?

K. Ryabitsev 🍁

repeated

LWN.net

LWN@fosstodon.org

1 month ago

Supply Chain Attacks on Linux distributions (Fenrisk)

https://lwn.net/Articles/1014741/ #LWN

K. Ryabitsev 🍁

repeated

buherator

buherator@infosec.place

1 month ago

Please stop externalizing your costs directly into my face

https://drewdevault.com/2025/03/17/2025-03-17-Stop-externalizing-your-costs-on-me.html

"Whether it’s cryptocurrency scammers mining with FOSS compute resources or Google engineers too lazy to design their software properly or Silicon Valley ripping off all the data they can get their hands on at everyone else’s expense… I am sick and tired of having all of these costs externalized directly into my fucking face. Do something productive for society or get the hell away from my servers"

K. Ryabitsev 🍁

monsieuricon

1 month ago

Reply to @swapgs@infosec.exchange

@swapgs This talk may help -- it's about things we've thought about. https://www.youtube.com/watch?v=K3SVt1WCheY

K. Ryabitsev 🍁

monsieuricon

1 month ago

Reply to @swapgs@infosec.exchange

@swapgs @1ns0mn1h4ck Free infra assessment? Yes please. Just give me a heads-up first. :)

K. Ryabitsev 🍁

monsieuricon

1 month ago

Donald Trump proudly demonstrates all the nothingburgers he got from Russia during his call.

K. Ryabitsev 🍁

repeated

tante

tante@tldr.nettime.org

1 month ago

LLM crawlers are aggressively destroying important community infrastructures but sadly there is not an easy fix. Still: Blocking those crawlers should be high on your list of todos

(Original title: LLM crawlers continue to DDoS SourceHut)

https://status.sr.ht/issues/2025-03-17-git.sr.ht-llms/

K. Ryabitsev 🍁

repeated

SherBeareth

SherBeareth@mastodon.world

1 month ago

#HolyFvck #ThisTooShallPass

K. Ryabitsev 🍁

monsieuricon

1 month ago

Other than FSF Europe, what other free software nonprofits are there that I can send people to if they don't want to contribute to a US-based entity?

K. Ryabitsev 🍁

repeated

MostlyHarmless

MostlyHarmless@thecanadian.social

1 month ago

Show older

About social.kernel.org

Terms of service

Please do not use this service in violation of the Linux Kernel Code of Conduct. Doing so will result in your account suspension with the referral of the matter to the CoC committee.
"Repeating"/"boosting" someone else's status on this platform will be treated as endorsement and will fall under rule #1.
You are encouraged to use this platform to promote your work on the Linux Kernel, but there is no restriction on permitted topics (with the exception of anything covered by #1 above).
There is no requirement to post in English, but it should be considered the primary language of communication on this platform.

Privacy notice

The admins of this service have access to all posted statuses. They aren't looking, but if it's something they shouldn't know about, then you should not post it on this platform.

Please see the Linux Foundation Privacy Policy, which applies to this platform as well.

Getting your own account

If you would like an account on this instance, please check that the following applies to you:

You are listed in MAINTAINERS or CREDITS
OR: You have a kernel.org account or email address
OR: You have a long and established history of involvement with the Linux Kernel

If the above is true and you agree with the Terms of Service and Privacy Notice listed above, please use these instructions to request an account:

How to request an account on social.kernel.org