Conversation
Days since an "AI found a security bug" report turned out to be totally false because the tool couldn't actually parse C code: 0

I'm seeing multiple "reports" of this type per week now for Linux. Why do people think that an LLM can somehow do better than a compiler, and why don't they even test their proposed changes to verify that they actually do anything?

{sigh}

@gregkh Some people simply lack the skill, but they'd like to add “contribution to the Linux kernel” to their CVs.
Disclaimer: I've no idea if that was actually the case here.


@gregkh I think it goes like this: Fame and fortune awaits whoever actually reports a security problem in the Linux kernel. There is no cost to the reporter in making an attempt, even if the attempt doesn't succeed. So using an LLM to generate what looks superficially like a good report means they have a chance of benefiting, and there's no downside in trying.

Same problem as with email spam, that is.


@gregkh ban early - ban often.


@gregkh so you're implying that those people actually "think" before submitting such reports...

that is very generous of you


@liw @gregkh regular spam is easier to catch and filter out, though... This is a growing kind of attack on Open Source projects, one that drains resources and is hard to combat effectively.


@liw @gregkh there is more. You are coming from a different worldview than the reporter. The reporter, in a lot of cases, *genuinely believes that these tools are super powerful*. They are the AI from the movies. It is a belief I have seen everywhere in my circles of friends. If the AI "discovers a bug", then it has to be real and exist.

Validating it does not even come to their mind, because "who am I to doubt the powerful machine?". In their mind, they are the inferior party, just the messenger.

They cannot even *imagine* that it could be that wrong, or that validating it is even possible.

@liw We assign 13 CVEs for Linux every single day. "Fame and fortune" is not something that happens for any of those reports, as a CVE is trivial to get if you actually want to just fix a kernel bug for real.
@ptesarik That's what `drivers/staging/` is for; we just took 10+ patches for that subsystem from new submitters yesterday. That's much easier to accomplish than trying to parse the output of an "AI tool" :)
@gregkh @liw doesn't need to be true; the kind of people that would do this kind of thing believe that's how it works, which is all that matters
if they had the braincells to realize it's not the case, they wouldn't be thinking of doing this to begin with

@gregkh And I don't understand why these people are submitting garbage AI reports.

What's the goal of it?


@gregkh this seems to be a very active topic right now


@gregkh you know the adage that as soon as a measure becomes a target it stops being a useful measure? I think something like that has happened with bugs and bounties


@gregkh I kind of doubt that they are capable of even testing it, or else they wouldn't use the lying machine in the first place.


@gregkh Yeah, I was trying to be funny in a sarcastic manner, again, and failed, again.


@gregkh

Full ACK.

Sad but true. 🤢


@gregkh Some people easily fall for marketing pitches.


@gregkh
We've spent billions of dollars on AI! You MUST use it, and believe its every pronouncement!


@gregkh I wonder if LLMs are going to cause more problems under authoritarian regimes, where people are conditioned to do what they're told without question. Seems like perfect conditions for modern "AI" to cause all sorts of havoc, with all of it being excusable with "the computer told me to".


@gregkh Yes, let's promote staging (again)! Sounds like a good plan to me.


@liw @gregkh I’d think the downside would be the reputational risk of being known to the maintainer(s) as the jerk who didn’t verify a vulnerability before reporting it.

If I ever thought I’d discovered a kernel vulnerability I’d be checking it over every way I knew how before submitting a report.


@gregkh Maybe "Days" should be changed to "Hours"?


@tisha @gregkh Bug bounties, usually (and I've seen a report showing that some large companies pay out often enough even when the report is bogus).

@gregkh Perhaps someone should tell that to Sasha Levin, as he applies bad patches to AUTOSEL based on LLM output? :-(.

@Di4na @liw @gregkh What I don't understand in that context is why they believe the role of messenger is at all valuable in that case. Surely if the tool is so powerful and easy to use, the maintainers would already be using it themselves?


@gregkh
To test it requires actual work?


@gregkh I suspect you've already seen this slide from @bagder but just in case, or for anyone else reading this who doesn't (yet) follow him...

https://mastodon.social/@bagder/114856434115222517


@ardaxi @Di4na @liw @gregkh this is definitely not logical, but also most people are not logical unfortunately. Same sort of thinking as someone replying to a question with "I asked ChatGPT and here's what it said".


@gregkh linters literally do their job better than a speculation machine


@winload_exe @gregkh It's almost as if linters and other tools were carefully designed to do a particular job, and thus do it well.


@gregkh This is the grotty side of AI, of course. There is a good side. Sometimes.

But mostly what I see is , and because of that I give it about as much respect as I did the bubble.

If LLMs are to be taken seriously, their act needs cleaning up!


@gregkh it’s the very beginning of a DDoS attack



@Di4na @liw @gregkh it's (hard?) work to verify/validate the bug and it needs skill ...


@ardaxi @liw @gregkh "surely, if my religion is the right one, everyone would convert on their own". Basically, they think we are little kids that have not understood the Truth yet. It is their job to lift us from our limited ways.


@gregkh Fun story... One month, it was my job to run Klocwork (static code analysis) against our own code, because somebody in management had decided it was important to fix all "vulnerabilities" that an automated tool could find. An expensive tool, mind you.
Two senior engineers and lots of build resources, for a month, and we changed hundreds of thousands of lines of code (some by script).
1/x


@gregkh After all that, the Product Manager did not want to merge it into production/main, because "too many lines of code changes".
I learned a lesson - when tasked, always ask "if I do this, will you ship it" of your Product Manager. Or just take the money to waste time...
But for fun, I ran Klocwork against the Linux kernel source (we were cross-compiling an ARM kernel and rootfs/dist of our own) and the "violations" were voluminous.
But somehow nobody was worried about that.
2/x


@gregkh I don't think of LLM-based coding "assistants" any differently - in the hands of experts, probably useful. In the hands of ignorant, lazy people seeking quick solutions, they produce dangerously untrustworthy results that nobody wants.
And a distraction from efforts that could really improve your software or service.
3/3
