Conversation

K. Ryabitsev-Prime 🍁

"For the people saying things like β€œI’m a PhD from xyz uni and I’m telling your LLMs are just stochastic tools that make everything up and the world will fall apart if you use them”, I’m here to tell you that you are out of date. The world of software engineering has changed dramatically in the last few months."
https://medium.com/@tridge60/rsync-and-outrage-d9849599e5a0

Thank you for voicing that, Tridge. Yes, I do a 100% agree that, as an industry, we're painting ourselves into a weird/dangerous corner where we're starting to depend on proprietary providers that produce code we don't have the bandwidth to fully own and understand. And yeah, pretty sure that $99 that I pay for a personal subscription is really more like $2000+ in real costs and is clearly not sustainable long-term. And yeah, all the other criticism for a technofascist utopia we're building around ourselves worries me greatly, too (but we've been doing that since long before LLMs).

However, anyone who claims that LLMs are not useful or produce universally bad results is just plain wrong, or are using dinky models. I've gone from yelling at junk produced by LLMs to going "yeah, okay, that's not bad." Sonnet is still crap, don't use it for anything serious. Anything you can produce on a local model running on your GPU is going to be unusable for anything serious beyond answering "how is babby formed" kinds of questions.

But let's move on from the "lolslop" kind of commentary. It's dumb.
9
18
25
@monsieuricon I realized late last year that the quality question is ultimately irrelevant
I still consider current LLM output to be woefully inadequate, but that's besides the point
I would be opposed to this stuff even if the output was ostensibly perfect, and they paid me to use it, and didn't burn down the planet (with a cherry on top)
the reasons why are complex and out of scope, but it's why I just stopped mentioning quality as a concern (even when it is, like in the rsync case)
0
0
1

@monsieuricon β€œIf we set aside the ethical concerns for a moment, …”

1
0
0

@monsieuricon My complaints with GenAI cannot be answered by improvements in LLMs.

I have ethical, economical, environmental, and educational qualms about them -- so even if they produce perfect output, I still have problems with adopting them as they are today.

So, yeah. I agree -- "slop" complaints are easy, but increasingly outdated and easily dismissed. If that's what GenAI opponents are leading with, in relation to code, then they're not going to succeed.

* The "slop" argument *does* work well with other GenAI uses like writing, art, etc., because it's less about the quality of output and more about the volume and dehumanization problem.

1
0
0

K. Ryabitsev-Prime 🍁

Edited yesterday
@krans Our whole industrty is ethical concerns! The cleanest, most FOSS version of android you can run is still built on code produced by offshore contractors who were greatly underpaid for their work, probably running on a device built in a factory that had bars on windows to stop people from jumping out and killing themselves.

It's the "and yet you participate in society" kind of argument.
1
0
0
@monsieuricon

> Anything you can produce on a local model running on your GPU is going to be unusable for anything serious beyond answering "how is babby formed" kinds of questions.

And that's an issue on itself, isn't it? For "it's okay" results you need big datacenters, lots of energy, half of internet of data, so if you rely on LLMs, you rely on big for-profit, that will soon request big money, with all moral and environmental issues.
1
0
0

@monsieuricon This is such a cop-out of an answer. Just because there are other things that entail ethical concerns doesn't mean you get to set aside all ethical concerns and run full-steam ahead into normalising and entrenching even more ethical problems.

@krans

1
0
0

@jzb @monsieuricon I have to say here that I also don't think it's particularly true that "actually LLMs are good at code now".

Advocates have said this repeatedly over the last several years, for pretty much every new model, and the results have always been underwhelming. LLMs are good at code now if you vastly lower your standards of code quality.

[Edit to add: I do think that in all those other fields you list, there are people willing to argue that AI tools are actually good, even when it's obviously slop to the rest of us.]

On top of that, though, just to agree with the other issues you raise: *even if* LLMs were as good as advocates claim, it wouldn't justify their use.

1
0
0
@tiredbun Yes, for sure -- kinda like when you need to send $10 to your coworker for lunch. You will be using a hugely corrupt banking industry that is consuming huge amounts of energy for sustaining itself and corrupting more institutions around the world.
1
0
0
@benjamineskola @krans The moment I get up in the morning and start work, I have to overcome the ethical concern that I'm just sitting in a cozy apartment while someone somewhere is getting bombed -- probably even someone I know -- and I'm doing nothing to help them. I pick battles where I can do most good with what I have.
1
0
0

@monsieuricon That's the classic "I'm an expert, I can handle this" response. And I agree that the only ethical use of LLMs is if you're already an expert on the topic you're using it for in order to spot LLM mistakes.

But it's human nature to have a deeply wrong self-image regarding one's own fields of expertise. (Even before LLMs, it was a common sight to see retired professors write crank research-style papers about topics they have no idea about, but insist to be taken seriously because of their career and academic title.)

LLM as a tool is primed to feed the user wrong answers and give them a false sense of knowledge where the user is incapable to detect if the plausibly-correct-looking answer is actually correct.

And an enticing, literally addicitive tool that inserts obscure bugs into large codebases that you as a software developer are incapable to detect due to hubris, human nature and employee productivity rewarding KPIs like LOC is definitely a problem for our craft.

2
1
0

@monsieuricon This is not actually an answer to my point, though; it's just repeating the same bad point as before.

Just because some ethical problems cannot be solved by you or me within the course of our everyday work, does not constitute justification to give up on ethics and adopt even more unethical ways of operating.

@krans

1
0
0
@hzulla
> And an enticing, literally addicitive tool that inserts obscure bugs into large codebases that you as a software developer are incapable to detect due to hubris, human nature and employee productivity rewarding KPIs like LOC is definitely a problem for our craft.

In other words, exactly like a team of contractors.
1
0
0
@benjamineskola @krans I believe we're now rehashing the arguments from the last season of "Good Place."
1
0
0

@monsieuricon While I appreciate the snark, I'm sceptical about the addictive potential of a team of contractors.

1
0
0

@monsieuricon I get it. You want to keep doing the unethical thing and you feel uncomfortable being challenged on it, which is why you keep repeating the same non-answers to everyone who calls you out on it.

@krans

1
0
1
@hzulla Dude, it's *exactly* the same. Having a team of contractors is precisely the "I need this fixed ASAP and I'm too highly paid to deal with this myself" kind of energy you get from typing in "hey, I'm getting this crash when running my code, can you fix it" into a Claude prompt.
1
0
2
@benjamineskola @krans I'm sorry I don't pass the purity test you were hoping to hold me up to.
1
0
0

@monsieuricon Again, your argument boils down to 'some other people are more unethical therefore I can do whatever unethical thing I want'.

It's an opinion, certainly. But don't try to pretend this is about not being able to meet unreasonable standards; it's about trying to pretend that any ethical standard at all is unreasonable.

@krans

0
0
0

@monsieuricon what you describe may be the motivation for a subset of users of LLM coding assistants, but there's a wide range of users who don't fit that description. (See the bcachefs incident as an example.)

1
0
1
@hzulla No argument there, but it's also not exactly new or unique. There are whole swaths of the gaming industry where addiction and preying on addiction is rampant, from microtransactions to weird Fallout companion mods.
1
0
0

@monsieuricon This "I'm an expert, I can handle this and use an LLM for good" argument is also in direct contradiction to the literal sales pitch of generative AI products that promise its users "a PHD in your pocket". They WANT you to use the product outside of your field of expertise and they also tell middle management that senior staff's in-house-expertise is overrated and can now be replaced with tokenmaxxing.

1
0
1
@hzulla True, but also not very different from the old "you don't need full time employees -- our team of contractors will get it done for a fraction of a cost"
1
0
0
@monsieuricon

That fact doesn't mean we need another banking industry we can't replace, when it should be still avoidable.

I really want to stop programmers from making world depend on another torment nexus, at least by asking themselves - do you want to depend on very expensive (once they'll need to actually make money) and impossible to replace (except by either removing dependency or by switching to another with identical issues) service?
1
0
0

@monsieuricon I see that you have made up your mind, but no, I would argue that it is indeed very different. If it were not, people wouldn't have the visceral reaction of disgust to the net negative of what operating and using LLM products is doing to business, arts, politics and society as a whole.

1
0
0
@tiredbun Yes, for sure, depending on it is very bad. I worry that our immediate future will be filled with footage of black smoke billowing out from datacentres, as political stability in the world deteriorates further.
0
0
0
@hzulla Yes, we're in the "cheerfully promote cocaine as your key ingredient" stage of things. We do this with every new thing we discover, largely because we are who we are as a species and because our predominant economic model encourages it.
1
0
0

@monsieuricon so you see all of the ethical and moral concerns, like the hundreds of thousands of engineers put out of work and the centralization of all software to the hands of a few billionaires with enough GPUs to write it, and you don't care because "it writes good code sometimes"?

1
0
0
@lhk Yes, that summarizes it pretty much.

PS: but thank you for removing all nuance nonetheless.
0
1
1

@monsieuricon I still think it's different than your cheerful view.

LLM proponents are basically saying "I'm a doctor, opioids are a helpful substance in medical practice when handled properly", while critics are saying that the corporate behaviour, marketing and politics around opioids are outrageous deliberate misbehaviour and have devastating effects on society. We're both not wrong.

With current AI products, we're talking about an industry that is intentionally financing fascism, increases fossil fuel use and willingly accepts the death of vulnerable people to sell their LLM product, which they then freely help their customers to use in Torment Nexus scenarios such as deportation police using face ID on citizens or automated weapons targeting in war.

It irks me to consider the downfall of the society I live in as an acceptable side effect for having a slightly better code completion assistant in my IDE.

1
0
0
@hzulla Okay, it's not quite fair to compare LLMs to face-id or auto-targeting computers (they are not gen-ai, they are matching/recognition AI, which existed long before LLMs).
1
0
0

Jarkko Sakkinen

Edited yesterday
@monsieuricon I wrote a longer take elsewhere, which is IMHO somewhat balanced point of view :-) Or at least I feel that I've found myself the right balance how I see the world from here forward ...

"I think the strategical challenge for companies and workplaces is actually when not to use AI i.e., balancing between human labour and automation.

ATM, the balance is completely broken, which is not shocking because new tech trends cause some mayhem and chaos.

My experiences on this topic are:

1. Linux kernel. I just write patches by handle like always. The screenshot shows what happens if I try to use AI with Linux kernel. When you know one project really well, it might be also easier to just write code. Otherwise it's like "C-to-English-to-C-WTF-#%%##%-C-AGAIN!?-..." generator, and the only outcome is what can be seen in the screenshot.

2. When using codegen for anything, have at least a single unplugged day in a week. It really can keep slop under control, and it helps to keep your AI use in those four other days simple, effective and productive :-)

3. Despite having all that knowledge and deduction even the strongest frontier model is just a fancy non-modifiable database with a hardcoded program run by inference at the data center. I strongly believe that the real benefits come only when a really good lawyer uses AI for "law stuff", developer for "developer stuff" and a scientist "sciency stuff".

Maximalists, shitfluencers and that type of scum and scammers have tried to sold the world the idea of human labour being obsolete and only an expense but really it's the only fuel that makes thing actually work.

One for the humankind, cheers 🍻"
1
0
2

@monsieuricon there are reports that LLMs were used by DOGE to create the list of grants to be cut, with an expected long term death toll in the hundred thousands. There are other reports about chatbots being used within the military during the current war.

1
0
0

@monsieuricon Don't listen to me, listen to Pete about this. He's specifically talking about generative AI / LLMs there:

0
0
0

@hzulla @monsieuricon crank style professors out of their field=the so called thought leaders of antivax and transphobia

1
0
0

Julian Andres Klode πŸ³οΈβ€πŸŒˆ

@monsieuricon wth I found Sonnet to be quite excellent. Miles ahead of Gemini 3 Pro and perhaps lacking a bit vs Opus

0
0
0

@monsieuricon Computer programming is a social activity. Without tools to thoroughly analyze LLM-generated code FLOSS becomes no better than proprietary black boxes. Users can only trust their β€žvendors”. Take for example the recent Bun rewrite. How is the community supposed to review a million lines of code? How to determine no additional malicious functionality made its way into the codebase?

This isn't a technical issue, it is a social one. LLMs are bad for the community.

0
0
0

@benjamineskola @jzb @monsieuricon I also think there's more to examine about the "vulnerability discovery" portion of this. Having now read quite a few such reports, I've seen some okayish results*, a lot of pointless reports, more than a few that are flat out wrong, and a bizarre number of duplicates that say they're duplicates. Which is to say that the quality there also seems, to me, to be wildly exaggerated. When the man said the new version would generate 10x the number of reports, he was unfortunately exactly correct.

But there's another aspect to that as well: as a maintainer, I'm effectively forced to use the LLM even if I don't want to, because that's the vulnerability scan I get, and I can't just ignore the results. I have to actually examine how underwhelming they are. It's one more way this supposed time saving tool is making more work, rather than less. Beyond that, it's also a very bad model of consent. It's 2026, and relatively unskilled would-be attackers get to choose what tools I use.

At least I don't have to use it to write code. Yet.

(* tbf: a few of these would have been serious if not for other circumstances intervening. I think I'm up to four.)

1
0
0

@vathpela @jzb @monsieuricon A point I keep coming back to is, yes, LLMs have been used to identify a number of significant vulnerabilities. But they've also cost a lot in doing so.

If the same amount of money (millions!) had been spent on 'traditional' security research, how many vulnerabilities would have been found? Probably significantly more. The calculation I saw recently suggested that even the much-hyped Mythos isn't actually particularly successful.

Part of why it's been so high-profile is that we've only seen the results of successful runs, and not the many more failures; it's survivorship bias. Being actually more directly exposed to the output of these things, as you say, means sorting through the garbage to identify the potentially relevant outputs.

It's basically just brute-forcing. We've had ways of doing that for decades, it's just that nobody has bothered doing so because it hasn't had any hype attached to it.

My own personal experience of the 'code reviews' they generate has been, like you, underwhelming; usually a lot of irrelevant nonsense and a few minor issues with their severity significantly exaggerated. Ideally I'd never have to see one again; unfortunately, I don't get given that choice, because I work with people who take it personally when I criticise their best friend Claude.

2
0
0

@benjamineskola @jzb @monsieuricon yeah, I've noticed that in the news articles about mathematical proofs as well - all of them sound like using the LLM was the first attempt to automate the search, and it seems that automating it a traditional way would have taken less time and resources. But the availability of the resources to automate it were there because the product wasn't the proof, it was an LLM marketing demo.

0
0
0

@vathpela @jzb @monsieuricon Oh, and the other way of qualifying the 'not useful' / 'universally bad' thing is that useless/bad is not a binary. It's a tradeoff against cost.

So yes, did Claude successfully identify an incorrect function in our work codebase a couple of months back? Yeah, sure. Was it worth the cost? Absolutely not. It flagged something up as a security risk that turned out to be nothing of the sort, just a leftover method that could never be used and couldn't cause any actual harm (would just lead to an error, no data loss or unauthorised access).

And so for the cost of actually running the analysis, and then for the cost of me having to dig through its pages of garbage to figure out whether there was anything of value there? (The 'report' included a bunch of other irrelevant 'issues' and conflated minor refactoring suggestions with security issues and exaggerated the significance of both.) Not worth it.

And that is before you even get onto the externalities: all the other costs that are actually borne by someone other than the user; the environmental damage and other ethical issues and the way they're just generally making the world a worse place, all so that a handful of people can avoid doing their jobs properly.

0
0
0

Jarkko Sakkinen

Edited 23 hours ago
BTW, ultimate and categorical hate is also somewhat destructive angle to look at world.

I've seen this in work life, as some really good system architects etc are like shamed if they even touch to AI tools, and then some people who really should be forbidden to ever touch the code use them extensively i.e., ones with football stadium size ego and incredible self-deception skills ;-)

This dilation in workforce is in my opinion an enormous risk for companies and for most part it is not even recognized. It can lead e.g., to fatal mistakes in HR cuts. AI centered ecosystem is from hell. Human centered ecosystem where experts can pick what they want including AI tools without having to feel bad about it is what works and is shown to work over the course of history.

Most companies are also seemingly forgotten the existence of users and throw user experience under the bus in favor of developer experience designed for middle managers.
0
0
0