Conversation

Jarkko Sakkinen

Edited 20 hours ago
The reason I've been making now so much AI noise is the realiziation that

1. I have bad vibes only ;-)
2. It is probably better to start taking baby steps right now with sec.
3. Got postulated that guardrails for malicious stochastic actions should be based on algorithm, not AI.

There's a lot of popular lore of some kind of guardian LLMs that overwatch frontier model but the problem is the introduction infinite recursion of distrust. All LLMs have the same underlying problem.
1
0
1

@jarkko it's market manipulation at this point.

1
0
1
@pinkforest In my experience and based on some ad-hoc random tests I've created for the models, the very latest of frontier model have a lot of power yes but they easily also emit behavior that appear as backstabbing .

Models shortcut tasks all the time: they great on taking an optimal path of actions, which does not necessary mean efficiency all. The very latest models seem to be more focused on finding interpretions of a task decription that result the minimum amount of tokens burnt.

So to summarize that I think the latest versions are worse than previous and it comes down to limitations of LLM architecture. I.e. they kind of get better but the improvements are not the welcome ones. mathematically latest do better :-)

It's interesting how AI native minions who think that they will take over the world have now started to push good old waterfall model and "spec driven development", which good old waterfall from the 50s. They think they are improving the process while they are actually dynamically reacting to model quirks.

The irony here is that given these properties you actually should have really good staff of human engineers for balance-and-check more so with e.g., Opus 4.7 than Qwen 3.6 27B. The latter does what asked and can do it really effectively if you know what you are doing. I.e. also in local model side it is skills and creativity (and great salary) that really works.
1
1
2
Edited 12 hours ago

@jarkko hahahaha like customer knows the spec... that is like the best joke ever. Boeing infamously outsourced the development to where they had to do spec for the sensors thing and they never really treated it as their core product taking it seriously. It almost destroyed that company. WIll be same story with AI if they think they can just do pure spec driven non-development where supposedly non experts can do functional systems if they get only the specification supposedly - chicken egg here how do non-domain experts know spec in first place 🤦‍♀️

1
0
0
@pinkforest yeah so i have not really followed what nokia has done :-) i heard that they have something going on with nvidia. it did not come as suprise because NVIDIA has quite strong R&D presence in Helsinki. E.g. NVIDIA RTX technology was engineered in Helsinki.
1
0
0
@pinkforest I try to not be in a camp because either way I get "under the influence". I measure,test and try to think what it means what I see. Yeah, and generally try to avoid making any fast conclusons :-) I'm not pressured to use them and I do have a stable job, so I thought it is good position make more serious security and threat analysis on LLMs (i.e. as an actor in a threat scenario, not scanning vulns using LLMs).
1
0
0
@pinkforest Engineering-wise having snapshots of "all of your base" does not work. Brains have all of already dead Internet. This is what I think I think ...

I think world model based AI could potentially be better and more co-operative approach to AI. Then, AI does not know "everything" (from the past that no longer exist) but instead can do useful stuff like self-drive cars. Compute budget is of course taken from language side given that physical resources have their limits. And stuff that John Carmack is doing is interesting and I follow that a lot. And it addition to I do like the thinking of Yann LeCun and congnitive scientist Gary Marcus. Those three are my top tier in this domain in the positive sense of the word.
1
0
0
@pinkforest What industry does ATM is horrible to watch and that part I don't like at all. There is no such thing as AI skills.
0
0
0