social.kernel.org

Conversation

ben 🇵🇸 ui

ben@m.benui.ca

1 year ago

Reply to

I'm requesting that my questions and answers be permanently deleted under GDPR.

ben 🇵🇸 ui

ben@m.benui.ca

1 year ago

Reply to @ben@m.benui.ca

It's just a reminder that anything you post on any of these platforms can and will be used for profit. It's just a matter of time until all your messages on Discord, Twitter etc. are scraped, fed into a model and sold back to you.

skategoat 🐐 🇵🇸

felipe@social.treehouse.systems

1 year ago

Reply to @ben@m.benui.ca

@ben (and unfortunately the fediverse)

Personne

per_sonne@ciberlandia.pt

1 year ago

Reply to @ben@m.benui.ca

@ben Feels like the Enclosures (Tragedy of the Commons).

Mighty Orbot

mighty_orbot@retro.pizza

1 year ago

Reply to @ben@m.benui.ca

@ben Stack Overflow has already been monetizing your answers with ads for years. If “used for profit” is your main complaint, you’re a little late.

AndrewFelix 🐀🏴‍☠️ 🇵🇸

andrewfelix@mastodon.social

1 year ago

Reply to @mighty_orbot@retro.pizza

@mighty_orbot @ben @mighty_orbot @ben The argument isn't about profit, which is pretty clearly outlined. OpenAI's explicit and ultimate intent is to replace people and in the meantime it's spitting out garbage information.

Bornach

bornach@masto.ai

1 year ago

Reply to @andrewfelix@mastodon.social

@andrewfelix @mighty_orbot @ben
And their software is laundering the original source of the information from which their AI training data was derived. Doesn't the original author deserve some credit for when ChatGPT regurgitates a lossy paraphrasing of a post scraped from the Internet?

AndrewFelix 🐀🏴‍☠️ 🇵🇸

andrewfelix@mastodon.social

1 year ago

Reply to @bornach@masto.ai

@bornach @mighty_orbot @ben 💯

Pavel Machek

pavel

1 year ago

Reply to

@ben Play stupid games, win stupid prices. Why does everyone believe that sabotaging LLM development is cool?

vbabka

1 year ago

Reply to @pavel

@pavel @ben it's not? :(

Pavel Machek

pavel

1 year ago

Reply to @vbabka

@vbabka @ben Its not. Using LLM to answer questions might not be good idea, but they should work rather well at translations, including translations between programming languates.

vbabka

1 year ago

Reply to @pavel

@pavel @ben translations are fine but not so sure about the programming languages part. Also, disagreement about using one's own content (created before LLMs took off) for LLM training is not the same thing as sabotaging, IMHO.

ljs

1 year ago

Reply to @vbabka

@vbabka @pavel @ben hint: LLMs have no understanding of anything, so absolutely aren't suited to programming since they'll hallucinate in (often) subtle ways that fits the syntax and people are notoriously bad at picking up on it.

Also they still work without credit/license etc. The fact they appear to work for a lot of programming situations makes it even more dangerous.

It'd be one thing if people were just using them but acknowledging their limitations, it's quite another in a world where people openly lie about their capabilities.

Totally and completely appropriate to not want your work part of it.

Aral Balkan

aral@mastodon.ar.al

1 year ago

Reply to

@ben They’re not yours, they’re theirs. Jeff Atwood thanks you for your free labour. (I’m kidding, he doesn’t. Feel grateful he even allowed you to contribute in the first place, serf.)

Speaking of Jeff Atwood, isn’t he the guy helping fund Mastodon now? 🤔

#SiliconValley #PeopleFarming #JeffAtwood #surveillance #capitalism #AllYourDataAreBelongToUs

Erik Uden 🍑

ErikUden@mastodon.de

1 year ago

Reply to @ben@m.benui.ca

@ben please do, you're awesome! ❤️

Orb 2069

Orb2069@mastodon.online

1 year ago

Reply to @felipe@social.treehouse.systems

@felipe @ben
Particularly your carefully crafted ALT tags.

datarama

datarama@hachyderm.io

1 year ago

Reply to @Orb2069@mastodon.online

@Orb2069 @felipe @ben This is something I've thought about ever since I started here. It's great that people here take their time to make the web better for disabled people.

But unfortunately, high-quality image descriptions are a gift to AI companies training text-to-image models. There is no act of altruism these assholes will not exploit.

Bornach

bornach@fosstodon.org

1 year ago

Reply to @datarama@hachyderm.io

@datarama @Orb2069 @felipe @ben

Maybe I should add extra stuff to ALT text that would be confusing to AI but amusing to the reader. I'm thinking along the lines of XKCD -- can't imagine how a generative AI trained only on xkcd Alt tags would respond to prompting

Phosphenes

Phosphenes@glasgow.social

1 year ago

Reply to @datarama@hachyderm.io

@datarama @Orb2069 @felipe @ben

In a perfect world, AI could be used to describe images to vision impaired people.
The real wrong isn't the AI itself, but that its owners use it only for selfish gains.

Kind of like GMOs, we could use them to feed more people for less but Monsanto only uses them to gouge farmers.

betalars @ GAMESCOM

betalars@chaos.social

1 year ago

Reply to @bornach@fosstodon.org

@bornach No you should not. This unfortunately is really a situation where doing the right thing is self-sabotage.

You can poinson the image using nightshade, but I would not count on it's effectiveness.

Orb 2069

Orb2069@mastodon.online

1 year ago

Reply to @bornach@fosstodon.org

Edited 1 year ago

@bornach @datarama @felipe @ben

... That's what I do. Wax poetic or oblique instead of just flatly describing the depicted.

As far as moral obligations, this is social media, not a debfibulator pack interface. Nobody is going to die because they can't tell what your cat is doing in the picture..

Orb 2069

Orb2069@mastodon.online

1 year ago

Reply to @Phosphenes@glasgow.social

Edited 1 year ago

@Phosphenes @datarama @felipe @ben
In a perfect world, AI wouldn't "hallucinate" (PR spin/flavor on just being wrong ), and might be useful for that sort of thing.

(Btw: Meta already does this, but their alt tags consist of something like " Image may contain <object>, <text>, <object>" - the data exists because they have to run image analysis for automated moderation anyways - they surface it because it satisfies ADA requirements )

Phosphenes

Phosphenes@glasgow.social

1 year ago

Reply to @Orb2069@mastodon.online

@Orb2069 @datarama @felipe @ben

Sounds like Meta is on the right track.

Orb 2069

Orb2069@mastodon.online

1 year ago

Reply to @Phosphenes@glasgow.social

@Phosphenes @datarama @felipe @ben
Eh? I mean, it's not super useful. Forex: "image contains woman, cat, salad, <badly ocr'd text>"
https://amp.ebaumsworld.com/pictures/woman-yelling-at-cat-memes/86009016/

ClickyMcTicker

ClickyMcTicker@hachyderm.io

1 year ago

Reply to @Orb2069@mastodon.online

@Orb2069 @Phosphenes @datarama @felipe @ben If the OCR worked better, I’d be able to (probably) tell exactly what that is. Woman/cat/salad with two lines of text is absolutely the Woman Yelling At Cat meme format. It would require some prior knowledge though.
On the flip side, you’d think they would run images through a reverse image search and tag hits on meme templates. I get hits which for it which have the title of the meme in text

Pavel Machek

pavel

1 year ago

Reply to @ljs

@ljs @vbabka @ben Hint: try it. It saved work for me.

ljs

1 year ago

Reply to @pavel

@pavel @ben @vbabka sigh you're disappointing me man.

But like all LLM proponents (just like all crypto guys I spoke to before, just like all anti vax guys I spoke to before, just like all [insert religious-style belief] proponents I spoke to before) you won't actually rebut what I say, you'll just assume that 'I don't get it' on some level.

I have tried LLMs dude, thanks for patronising me by assuming I haven't.

Unfollow.

13xforever

13xforever@social.treehouse.systems

1 year ago

Reply to

@ben I mean

user contributions licensed under CC BY-SA

I'm not a lawyer, but I don't think you can do anything about it, they're technically hosting a copy of your content with attribution to you, which doesn't make you an owner of the data, in particular this clause:

Adapt — remix, transform, and build upon the material for any purpose, even commercially.

gives them right to fuck their userbase in the ass by using the data in other services

ben 🇵🇸 ui

ben@m.benui.ca

1 year ago

Reply to @13xforever@social.treehouse.systems

@13xforever They're then selling that data to OpenAI which does not abide by this license. I'm not getting attribution there, and they're not licensing it as CC-SA-BY whch is required.

https://m.benui.ca/@ben/112401140834395509

M.O.M.O.

momo@mk.absturztau.be

1 year ago

Reply to

@ben@m.benui.ca Chaotic evil: send in an anti-circumvention DMCA notice for each question. Those have no process for disputing, so they will probably just delete your content and ban you, because it is easier.

Glowing Cat of the Nuclear Wastelands ☣

deathkitten@ibe.social

1 year ago

Reply to

@ben@m.benui.ca The enshittification will continue until the morale improves.

ben 🇵🇸 ui

ben@m.benui.ca

1 year ago

Reply to @ben@m.benui.ca

Thank you for the replies. As someone pointed out, anything posted on Stack Overflow is covered by CC BY-SA 4.0.

Under this license all usage must attribute the author and must have a similar license. Neither of which OpenAI fulfills.

Sue is Writing Solarpunk 🌞🌱

susankayequinn@wandering.shop

1 year ago

Reply to @ben@m.benui.ca

@ben "AI is a lying machine made out of crimes."
https://www.tiktok.com/@alex_falcone/video/7366006020352642347

also lunya now

7331@mastodon.de

1 year ago

Reply to @ben@m.benui.ca

Edited 1 year ago

@ben i haven't read their tos but are you sure that it doesn't include licensing whatever you say to stackoverflow? the last paragraph of the page you shared seems to allude to that
i mean, it's still immoral as heck but i guess that's one of the reasons we're all here instead of on a centralized content farm

Proto

protowlf@mastodon.gamedev.place

1 year ago

Reply to @ben@m.benui.ca

@ben if only there were a word for taking things you don't own. 🤔

Gosh it would make talking about gen AI easier if we had a word for that. 🤔

➴➴➴Æ🜔Ɲ.Ƈꭚ⍴𝔥єɼ👩🏻‍💻

AeonCypher@lgbtqia.space

1 year ago

Reply to @datarama@hachyderm.io

@datarama @Orb2069 @felipe @ben

It's much better if people stop trying to fight the AI companies, and focused on making AI available for everyone.

Orb 2069

Orb2069@mastodon.online

1 year ago

Reply to @AeonCypher@lgbtqia.space

@AeonCypher @datarama @felipe @ben

Please, mr. Reply guy, tell me about the inevitability of AI.

When you're done, explain to me how you reliably achieve +95% accuracy on k-fold validation without undetectable overfitting - my prof never could provide a simple answer, and 1-out-of-20 seems like really not good odds for a new god.

➴➴➴Æ🜔Ɲ.Ƈꭚ⍴𝔥єɼ👩🏻‍💻

AeonCypher@lgbtqia.space

1 year ago

Reply to @Orb2069@mastodon.online

@Orb2069 @datarama @felipe @ben

What a strange non-sequitor.
I wonder if you're actually trying to understand something, or if I should simply block you.

ben 🇵🇸 ui

ben@m.benui.ca

1 year ago

Reply to @ben@m.benui.ca

Also that CC claims that training an AI on data is "fair use". So fuck Creative Commons I guess.
https://creativecommons.org/2023/02/17/fair-use-training-generative-ai/

Kevin Karhan

kkarhan@infosec.space

1 year ago

Reply to @ben@m.benui.ca

@ben #Funfact: all "#learning" is #FairUse, otherwise you'd be a perpetual #DebtPeon to ]whoever made your schoolbooks and created whatever media you ever consumed](
http://felixreda.eu/2021/07/github-copilot-is-not-infringing-your-copyright/ ) !

Notwidthstanding the fact that #WastefulComputing for wannabe "#AI" is just marginally less bad than #Shitcoin-#Cryptocurrencies...

agersant

agersant@mastodon.gamedev.place

1 year ago

Reply to @ben@m.benui.ca

Edited 1 year ago

@ben I'm longing for a new set of free (as in beer) software and creative licenses that prevent all this garbage.

I put my software out there so other people can use it, I'm even ok if they make money out of it. But I'm not ok with my work being swallowed by a big machine so that people can print money without even knowing it exists at all.

Tim Clevenger

timjclevenger@infosec.exchange

1 year ago

Reply to @ben@m.benui.ca

@ben @pluralistic

Pavel Machek

pavel

1 year ago

Reply to @ljs

@ljs @ben @vbabka Well, your arguments were a bit disappointing, too. LMs are useful for trivial tasks, and for easy tasks where you can verify the result. I do both kinds of tasks from time to time.

ljs

1 year ago

Reply to @pavel

@pavel @ben @vbabka the ones so disappointing you entirely ignored them (because I guess it's beneath you to rebut them) and just said 'try it' as if I hadn't?

LLMs have uses, I disagree with their use for tasks like programming for the reasons previously stated that you ignored so not going to repeat.

ljs

1 year ago

Reply to @Orb2069@mastodon.online

Edited 1 year ago

@Orb2069 @AeonCypher @datarama @felipe @ben just trust them, everything will work in the next version!

Don't worry people who stand to make 100's of billions of dollars like Sam Altman say LLMs and deep learning can do things they emphatically cannot because they're just like altruistic or something.

Petr Tesarik

ptesarik@fosstodon.org

1 year ago

Reply to @ljs

@ljs @ben @pavel @vbabka LLMs often turn one type of work (create) into another type of work (review), consuming lots of energy in the process. For some people, it may be worth it (although if they had to pay the full costs of LLMs, humans might still be cheaper).

ljs

1 year ago

Reply to @ptesarik@fosstodon.org

Edited 1 year ago

@ptesarik @ben @pavel @vbabka the big problem is that people are very very bad at picking up on the kind of errors that an algorithm can generate.

We all implicitly assume errors are 'human shaped' i.e. the kind of errors a human would make.

An LLM can have a very good grasp of the syntax but then interpolates results in effect randomly as the missing component is a dynamic understanding of the system.

As a result, they can introduce very very subtle bugs that'll still compile/run etc.

People are also incredibly bad at assessing how much cost this incurs in practice.

Having something that can generate such errors for only trivial tasks strikes me as being worse than having nothing at all.

And the ongoing 'emperor's new clothes' issues with LLMs is this issue is insoluble. Hallucination is an unavoidable part of how they work.

The whole machinery of the thing is trying to infer patterns from a dataset, so at a fundamental level it's broken by design.

That's before we get on to the fact it's needs human input to work (you start putting LLM generated input in it completely collapses), so the whole thing couldn't work anyway on any long term scale.

That's before we get on to the fact it steals software and ignores license, the carbon costs and monetary costs of compute, and a myriad of other problems...

The whole problem with all this is it's a very very convincing magic trick and works so well that people are blinded to its flaws.

See https://en.wikipedia.org/wiki/ELIZA_effect?useskin=vector

AGRO TURBO SATAN 🇺🇦🇨🇿👃💨

lkundrak@metalhead.club

1 year ago

Reply to @ljs

@ljs @ben @pavel @vbabka first they were anti pdp-11 now they are anti vax and next you'll see them get anti alpha to the point they'll start removing support for old alpha processors

ljs

1 year ago

Reply to @lkundrak@metalhead.club

@lkundrak @ben @pavel @vbabka first they came for the pdp-11 and I said nothing...

AGRO TURBO SATAN 🇺🇦🇨🇿👃💨

lkundrak@metalhead.club

1 year ago

Reply to @pavel

@pavel @ben @ljs @vbabka people said this about heroin too

AGRO TURBO SATAN 🇺🇦🇨🇿👃💨

lkundrak@metalhead.club

1 year ago

Reply to @pavel

@pavel @ben i don't think they were sabotaging anything? nobody minds stackoverflow training their models on their own. they just chose not to help them because the conditions were not fair (the original author not having rights to the derived work)

AGRO TURBO SATAN 🇺🇦🇨🇿👃💨

lkundrak@metalhead.club

1 year ago

Reply to @ljs

@ljs @ben @pavel @vbabka i'd also come for a pdp-11 if i lived in the u.s. and had a place for it

Petr Tesarik

ptesarik@fosstodon.org

1 year ago

Reply to @lkundrak@metalhead.club

@lkundrak @ljs @ben @pavel @vbabka alpha considered harmful; if male, a gender stereotype even

➴➴➴Æ🜔Ɲ.Ƈꭚ⍴𝔥єɼ👩🏻‍💻

AeonCypher@lgbtqia.space

1 year ago

Reply to @ljs

Edited 1 year ago

@ljs @datarama @Orb2069 @ben @felipe

Are you saying to trust me? I'm not a 'him'.

I'm quite strongly against OpenAI. What you are saying is quite the opposite of what I said.

The comment above continues to be an irrelevancy. A strung together set of jargonizations.

No one builds LLMs with k-fold validation. OpenAIs models are, likely intentionally, overfit. Which is why they are full of exact copies of data.

However, again, whatever you two think you're arguing against it's not related to a position I hold.

Kai Klostermann

OddDev@floss.social

1 year ago

Reply to @AeonCypher@lgbtqia.space

@AeonCypher @datarama @Orb2069 @felipe @ben

➴➴➴Æ🜔Ɲ.Ƈꭚ⍴𝔥єɼ👩🏻‍💻

AeonCypher@lgbtqia.space

1 year ago

Reply to @OddDev@floss.social

@OddDev @datarama @Orb2069 @felipe @ben
Wow, third time in this conversation I've had someone use a typically male gendered word to refer to me.

Keep it up.

Felix Reda

senficon@ohai.social

1 year ago

Reply to

@tdr @kkarhan @ben Perhaps I can clarify, as I wrote the article. § 44b UrhG is the German transposition of Art. 4 DSM copyright directive, which I cover in the article: “Since the EU Copyright Directive of 2019, … where commercial uses are concerned, rightsholders who do not want their copyright-protected works to be scraped for data mining must opt-out in machine-readable form”, so although Germany had not adopted §44b yet, the article takes it into account.

Witold Kowalik

frankboon@pol.social

1 year ago

Reply to @Phosphenes@glasgow.social

@Phosphenes

Alas it is always the Luddite question is it not?
Ask not what the machine does but to whom and for who's benefit?
AI should be creating a better future for the benefit of all, and mostly for those of dire needs. Instead it reaps the benefits for the fat cats above, and indulges in the #enshitification of our reality.
An you've wrote "in a perfect world" - I don't think this should be considered in such terms. That should be our normal one.

@datarama @Orb2069 @felipe @ben

Orb 2069

Orb2069@mastodon.online

1 year ago

Reply to @AeonCypher@lgbtqia.space

@ljs @datarama @ben @felipe

Anyone taking bets on this being a bot?

Felix Reda

senficon@ohai.social

1 year ago

Reply to

@kylotan @tdr @kkarhan @ben Correct. That is a problem I’m working on right now. I wouldn’t say it’s deliberately weak, just that implementing and enforcing new regulation takes time and all things considered, this is still a new ruleset.

fredbrooker@witter.cz

1 year ago

Reply to @ben@m.benui.ca

@ben OpenAI are thieves - everybody knows - just jail 'em

➴➴➴Æ🜔Ɲ.Ƈꭚ⍴𝔥єɼ👩🏻‍💻

AeonCypher@lgbtqia.space

1 year ago

Reply to @Orb2069@mastodon.online

@Orb2069 @ljs @datarama @ben @felipe

Are you accusing me of being a bot. Kindly go fuck yourself.

I actually work with the technology and actively work _against_ the corporate powers trying to monopolize it.

You on the other hand are spewing jargon you do not understand in order to look smart, and fearmongering about something you know nothing about.

Orb 2069

Orb2069@mastodon.online

1 year ago

Reply to @AeonCypher@lgbtqia.space

@AeonCypher @ljs @datarama @ben @felipe

What a strange non-sequitor.
I wonder if you're actually trying to understand something, or if I should simply block you.

➴➴➴Æ🜔Ɲ.Ƈꭚ⍴𝔥єɼ👩🏻‍💻

AeonCypher@lgbtqia.space

1 year ago

Reply to @Orb2069@mastodon.online

@Orb2069 @ljs @datarama @ben @felipe

Oh, so you're the bot...
It's the only explanation for a verbatim response like this.

Artemis

artemis@dice.camp

1 year ago

Reply to @AeonCypher@lgbtqia.space

@AeonCypher @datarama @Orb2069 @felipe @ben
Not until it solves its energy consumption problem!

➴➴➴Æ🜔Ɲ.Ƈꭚ⍴𝔥єɼ👩🏻‍💻

AeonCypher@lgbtqia.space

1 year ago

Reply to @artemis@dice.camp

@artemis @datarama @Orb2069 @felipe @ben
AI has a _projected_ energy consumption problem. This is the problem of #BigTech and #Capitalism. Not a problem of #AI as a technology.
You can run a Llama 3 model on a modern consumer graphics card.

ben 🇵🇸 ui

ben 🇵🇸 ui

skategoat 🐐 🇵🇸

Personne

Mighty Orbot

AndrewFelix 🐀🏴‍☠️ 🇵🇸

Bornach

AndrewFelix 🐀🏴‍☠️ 🇵🇸

Pavel Machek

Pavel Machek

Aral Balkan

Erik Uden 🍑

Orb 2069

datarama

Bornach

Phosphenes

betalars @ GAMESCOM

Orb 2069

Orb 2069

Phosphenes

Orb 2069

ClickyMcTicker

Pavel Machek

13xforever

ben 🇵🇸 ui

M.O.M.O.

Glowing Cat of the Nuclear Wastelands ☣

ben 🇵🇸 ui

Sue is Writing Solarpunk 🌞🌱

also lunya now

Proto

➴➴➴Æ🜔Ɲ.Ƈꭚ⍴𝔥єɼ👩🏻‍💻

Orb 2069

➴➴➴Æ🜔Ɲ.Ƈꭚ⍴𝔥єɼ👩🏻‍💻

ben 🇵🇸 ui

Kevin Karhan

agersant

Tim Clevenger

Pavel Machek

Petr Tesarik

AGRO TURBO SATAN 🇺🇦🇨🇿👃💨

AGRO TURBO SATAN 🇺🇦🇨🇿👃💨

AGRO TURBO SATAN 🇺🇦🇨🇿👃💨

AGRO TURBO SATAN 🇺🇦🇨🇿👃💨

Petr Tesarik

➴➴➴Æ🜔Ɲ.Ƈꭚ⍴𝔥єɼ👩🏻‍💻

Kai Klostermann

➴➴➴Æ🜔Ɲ.Ƈꭚ⍴𝔥єɼ👩🏻‍💻

Felix Reda

Witold Kowalik

Orb 2069

Felix Reda

➴➴➴Æ🜔Ɲ.Ƈꭚ⍴𝔥єɼ👩🏻‍💻

Orb 2069

➴➴➴Æ🜔Ɲ.Ƈꭚ⍴𝔥єɼ👩🏻‍💻

Artemis

➴➴➴Æ🜔Ɲ.Ƈꭚ⍴𝔥єɼ👩🏻‍💻

Terms of service

Privacy notice

Getting your own account