Conversation

Do you hate LLMs? Think of an LLM as a lossy database, one that can also be queried in human language.
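To make the "lossy database" framing concrete, here is a toy sketch: instead of exact keys, records are retrieved by fuzzy match against a natural-language question. The facts and the word-overlap scoring are made up for illustration; a real LLM does this with learned representations, not token overlap.

```python
def words(text):
    """Crude tokenizer: lowercase words with trailing punctuation stripped."""
    return {w.strip("?.,!") for w in text.lower().split()}

def lossy_query(facts, question):
    """Return the stored fact sharing the most words with the question."""
    return max(facts, key=lambda fact: len(words(question) & words(fact)))

# A tiny "database" of statements from this thread.
FACTS = [
    "llama.cpp runs on CUDA and ROCm",
    "NPUs lack standard APIs",
    "GNOME and KDE have no suitably licensed model",
]
```

Querying `lossy_query(FACTS, "Which engine runs on CUDA?")` picks the first fact: lookup by approximate language match, with no guarantee of exactness — hence "lossy".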

What does Linux need right now to enjoy this DB?

1. Reasonable support for accelerating models on GPUs and NPUs (e.g. via a TensorFlow Lite delegate) across various hardware
2. distributions shipping tools to run models, but most importantly a stable interface to communicate with these models (an OpenAI-compatible API on an expected port) by default
3. GNOME or KDE leveraging it

4. and, perhaps most importantly, distros shipping models trained on openly licensed datasets by default.
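Point 2's "stable interface on an expected port" could simply mean speaking the OpenAI chat-completions wire format against a well-known local address. A minimal sketch follows; the URL, port, and model name are placeholders, not any actual distro standard:

```python
import json

# Hypothetical: if distros standardized "local model on a known port", apps
# could build one request format and not care which engine serves it.
LOCAL_LLM_URL = "http://127.0.0.1:8080/v1/chat/completions"

def build_chat_request(prompt, model="default"):
    """Build the JSON body of an OpenAI-style chat completion request."""
    return json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    })

# Sending it would then be a single call, e.g.:
#   urllib.request.urlopen(LOCAL_LLM_URL, data=build_chat_request("hi").encode())
```

The point of the sketch is that the client side is trivial once the endpoint is agreed on; the hard part is distros agreeing on the port and shipping something behind it.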

@okias LLM outputs do not seem to be copyrightable. Plus, various open-source licenses are incompatible with each other, so you'd need gpl2-lm, bsd-lm, ...

@okias These models are all researched on Linux, and they are trained on Linux. The GPGPU frameworks like CUDA and HIP work on Linux. All the Python code to run the models runs on Linux, as does PyTorch with GPU acceleration. Optimized C/C++-based engines like llama.cpp run on Linux on CUDA and ROCm (I worked on this), as do other projects that use llama.cpp underneath, like Ollama.

NPUs are garbage for LLMs; they are only good for 10-year-old image recognition models. They are very hard to develop against because there are no standard APIs like we have for graphics.

KDE and GNOME cannot leverage it because they don't have a model with an appropriate license. There is also no model suited to GNOME's or KDE's needs for a DE-level integration. I also can't think of a use case at all.

1/2 1. I agree; however, when I install my distro, I don't get the experience of having an LLM ready to use. CUDA is proprietary and cannot be distributed with Linux distributions, and ROCm only works with AMD cards. Thus OpenCL is so far the only vendor-agnostic option.

2. Perhaps @tomeu could jump in here? :)

@slyecho 2/2
3. Environments could leverage it even without the model in place, offering the user the option to download it, but the infrastructure is lacking. Once the user downloads the model, GNOME/KDE could start using it automatically.
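The "download on demand, then use automatically" flow described above could reduce to a simple availability probe that desktop features check at startup. A sketch, with a made-up model directory (the real location would be a distro/desktop decision):

```python
from pathlib import Path

# Hypothetical system-wide model drop point; this path is invented for
# illustration, not an existing convention.
DEFAULT_MODEL_DIR = Path("/usr/share/models")

def llm_features_enabled(model_name, model_dir=DEFAULT_MODEL_DIR):
    """Desktop LLM features light up only once the user has fetched a model."""
    return (model_dir / model_name).is_file()
```

With something like this in place, GNOME/KDE could ship the integration disabled, offer the download once, and enable the features the moment the probe starts returning True.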

Use cases include everything from code generation in IDEs to offline translation, etc.

My main concern is the regular user.

I can set up an LLM + OpenAI API + apps to be performant and useful, but it costs an arm and a leg.

@Paralyses2834 @tomeu Thanks, I need to look more into oneAPI. Anyway, the statement that OpenCL is quite slow doesn't make much sense to me; could you elaborate a bit?
