Replies: 6 comments 21 replies
-
It runs on the Steam Deck, so why not. |
Beta Was this translation helpful? Give feedback.
-
AFAIK, a maximum 4GB of system RAM can be shared with an AMD APU integrated GPU. If 16GB were possible, I would immediately order a framework ryzen laptop... please update if you learn anything different.. EDIT: 16GB VRAM (UMA Frame Buffer Size in BIOS) possible on some manufacturer BIOS with at least 680M/780M (Zen 3/Zen 4) |
Beta Was this translation helpful? Give feedback.
-
I spent some time use setup_var changed my igpu size to 8g. But it's sad the speed not as good as my expectation, especially considered the openblas via clblas speed difference: the 7B speed in GPU evol time in CLBLas version --ngl 1000 is almost same as cpu, and yes the max mem is around 5.6.😂 |
Beta Was this translation helpful? Give feedback.
-
If there is a lot of interest in iGPUs it might be worth creating a zero-copy GPU implementation. That's only possible on iGPUs since they share the main memory with the CPU. |
Beta Was this translation helpful? Give feedback.
-
Today I do some test in windows, but I think it also will work in Linux. Just reduce the thread of cpu will increase the speed of printing. Update: not the same in Linux, still too slow to use. 13b(interactive mode) Update 2: And I failed to reproduce the windows speed, I assume it's the same problem like Intel p/e core... 30b: q4_0 60 layors only take 19g vram.(plus Blas) |
Beta Was this translation helpful? Give feedback.
-
May I know is there currently an iGPU zero copy implementation in llama.cpp? |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Allocate huge vram to delicated AMD gpu
As we know 680m in 6700h, close to 2050,
May the cheapest way to do anything😅😂
Beta Was this translation helpful? Give feedback.
All reactions