enable CPU HBM #2603

jikunshang · 2023-08-14T05:18:21Z

This PR enables CPU HBM support by using the memkind library to allocate HBM memory.
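As an illustration of the idea only (not the PR's actual diff), an aligned allocator could route through memkind's `hbwmalloc.h` interface when a build flag is set and fall back to the regular allocator otherwise. The macro `SKETCH_USE_CPU_HBM` and the `sketch_*` function names are hypothetical; `hbw_posix_memalign` and `hbw_free` are the memkind calls assumed here.

```c
#define _POSIX_C_SOURCE 200112L   /* for posix_memalign */
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

#ifdef SKETCH_USE_CPU_HBM         /* hypothetical flag, not the PR's option name */
#include <hbwmalloc.h>            /* memkind's high-bandwidth memory interface */
#endif

/* Allocate `size` bytes aligned to `alignment` (a power of two that is at
 * least sizeof(void *)). Returns NULL on failure or when size is 0. */
static void *sketch_aligned_alloc(size_t alignment, size_t size) {
    assert(alignment >= sizeof(void *) && (alignment & (alignment - 1)) == 0);
    if (size == 0) {
        return NULL;              /* avoid asking the allocator for 0 bytes */
    }
    void *ptr = NULL;
#ifdef SKETCH_USE_CPU_HBM
    int rc = hbw_posix_memalign(&ptr, alignment, size);  /* HBM-backed */
#else
    int rc = posix_memalign(&ptr, alignment, size);      /* regular memory */
#endif
    return rc == 0 ? ptr : NULL;
}

/* Release memory with the allocator family that produced it. */
static void sketch_aligned_free(void *ptr) {
    if (ptr == NULL) {
        return;
    }
#ifdef SKETCH_USE_CPU_HBM
    hbw_free(ptr);
#else
    free(ptr);
#endif
}
```

Built without the flag, this compiles and behaves like a plain aligned allocator; building with the flag would additionally require linking against memkind.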

netrunnereve · 2023-08-15T01:43:51Z

Are you seeing any performance improvements?

jikunshang · 2023-08-31T08:39:33Z

> Are you seeing any performance improvements?

Sorry for the late reply, I only got access to a test machine recently.
On a Xeon CPU Max 9462, running llama2-7b:
baseline inference performance is 124 ms/token;
with this change and HBM enabled, inference performance is 87 ms/token, which is roughly a 40% throughput gain (124 / 87 ≈ 1.43x, or about 30% lower latency per token).

jikunshang · 2023-08-31T08:41:16Z

Hi @ggerganov, could you review this at your convenience?

ggml.c

hydroo · 2023-09-02T19:39:56Z

Sorry for my ignorance.
Why doesn't the system allocate on HBM without this change?
Is this a system with both DDR and HBM, where some BIOS setting (I vaguely remember cache vs. other modes on Optane and Knights* chips) makes the system either prioritize DDR or never touch HBM at all?
I'm asking, because it's unintuitive to me that a system wouldn't perhaps prioritize using HBM over DDR.

jikunshang · 2023-09-04T00:42:24Z

> Sorry for my ignorance. Why doesn't the system allocate on HBM without this change? Is this a system with DDR and HBM, and some BIOS setting (I vaguely remember caching vs other modes (For Optane and Knights* chips)) makes it such that the system either prioritizes DDR or even never touches HBM? I'm asking, because it's unintuitive to me that a system wouldn't perhaps prioritize using HBM over DDR.

Yes, you are right.
There are actually three memory modes on the Xeon CPU Max Series: HBM-only mode, Flat mode (1LM), and Cache mode (2LM). In Cache mode it works as you describe, but it lacks fine-grained memory management, since all HBM is transparent to software and behaves like an L4 cache.
This change targets Flat mode, where HBM and DDR are exposed to software as separate address spaces, so we can use HBM on demand.
More details about HBM configuration can be found here
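To make "use HBM on demand" concrete, here is a minimal sketch of per-buffer tier selection in Flat mode: a caller asks for HBM only for bandwidth-bound buffers and silently gets DDR when HBM is not compiled in or not available. All `sketch_*` names and the `SKETCH_USE_CPU_HBM` macro are hypothetical; `hbw_check_available`, `hbw_malloc`, and `hbw_free` are assumed from memkind's `hbwmalloc.h`. This is not how the PR itself is structured.

```c
#include <stddef.h>
#include <stdlib.h>

#ifdef SKETCH_USE_CPU_HBM         /* hypothetical flag, not the PR's option name */
#include <hbwmalloc.h>
#endif

typedef enum { SKETCH_TIER_DDR, SKETCH_TIER_HBM } sketch_tier;

typedef struct {
    void       *data;
    size_t      size;
    sketch_tier tier;             /* remembers which allocator must free it */
} sketch_buffer;

/* Try to allocate from HBM; returns NULL if HBM is unavailable. */
static void *sketch_hbm_malloc(size_t size) {
#ifdef SKETCH_USE_CPU_HBM
    return hbw_check_available() == 0 ? hbw_malloc(size) : NULL;
#else
    (void) size;
    return NULL;                  /* no HBM path compiled in */
#endif
}

/* Allocate a buffer, preferring HBM when requested, else DDR. */
static sketch_buffer sketch_buffer_alloc(size_t size, int want_hbm) {
    sketch_buffer buf = { NULL, size, SKETCH_TIER_DDR };
    if (size == 0) {
        return buf;
    }
    if (want_hbm) {
        buf.data = sketch_hbm_malloc(size);
        if (buf.data != NULL) {
            buf.tier = SKETCH_TIER_HBM;
            return buf;
        }
    }
    buf.data = malloc(size);      /* DDR fallback */
    if (buf.data == NULL) {
        buf.size = 0;
    }
    return buf;
}

/* Free with the allocator that matches the recorded tier. */
static void sketch_buffer_free(sketch_buffer *buf) {
    if (buf->tier == SKETCH_TIER_HBM) {
#ifdef SKETCH_USE_CPU_HBM
        hbw_free(buf->data);
#endif
    } else {
        free(buf->data);
    }
    buf->data = NULL;
    buf->size = 0;
    buf->tier = SKETCH_TIER_DDR;
}
```

Recording the tier in the buffer is a design choice of this sketch: it keeps allocation and release paired without relying on the allocator to identify where a pointer came from.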

ggml.c

llama.cpp

ggerganov · 2023-09-04T19:58:16Z

Merge if CI passes

jikunshang · 2023-09-08T01:10:03Z

Hi @ggerganov, could you approve the workflows again? Thanks!

kunger97 · 2023-11-28T05:18:41Z

Is HBM support compiled into llama.cpp by default, or does it need to be enabled at build time?

jikunshang force-pushed the cpu_hbm branch from 91b4c08 to c531d50 · August 14, 2023 05:32

jikunshang force-pushed the cpu_hbm branch from c531d50 to d2e2080 · August 31, 2023 08:36

ggerganov requested changes Sep 1, 2023

ggml.c (outdated, resolved)

ggerganov reviewed Sep 4, 2023

ggml.c (outdated, resolved)

ggerganov reviewed Sep 4, 2023

llama.cpp (outdated, resolved)

ggerganov approved these changes Sep 4, 2023


jikunshang and others added 6 commits September 8, 2023 01:07

eeb20c0  add cpu hbm support
7ba244b  add memalign 0 byte check
100d8e0  Update ggml.c
b2a0939  Update llama.cpp
6701d16  ggml : allow ggml_init with 0 size
be4bbfb  retrigger ci

jikunshang force-pushed the cpu_hbm branch from db9435f to be4bbfb · September 8, 2023 01:09

41bb1a5  fix code style

slaren merged commit 7f412da into ggml-org:master Sep 8, 2023
