llama : pre-allocate input tensors in a separate buffer #5100

slaren · 2024-01-23T22:38:13Z

ggml-ci

llama : pre-allocate input tensors in a separate buffer

eaa7722

ggml-ci

ggerganov approved these changes Jan 24, 2024

View reviewed changes

slaren merged commit 1387ea2 into master Jan 24, 2024

slaren deleted the sl/graph-inputs branch January 24, 2024 11:48

jordankanter pushed a commit to jordankanter/llama.cpp that referenced this pull request Feb 3, 2024

llama : pre-allocate input tensors in a separate buffer (ggml-org#5100)

bdab85c

cebtenzzre mentioned this pull request Feb 9, 2024

Add support for BERT embedding models #5423

Merged

hodlen pushed a commit to hodlen/llama.cpp that referenced this pull request Apr 1, 2024

llama : pre-allocate input tensors in a separate buffer (ggml-org#5100)

2e2da1c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

llama : pre-allocate input tensors in a separate buffer #5100

llama : pre-allocate input tensors in a separate buffer #5100

Uh oh!

slaren commented Jan 23, 2024

Uh oh!

Uh oh!

llama : pre-allocate input tensors in a separate buffer #5100

llama : pre-allocate input tensors in a separate buffer #5100

Uh oh!

Conversation

slaren commented Jan 23, 2024

Uh oh!

Uh oh!