Hotfix for the prompt being ignored with CUDA #2190

JohannesGaessler · 2023-07-12T07:48:45Z

Fixes #2187 . The issue seems to be caused by ggml-org/ggml#359 and ggml-org/ggml#373 . As far as I can tell the logic for broadcasting that was implemented in those PRs is incorrect. It implicitly assumes that the tensors are one-dimensional which discards almost all information from previous layers because the first row of the non-constant tensor gets broadcast across all rows. This PR is a hotfix that simply reverts those changes.

ggerganov · 2023-07-12T07:57:48Z

I think I just did the same thing in #2191

JohannesGaessler · 2023-07-12T08:01:37Z

Mostly, but I forgot to revert the non-CUDA changes.

Hotfix for the prompt being ignored with CUDA

dfdadc0

JohannesGaessler closed this Jul 12, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Hotfix for the prompt being ignored with CUDA #2190

Hotfix for the prompt being ignored with CUDA #2190

Uh oh!

JohannesGaessler commented Jul 12, 2023

Uh oh!

ggerganov commented Jul 12, 2023 •

edited

Loading

Uh oh!

JohannesGaessler commented Jul 12, 2023

Uh oh!

Uh oh!

Hotfix for the prompt being ignored with CUDA #2190

Hotfix for the prompt being ignored with CUDA #2190

Uh oh!

Conversation

JohannesGaessler commented Jul 12, 2023

Uh oh!

ggerganov commented Jul 12, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

JohannesGaessler commented Jul 12, 2023

Uh oh!

Uh oh!

ggerganov commented Jul 12, 2023 •

edited

Loading