Commit 50d2227

ggml-cpu: reduce asm calls for hsum (#14037)
Signed-off-by: Aaron Teo <[email protected]>
1 parent 6231c5c commit 50d2227

1 file changed: +2 -4 lines changed

ggml/src/ggml-cpu/simd-mappings.h

Lines changed: 2 additions & 4 deletions
@@ -944,10 +944,8 @@ static inline void __lsx_f16x4_store(ggml_fp16_t * x, __m128 y) {
     for (int i = 0; i < offset; ++i) {                \
         x[i] = vec_add(x[i], x[offset + i]);          \
     }                                                 \
-    res = vec_extract(x[0], 0) +                      \
-          vec_extract(x[0], 1) +                      \
-          vec_extract(x[0], 2) +                      \
-          vec_extract(x[0], 3);                       \
+    float32x4_t tmp = x[0] + vec_reve(x[0]);          \
+    res = tmp[0] + tmp[1];                            \
 }
 
 #define GGML_F32_VEC GGML_F32x4
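
The change replaces four lane extractions with one reversed add followed by two lane reads. The sketch below models that idea in portable C so it can be checked without the target hardware. The `f32x4` struct and the helpers `f32x4_add`, `f32x4_reverse`, `hsum_extract`, and `hsum_reverse` are hypothetical stand-ins, not ggml or compiler APIs, and it assumes `vec_reve` reverses the lane order, as its use in the diff suggests.

```c
#include <assert.h>

/* Hypothetical stand-in for a 4-lane float vector (not a ggml type). */
typedef struct { float lane[4]; } f32x4;

/* Lane-wise add, modelling vec_add / operator+ on vector types. */
static f32x4 f32x4_add(f32x4 a, f32x4 b) {
    f32x4 r;
    for (int i = 0; i < 4; ++i) {
        r.lane[i] = a.lane[i] + b.lane[i];
    }
    return r;
}

/* Lane reversal: [a, b, c, d] -> [d, c, b, a] (assumed vec_reve behaviour). */
static f32x4 f32x4_reverse(f32x4 v) {
    f32x4 r;
    for (int i = 0; i < 4; ++i) {
        r.lane[i] = v.lane[3 - i];
    }
    return r;
}

/* Old form: read all four lanes and add them left to right. */
static float hsum_extract(f32x4 v) {
    return v.lane[0] + v.lane[1] + v.lane[2] + v.lane[3];
}

/* New form: v + reverse(v) = [a+d, b+c, c+b, d+a], so lanes 0 and 1
 * together already hold the full sum. */
static float hsum_reverse(f32x4 v) {
    f32x4 tmp = f32x4_add(v, f32x4_reverse(v));
    return tmp.lane[0] + tmp.lane[1];
}

/* Self-check on values that are exactly representable in float,
 * so both summation orders give the same result. */
static void hsum_self_check(void) {
    f32x4 v = {{1.0f, 2.0f, 3.0f, 4.0f}};
    assert(hsum_extract(v) == 10.0f);
    assert(hsum_reverse(v) == 10.0f);
}
```

The two forms add the lanes in a different order, ((a+b)+c)+d versus (a+d)+(b+c), so for general floating-point inputs the results may differ in the last bits even though they are mathematically equal.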
