[CANN]Support operator SIN COS ARGMAX #12709

noemotiovon · 2025-04-02T08:28:02Z

Why is this PR needed?

Optimize the sin, cos, and argmax operators in the CANN backend using the aclnn acceleration library.

Test

  # SIN
  SIN(type=f16,ne=[10,2,2,2]): OK
  SIN(type=f32,ne=[10,2,2,2]): OK
  5294/5294 tests passed
  Backend CANN0: OK
  
  # COS
  COS(type=f16,ne=[10,2,2,2]): OK
  COS(type=f32,ne=[10,2,2,2]): OK
  5294/5294 tests passed
  Backend CANN0: OK
  
  # ARGMAX
  ARGMAX(type=f32,ne=[32,1,1,1]): OK
  ARGMAX(type=f32,ne=[100,10,1,1]): OK
  ARGMAX(type=f32,ne=[1024,10,1,1]): OK
  ARGMAX(type=f32,ne=[1024,12,1,1]): OK
  ARGMAX(type=f32,ne=[2000,10,1,1]): OK
  ARGMAX(type=f32,ne=[5438,3,1,1]): OK
  5294/5294 tests passed
  Backend CANN0: OK
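
For readers unfamiliar with these logs: each `OK` line means the backend result agreed with a reference result for that operator, type, and shape. The following is only a minimal sketch of that kind of check, assuming a normalized mean squared error (NMSE) metric. The helper names `nmse` and `sin_matches` and the threshold passed in are hypothetical and are not taken from this PR.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <limits>
#include <vector>

// Normalized mean squared error between a reference and a candidate result.
// Returns 0 for an exact match; infinity if the reference is all zeros but
// the candidate is not.
static double nmse(const std::vector<float>& ref, const std::vector<float>& out) {
    assert(ref.size() == out.size());
    double err = 0.0;
    double norm = 0.0;
    for (size_t i = 0; i < ref.size(); ++i) {
        const double d = (double) out[i] - (double) ref[i];
        err  += d * d;
        norm += (double) ref[i] * (double) ref[i];
    }
    if (norm == 0.0) {
        return err == 0.0 ? 0.0 : std::numeric_limits<double>::infinity();
    }
    return err / norm;
}

// Hypothetical check: does `candidate` match an element-wise std::sin of `x`
// within `max_err` (threshold chosen by the caller)?
static bool sin_matches(const std::vector<float>& x,
                        const std::vector<float>& candidate,
                        double max_err) {
    std::vector<float> ref(x.size());
    for (size_t i = 0; i < x.size(); ++i) {
        ref[i] = std::sin(x[i]);
    }
    return nmse(ref, candidate) <= max_err;
}
```

The same comparison would apply to COS by swapping `std::sin` for `std::cos`. ARGMAX produces integer indices, so an exact comparison is the more natural check there.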

Signed-off-by: noemotiovon <[email protected]>

noemotiovon · 2025-04-02T09:39:07Z

ggml/src/ggml-cann/aclnn_ops.cpp

+    ACL_CHECK(aclnnArgMax(workspaceAddr, workspaceSize, executor, ctx.stream()));
+
+    size_t cpy_size = ggml_nbytes(dst);
+    ACL_CHECK(aclrtMemcpyAsync(dst->data, cpy_size, buffer, cpy_size,


The extra copy here is necessary because the shape computed by the aclnn operator differs from dst, so a buffer is used to hold the data before copying it back to dst.

No need to allocate an extra buffer; use dst->data instead.

Thank you for your suggestion! I forgot to directly modify dst->data's shape, haha.
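
To illustrate the point of this thread, here is a minimal CPU-only sketch, not the CANN implementation. It assumes ARGMAX reduces along the first dimension (`ne0`) and yields one int32 index per row, so the result can be written straight into the destination storage as long as that storage is viewed with the reduced shape. The function name `argmax_rows` and its parameters are hypothetical.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical reference: for a row-major matrix with ne1 rows of ne0 floats,
// write the index of the largest element of each row directly into dst.
// dst must have room for ne1 int32 values; no intermediate buffer is used.
// On ties, the first (lowest) index wins.
static void argmax_rows(const float* src, int64_t ne0, int64_t ne1, int32_t* dst) {
    assert(ne0 > 0 && ne1 >= 0);
    for (int64_t r = 0; r < ne1; ++r) {
        const float* row = src + r * ne0;
        int64_t best = 0;
        for (int64_t i = 1; i < ne0; ++i) {
            if (row[i] > row[best]) {
                best = i;
            }
        }
        dst[r] = (int32_t) best;
    }
}
```

For example, calling `argmax_rows` on a 3-row, 4-column input whose rows are `{0.1, 0.9, 0.3, 0.2}`, `{5, 1, 2, 3}`, and `{-1, -2, -0.5, -3}` fills the output with `1`, `0`, `2`. The output holds one element per row, which is why the destination has to be described with the reduced shape rather than the source shape.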

hipudding · 2025-04-03T00:59:28Z

ggml/src/ggml-cann/aclnn_ops.cpp

+
+    aclTensor* acl_src = ggml_cann_create_tensor(src0);
+    aclTensor* acl_dst = ggml_cann_create_tensor(dst);
+    aclnn_argmax(ctx, acl_src, acl_dst, dst);


It seems aclnn_argmax already needs dst, so acl_dst can be created inside aclnn_argmax and does not need to be passed as a parameter.
Besides, if aclnn_argmax is not used by other internal functions, these two functions should be combined.

Okay, I will inline this method.


[CANN]support sin cos argmax

def7d45

Signed-off-by: noemotiovon <[email protected]>

github-actions bot added the ggml (changes relating to the ggml tensor library for machine learning) label Apr 2, 2025

noemotiovon commented Apr 2, 2025


hipudding self-requested a review April 3, 2025 00:51

hipudding assigned hipudding and noemotiovon Apr 3, 2025

hipudding added the Ascend NPU (issues specific to Ascend NPUs) label Apr 3, 2025

hipudding reviewed Apr 3, 2025


noemotiovon added 2 commits April 3, 2025 01:29

[CANN]codestyle adjustment

bbe0fd2

Signed-off-by: noemotiovon <[email protected]>

[CANN]Remove redundant code

b853129

Signed-off-by: noemotiovon <[email protected]>

hipudding approved these changes Apr 3, 2025


hipudding merged commit 65cfe13 into ggml-org:master Apr 3, 2025
48 checks passed

noemotiovon mentioned this pull request Apr 7, 2025

llama.cpp: fill in missing operators cosdt/llama.cpp#1

Closed
