OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat #13840

Merged

max-krasnyansky merged 2 commits into ggml-org:master from rmatif:new-opencl-kernels-2

Jun 2, 2025

Contributor

rmatif commented May 28, 2025

The previous PR accumulated a lot of conflicts after group norm was added in commit a3c3084, and I messed up some git commands.

The ops in the title are now added, and with these additions, OpenCL (hopefully) supports all the ops used in stable-diffusion.cpp.

All tests passed using test-backend-ops. Tested on: Adreno 750, 740, 730.

@lhez @max-krasnyansky


          add concat, pad, repeat, tsembd, tanh, upscale

4b5d450

github-actions bot added the ggml label

rmatif changed the title ~~OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat #13781~~ OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat

Contributor

lhez commented May 28, 2025

Thank you @rmatif for the PR. I missed your previous one. Taking a look at the PR now.

lhez reviewed

ggml/src/ggml-opencl/kernels/concat.cl Outdated

+                      y_val_ptr = (global float *)(dst_base + (ulong)current_i3*d_nb3 + (ulong)current_i2*d_nb2 + (ulong)current_i1*d_nb1 + (ulong)current_i0*d_nb0);
+                      *y_val_ptr = *x_val_ptr;
+                  }
+              }

Contributor

lhez May 30, 2025

Can you add a newline at the end of the file?
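
For readers unfamiliar with the addressing in the concat.cl excerpt above: each destination element is located by adding per-dimension byte strides to a base pointer. The following is a minimal CPU-side sketch of that idea in C++, not the kernel from this PR. The names `View4D`, `make_view`, `elem`, and `concat_f32` are hypothetical, and the tensors are assumed to be contiguous float32 with dimension 0 varying fastest.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical 4D tensor view: ne = element counts, nb = byte strides (dim 0 fastest).
struct View4D {
    char *  data;
    int64_t ne[4];
    size_t  nb[4];
};

// Wrap a contiguous float buffer in a view (assumes buf.size() == ne0*ne1*ne2*ne3).
View4D make_view(std::vector<float> & buf, int64_t ne0, int64_t ne1, int64_t ne2, int64_t ne3) {
    View4D v{reinterpret_cast<char *>(buf.data()), {ne0, ne1, ne2, ne3}, {sizeof(float), 0, 0, 0}};
    for (int i = 1; i < 4; ++i) {
        v.nb[i] = v.nb[i - 1] * static_cast<size_t>(v.ne[i - 1]);
    }
    return v;
}

// Address one element using byte strides, as the kernel excerpt does for dst.
float * elem(const View4D & v, int64_t i0, int64_t i1, int64_t i2, int64_t i3) {
    return reinterpret_cast<float *>(v.data
        + static_cast<size_t>(i3) * v.nb[3] + static_cast<size_t>(i2) * v.nb[2]
        + static_cast<size_t>(i1) * v.nb[1] + static_cast<size_t>(i0) * v.nb[0]);
}

// Concatenate a and b along dimension `dim` into dst.
// Assumes dst.ne[dim] == a.ne[dim] + b.ne[dim] and all other dimensions match.
void concat_f32(const View4D & a, const View4D & b, const View4D & dst, int dim) {
    assert(dim >= 0 && dim < 4);
    for (int64_t i3 = 0; i3 < dst.ne[3]; ++i3)
    for (int64_t i2 = 0; i2 < dst.ne[2]; ++i2)
    for (int64_t i1 = 0; i1 < dst.ne[1]; ++i1)
    for (int64_t i0 = 0; i0 < dst.ne[0]; ++i0) {
        int64_t idx[4] = {i0, i1, i2, i3};
        const View4D * src = &a;
        if (idx[dim] >= a.ne[dim]) {   // past the end of a: read from b instead
            idx[dim] -= a.ne[dim];
            src = &b;
        }
        *elem(dst, i0, i1, i2, i3) = *elem(*src, idx[0], idx[1], idx[2], idx[3]);
    }
}
```

A GPU kernel would typically distribute the outer loops across work-items rather than iterate serially; the sketch only illustrates the index arithmetic.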

lhez reviewed

ggml/src/ggml-opencl/kernels/repeat.cl Outdated

+                          current_dst_el_ptr[k] = current_src_el_ptr[k];
+                      }
+                  }
+              }

Contributor

lhez May 30, 2025

Can you add a newline at the end of the file?

lhez reviewed

ggml/src/ggml-opencl/kernels/tsembd.cl Outdated

+                  local_arg = local_timestep_val * local_freq;
+                  local_embed_data_ptr[local_j] = cos(local_arg);
+                  local_embed_data_ptr[local_j + local_half_dim] = sin(local_arg);
+              }

Contributor

lhez May 30, 2025

Can you add a newline at the end of the file?
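
As context for the tsembd.cl excerpt above, which writes a cosine into the first half of the embedding and a sine into the second half, here is a minimal CPU reference sketch in C++. It is not the kernel from this PR. The function name `timestep_embedding`, the `max_period` default of 10000, the frequency formula, and the zero padding for an odd `dim` are assumptions based on the common sinusoidal embedding scheme.

```cpp
#include <cmath>
#include <vector>

// Sinusoidal embedding of a single timestep (hypothetical reference helper).
// Layout matches the excerpt: [cos terms | sin terms].
std::vector<float> timestep_embedding(float timestep, int dim, float max_period = 10000.0f) {
    const int half = dim / 2;
    std::vector<float> embed(dim, 0.0f);  // assumption: a trailing odd element stays 0
    for (int j = 0; j < half; ++j) {
        // Assumed frequency schedule: geometric decay from 1 toward 1/max_period.
        const float freq = std::exp(-std::log(max_period) * j / half);
        const float arg  = timestep * freq;
        embed[j]        = std::cos(arg);
        embed[j + half] = std::sin(arg);
    }
    return embed;
}
```

For example, `timestep_embedding(0.0f, 4)` yields ones in the cosine half and zeros in the sine half, which makes a convenient sanity check when comparing against a GPU result.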

lhez reviewed

ggml/src/ggml-opencl/kernels/unary.cl Outdated

+                          *dst_val_ptr = tanh(*src_val_ptr);
+                      }
+                  }
+              }

Contributor

lhez May 30, 2025

Can you add a newline at the end of the file?

lhez reviewed

ggml/src/ggml-opencl/kernels/upscale.cl Outdated

+                                 val_d * dx * dy;
+                  dst_base[index] = result;
+              }

Contributor

lhez May 30, 2025

Can you add a newline at the end of the file?
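
The `val_d * dx * dy` term in the upscale.cl excerpt above looks like the last term of a bilinear blend of four neighboring samples. A minimal single-channel CPU sketch of bilinear upscaling in C++ follows. It is not the kernel from this PR: the function name `upscale_bilinear`, the half-pixel-center coordinate mapping, and the edge clamping are assumptions chosen for illustration.

```cpp
#include <algorithm>
#include <cmath>
#include <vector>

// Bilinear resize of a row-major sw x sh image to dw x dh (hypothetical reference helper).
std::vector<float> upscale_bilinear(const std::vector<float> & src, int sw, int sh, int dw, int dh) {
    std::vector<float> dst(static_cast<size_t>(dw) * dh);
    const float sx = static_cast<float>(sw) / dw;  // source pixels per destination pixel
    const float sy = static_cast<float>(sh) / dh;
    for (int y = 0; y < dh; ++y) {
        // Assumed mapping: align pixel centers, then clamp to the image.
        const float fy = std::max(0.0f, (y + 0.5f) * sy - 0.5f);
        const int   y0 = std::min(static_cast<int>(fy), sh - 1);
        const int   y1 = std::min(y0 + 1, sh - 1);
        const float dy = fy - y0;
        for (int x = 0; x < dw; ++x) {
            const float fx = std::max(0.0f, (x + 0.5f) * sx - 0.5f);
            const int   x0 = std::min(static_cast<int>(fx), sw - 1);
            const int   x1 = std::min(x0 + 1, sw - 1);
            const float dx = fx - x0;
            const float val_a = src[y0 * sw + x0];  // top-left
            const float val_b = src[y0 * sw + x1];  // top-right
            const float val_c = src[y1 * sw + x0];  // bottom-left
            const float val_d = src[y1 * sw + x1];  // bottom-right
            dst[y * dw + x] = val_a * (1 - dx) * (1 - dy) +
                              val_b * dx * (1 - dy) +
                              val_c * (1 - dx) * dy +
                              val_d * dx * dy;
        }
    }
    return dst;
}
```

The four weights always sum to one, so a constant image stays constant after resizing, which is a cheap property to check when validating a kernel against a reference.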

lhez reviewed

ggml/src/ggml-opencl/kernels/unary.cl Outdated

		@@ -0,0 +1,63 @@
		#pragma OPENCL EXTENSION cl_khr_fp16 : enable

Contributor

lhez May 30, 2025 (edited)

Can you rename this file to tanh.cl? We have been putting each unary op in a separate file. This should also be friendlier to the compilers on A6x.

lhez reviewed

ggml/src/ggml-opencl/ggml-opencl.cpp Outdated

                       case GGML_OP_NORM:
                       case GGML_OP_RMS_NORM:
                           return true;
+                              case GGML_OP_REPEAT:

Contributor

lhez May 30, 2025

The indentation seems off for this case.


          small fixes

1008c1f

Contributor Author

rmatif commented May 30, 2025

@lhez Thanks for the review. Can you check again now, please?

rmatif requested a review from lhez

May 30, 2025 18:23

Contributor

lhez commented Jun 2, 2025

I think it looks good.

max-krasnyansky approved these changes

Collaborator

max-krasnyansky left a comment

Nice! Thanks for adding more kernels!

max-krasnyansky merged commit bfb1e01 into ggml-org:master

46 checks passed

furyhawk pushed a commit to furyhawk/llama.cpp that referenced this pull request


          OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat (ggml-org#13840)

7e872fa

* add concat, pad, repeat, tsembd, tanh, upscale

* small fixes

Labels

ggml