Add Joint Matrix support for combination 32x32x16 for DG2
1. Implement joint_matrix_store, joint_matrix_load, joint_matrix_apply and joint_matrix_mad for the 32x32x16 combination in SYCL Joint Matrix, with sub-group size 8, on the intel_gpu_dg2 device. This includes Matrix A bfloat16 32x16 PackedA_RowMajor, Matrix B bfloat16 16x32 PackedB_RowMajor, Matrix B bfloat16 16x32 PackedB_PackedB, and Matrix C float 32x32 Accumulator_RowMajor.
2. Modify the interface and implementation in the DEFINE_LOAD_LARGE and DEFINE_STORE_LARGE functions located in Source/IGC/BiFModule/Languages/OpenCL/PreRelease/IBiF_matrix.cl.
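
For reviewers, the shape arithmetic behind the 32x32x16 combination can be sketched in plain host-side C++. This is only a reference illustration, not the IGC built-in implementation: `reference_mad` and `elements_per_work_item` are hypothetical helper names, `float` stands in for bfloat16, plain row-major storage stands in for the packed layouts, and the per-work-item count assumes the tile is split evenly across the sub-group.

```cpp
#include <array>
#include <cstddef>

// Tile shape for the 32x32x16 combination: A is MxK, B is KxN, C is MxN.
constexpr std::size_t M = 32, N = 32, K = 16;

// Assumption: float replaces bfloat16 and all tiles are plain row-major,
// so this shows only the arithmetic, not the packed device layouts.
using MatA = std::array<float, M * K>;
using MatB = std::array<float, K * N>;
using MatC = std::array<float, M * N>;

// Reference multiply-add: C += A * B.
void reference_mad(const MatA& a, const MatB& b, MatC& c) {
    for (std::size_t i = 0; i < M; ++i) {
        for (std::size_t j = 0; j < N; ++j) {
            float acc = c[i * N + j];
            for (std::size_t k = 0; k < K; ++k) {
                acc += a[i * K + k] * b[k * N + j];
            }
            c[i * N + j] = acc;
        }
    }
}

// Hypothetical helper: number of tile elements each work item would hold
// if the tile were divided evenly across the sub-group.
constexpr std::size_t elements_per_work_item(std::size_t rows,
                                             std::size_t cols,
                                             std::size_t sub_group_size) {
    return rows * cols / sub_group_size;
}
```

Under the even-split assumption and a sub-group size of 8, `elements_per_work_item(32, 32, 8)` gives 128 accumulator elements per work item, and `elements_per_work_item(32, 16, 8)` gives 64 elements of Matrix A per work item.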