Skip to content

Commit f4e25e1

Browse files
mcremon-metafacebook-github-bot
authored andcommitted
Allow int8 type in quantized_matmul (#5898)
Summary: Pull Request resolved: #5898 As titled. Reviewed By: zonglinpeng Differential Revision: D63659947 fbshipit-source-id: 850e7ce773508ad9ecd3ba36a70e87497a876bcd
1 parent 012cba9 commit f4e25e1

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

backends/cadence/reference/operators/quantized_matmul_out.cpp

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -49,7 +49,7 @@ __attribute__((noinline)) void qmatmul(
4949
sum += (X[i * n + k] - X_zero_point) * (y[k * p + j] - Y_zero_point);
5050
}
5151
}
52-
Z[i * p + j] = kernels::quantize<uint8_t>(sum, Z_scale, Z_zero_point);
52+
Z[i * p + j] = kernels::quantize<TZ>(sum, Z_scale, Z_zero_point);
5353
}
5454
}
5555
}

0 commit comments

Comments
 (0)