Commit 26aced7

Update on "[executorch][llama] support mqa"
This diff adds support for multi-query attention (MQA) to SDPA with KV cache.

Differential Revision: [D56228316](https://our.internmc.facebook.com/intern/diff/D56228316/)

[ghstack-poisoned]
1 parent 2ff9055 commit 26aced7
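
For readers unfamiliar with the term in the commit message, here is a minimal pure-Python sketch of what multi-query attention with a KV cache computes for one decode step. It is not the custom op from this commit. The function name `mqa_decode_step`, its argument layout, and the list-based cache are assumptions made for illustration only. The one idea it shows is that several query heads read from a smaller set of shared key/value heads.

```python
import math


def _softmax(scores):
    # Numerically stable softmax over a list of floats.
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def mqa_decode_step(q, k, v, k_cache, v_cache, start_pos):
    """Hypothetical helper: attention for ONE new token at position start_pos.

    q:        [n_heads][head_dim]      query rows for the new token
    k, v:     [n_kv_heads][head_dim]   key/value rows for the new token
    k_cache,
    v_cache:  lists of length max_seq_len; slot t holds [n_kv_heads][head_dim]
    Returns:  [n_heads][head_dim]
    """
    n_heads, n_kv_heads = len(q), len(k)
    assert n_heads % n_kv_heads == 0, "query heads must divide evenly over KV heads"
    group = n_heads // n_kv_heads  # query heads per shared KV head
    head_dim = len(q[0])
    scale = 1.0 / math.sqrt(head_dim)

    # Write the new token's key/value rows into the cache (in place).
    k_cache[start_pos] = [list(row) for row in k]
    v_cache[start_pos] = [list(row) for row in v]

    out = []
    for h in range(n_heads):
        kv_h = h // group  # the KV head this query head shares
        # Attend over every cached position up to and including start_pos.
        scores = [
            scale * sum(a * b for a, b in zip(q[h], k_cache[t][kv_h]))
            for t in range(start_pos + 1)
        ]
        weights = _softmax(scores)
        out.append([
            sum(weights[t] * v_cache[t][kv_h][d] for t in range(start_pos + 1))
            for d in range(head_dim)
        ])
    return out
```

With `n_kv_heads == 1` every query head reads the same cached keys and values, which is the multi-query case. With `n_kv_heads == n_heads` the sketch reduces to ordinary multi-head attention. The cache therefore shrinks by a factor of `n_heads / n_kv_heads` relative to the multi-head layout.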

1 file changed, with 2 additions and 2 deletions

examples/models/llama2/custom_ops/TARGETS

Lines changed: 2 additions & 2 deletions
@@ -1,14 +1,14 @@
 # Any targets that should be shared between fbcode and xplat must be defined in
 # targets.bzl. This file can contain fbcode-only targets.
 
-load("@fbcode_macros//build_defs:python_unittest.bzl", "python_unittest")
+load("@fbsource//xplat/executorch/build:runtime_wrapper.bzl", "runtime")
 load(":targets.bzl", "define_common_targets")
 
 oncall("executorch")
 
 define_common_targets()
 
-python_unittest(
+runtime.python_test(
     name = "test_sdpa_with_kv_cache",
     srcs = [
         "test_sdpa_with_kv_cache.py",
