Commit bed2032

Update base for Update on "[ET-VK] Adding batch processing in x axis to conv2d dw shader by caching input texel for reuse."
This diff adds batch processing along the x axis to the conv2d dw shader by reusing the input texels that overlap between consecutive tiles. The changes modify the GLSL code for the conv2d dw output tile, add a new parameter to the YAML file, and update Convolution.cpp to use the new parameter.

Differential Revision: [D67868671](https://our.internmc.facebook.com/intern/diff/D67868671/)

[ghstack-poisoned]
2 parents 7260da1 + 241cd0c commit bed2032
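
The texel reuse described in the commit message can be illustrated with a minimal one-dimensional sketch. This is not the shader from the referenced diff (which is not part of this commit's file changes); the function names `conv1d_naive` and `conv1d_batched`, the `batch` parameter, and the load counters are all hypothetical and exist only to show why computing several adjacent outputs from one cached window reduces input reads.

```python
def conv1d_naive(row, kernel):
    """Compute each output from its own freshly loaded window of inputs."""
    k = len(kernel)
    out, loads = [], 0
    for x in range(len(row) - k + 1):
        window = row[x:x + k]          # k input reads per output
        loads += k
        out.append(sum(w * v for w, v in zip(kernel, window)))
    return out, loads


def conv1d_batched(row, kernel, batch=4):
    """Compute `batch` adjacent outputs per step from one cached window.

    Adjacent outputs share k - 1 inputs, so a batch of b outputs needs
    only b + k - 1 input reads instead of b * k.
    """
    k = len(kernel)
    n_out = len(row) - k + 1
    out, loads = [], 0
    for x0 in range(0, n_out, batch):
        b = min(batch, n_out - x0)     # last batch may be shorter
        cache = row[x0:x0 + b + k - 1]  # read the shared inputs once
        loads += len(cache)
        for i in range(b):
            out.append(sum(kernel[j] * cache[i + j] for j in range(k)))
    return out, loads


if __name__ == "__main__":
    row = list(range(10))
    kernel = [1, 2, 1]
    print(conv1d_naive(row, kernel))
    print(conv1d_batched(row, kernel, batch=4))
```

For a 10-element row and a 3-tap kernel, both functions return the same eight outputs, but the naive version performs 24 input reads while the batched version with `batch=4` performs 12.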

4 files changed: 14 additions & 3 deletions

.lintrunner.toml

Lines changed: 1 addition & 0 deletions
@@ -321,6 +321,7 @@ command = [
     '--config=.mypy.ini',
     '--show-disable',
     '--',
+    '--explicit-package-bases',
     '@{{PATHSFILE}}'
 ]
 init_command = [

backends/cadence/aot/compiler.py

Lines changed: 4 additions & 1 deletion
@@ -131,7 +131,10 @@ def quantize_pt2(
     Prepare, convert and fuse the model using the given quantizer.
     Returns a GraphModule with the quantized model.
     """
-    # Quantizer
+    # Make the model inference mode by calling model.eval()
+    model.eval()
+
+    # Instantiate the quantizer to CadenceQuantizer if not supplied
     if not quantizer:
         quantizer = CadenceQuantizer()

docs/source/conf.py

Lines changed: 1 addition & 1 deletion
@@ -197,7 +197,7 @@

 # Custom directives defintions to create cards on main landing page

-from custom_directives import (
+from custom_directives import ( # type: ignore[import-not-found]
     CustomCardEnd,
     CustomCardItem,
     CustomCardStart,

examples/models/llama/export_llama_lib.py

Lines changed: 8 additions & 1 deletion
@@ -79,7 +79,14 @@
 verbosity_setting = None


-EXECUTORCH_DEFINED_MODELS = ["stories110m", "llama2", "llama3", "llama3_1", "llama3_2"]
+EXECUTORCH_DEFINED_MODELS = [
+    "stories110m",
+    "llama2",
+    "llama3",
+    "llama3_1",
+    "llama3_2",
+    "static_llama",
+]
 TORCHTUNE_DEFINED_MODELS = ["llama3_2_vision"]