Intorduce XNNPACKHeaderto manage flatbuffer data and constant data #1523

mcr229 · 2024-01-03T01:54:07Z

Summary:
Introducing the XNNPACKHeader to manage the flatbuffer data and constant data.

Previously, we have serialized constant data along with flatbuffer. However, with large weights and large tensors in general, this takes a large amount of time and memory converting our dataclass --> json --> flatbuffer. This has become a blocker on some larger models

To fix, we circumvent serializing constant tensors via flatbuffer, by appending the constant data after the flatbuffer payload. In order to do this, we need an XNNPACKHeader which will give us the flatbuffer offset, flatbuffer size, constant data offset, and constant data sizes.

It will look something like this:

             ┌───────────────────────────────────┐
             │XNNPACK Header                     │
             ├───────────────────────────────────┤
             │Padding for 16 byte alignment      │
             ├───────────────────────────────────┤
             │Flatbuffer-serialized payload data │
             │                                   │
             │                                   │
             ├───────────────────────────────────┤
             │Padding for 16 byte alignment      │
             ├───────────────────────────────────┤
             │Constant Data                      │
             │                                   │
             │                                   │
             └───────────────────────────────────┘

Within the XNNPACK Header, we hold the following:

4 bytes to offset the header magic
4 bytes for the header magic
4 bytes for the header length
8 bytes for the flatbuffer offset
8 bytes for the flatbuffer size
8 bytes for constant data offset
8 bytes for constant data size

Differential Revision: D52497977

pytorch-bot · 2024-01-03T01:54:10Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/1523

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (2 Unrelated Failures)

As of commit 694e067 with merge base 428da4f ():

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

pull / test-models-linux (cmake, vit, portable, linux.2xlarge) / linux-job (gh)
RuntimeError: Command docker exec -t acf9afd8a50ebcd5640e4b8ad76cbff6784ac3762edbbac96fdf34772cc28179 /exec failed with exit code 1
pull / test-models-linux (cmake, vit, xnnpack-delegation, linux.2xlarge) / linux-job (gh)
RuntimeError: Command docker exec -t 59943fad7353ddfc39b6e666f0cc207f156c42c7bd9695247c48f546c2d2eac9 /exec failed with exit code 1

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2024-01-03T01:54:17Z