
[llava][21/N] Add llava runner test binary and build script #4667

Merged
larryliu0820 merged 24 commits into main on Aug 16, 2024

Conversation

larryliu0820 (Contributor) commented Aug 12, 2024

Stack from ghstack (oldest at bottom):

Add a `main.cpp` and `CMakeLists.txt` for the llava runner. The runner takes an image serialized as a `.pt` file (a serialized PyTorch module) along with a text prompt, and generates text tokens in a way similar to the llama runner.

Run `build.sh` to build the runner.

To serialize the image into a `.pt` file, run the following script:

import torch
from torch import nn

# `resized` is the preprocessed image (e.g. a NumPy array or tensor) to embed in the file.
copy = torch.tensor(resized)
m = nn.Module()
par = nn.Parameter(copy, requires_grad=False)
m.register_parameter("0", par)
tensors = torch.jit.script(m)
tensors.save("image.pt")
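The script assumes `resized` already holds the preprocessed image. As a rough sketch (not part of this PR), one way to produce such a tensor with PIL and torchvision might look like the following; the file name `photo.jpg`, the 336x336 target size, and the CHW uint8 layout are assumptions and depend on how the llava model was exported:

```python
from PIL import Image
import torchvision.transforms.functional as F

# Hypothetical preprocessing: load an RGB image and resize it.
# The 336x336 size and CHW uint8 layout are assumptions, not taken from this PR.
img = Image.open("photo.jpg").convert("RGB")
resized = F.pil_to_tensor(F.resize(img, [336, 336]))  # CHW uint8 tensor
```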

To run the runner, use the following command:

cmake-out/examples/models/llava/llava_main \
    --tokenizer_path tokenizer.bin \
    --model_path llava_kv_768.pte \
    --prompt "\nWhat are the things I should be cautious about when I visit here?" \
    --image_path image.pt \
    --temperature 0
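
As a quick sanity check (not part of this PR), the serialized file can be loaded back with TorchScript to confirm the image tensor was saved under the parameter name "0":

```python
import torch

# Load the scripted module saved above and inspect the registered parameter.
m = torch.jit.load("image.pt")
img = dict(m.named_parameters())["0"]
print(img.shape, img.dtype)
```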

Differential Revision: D61146432

As titled. This PR moves the token generation loop in the llama2 runner into a new class so it can be reused.

Differential Revision: [D61047601](https://our.internmc.facebook.com/intern/diff/D61047601)
pytorch-bot (bot) commented Aug 12, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/4667

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 5b687ce with merge base 84100d1:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

larryliu0820 added a commit that referenced this pull request Aug 12, 2024
facebook-github-bot added the CLA Signed label Aug 12, 2024
larryliu0820 marked this pull request as ready for review August 12, 2024 16:38
lucylq (Contributor) left a comment:

LGTM after linter/nits, thank you

… and build script"


Add a `main.cpp` and CMakeLists.txt for llava runner. This runner takes in an image in the format of `.pt` (a serialized pytorch model) along with text prompt. It will generate text tokens in a way similar to llama runner.

Run `build.sh` to build the runner.

To serialize the image into a `.pt` file, run the following script:

```python
import torch
from torch import nn

copy = torch.tensor(resized)
m = nn.Module()
par = nn.Parameter(copy, requires_grad=False)
m.register_parameter("0",par)
tensors = torch.jit.script(m)
tensors.save("image.pt")
```

To run the runner, use the following command:

```
cmake-out/examples/models/llava/llava_main                                                               \    
    --tokenizer_path tokenizer.bin                                                                                    \
    --model_path llava_kv_768.pte                                                                                  \
    --prompt "\nWhat are the things I should be cautious about when I visit here?"  \
    --image_path image.pt                                                                                                \
    --temperature 0
```


[ghstack-poisoned]
Add a `main.cpp` and CMakeLists.txt for llava runner. This runner takes in an image in the format of `.pt` (a serialized pytorch model) along with text prompt. It will generate text tokens in a way similar to llama runner.

Run `build.sh` to build the runner.

To serialize the image into a `.pt` file, run the following script:

```python
import torch
from torch import nn

copy = torch.tensor(resized)
m = nn.Module()
par = nn.Parameter(copy, requires_grad=False)
m.register_parameter("0",par)
tensors = torch.jit.script(m)
tensors.save("image.pt")
```

To run the runner, use the following command:

```
cmake-out/examples/models/llava/llava_main                                                               \    
    --tokenizer_path tokenizer.bin                                                                                    \
    --model_path llava_kv_768.pte                                                                                  \
    --prompt "\nWhat are the things I should be cautious about when I visit here?"  \
    --image_path image.pt                                                                                                \
    --temperature 0
```


[ghstack-poisoned]
larryliu0820 added a commit that referenced this pull request Aug 14, 2024
… and build script"


Add a `main.cpp` and CMakeLists.txt for llava runner. This runner takes in an image in the format of `.pt` (a serialized pytorch model) along with text prompt. It will generate text tokens in a way similar to llama runner.

Run `build.sh` to build the runner.

To serialize the image into a `.pt` file, run the following script:

```python
import torch
from torch import nn

copy = torch.tensor(resized)
m = nn.Module()
par = nn.Parameter(copy, requires_grad=False)
m.register_parameter("0",par)
tensors = torch.jit.script(m)
tensors.save("image.pt")
```

To run the runner, use the following command:

```
cmake-out/examples/models/llava/llava_main                                                               \    
    --tokenizer_path tokenizer.bin                                                                                    \
    --model_path llava_kv_768.pte                                                                                  \
    --prompt "\nWhat are the things I should be cautious about when I visit here?"  \
    --image_path image.pt                                                                                                \
    --temperature 0
```


[ghstack-poisoned]
Add a `main.cpp` and CMakeLists.txt for llava runner. This runner takes in an image in the format of `.pt` (a serialized pytorch model) along with text prompt. It will generate text tokens in a way similar to llama runner.

Run `build.sh` to build the runner.

To serialize the image into a `.pt` file, run the following script:

```python
import torch
from torch import nn

copy = torch.tensor(resized)
m = nn.Module()
par = nn.Parameter(copy, requires_grad=False)
m.register_parameter("0",par)
tensors = torch.jit.script(m)
tensors.save("image.pt")
```

To run the runner, use the following command:

```
cmake-out/examples/models/llava/llava_main                                                               \    
    --tokenizer_path tokenizer.bin                                                                                    \
    --model_path llava_kv_768.pte                                                                                  \
    --prompt "\nWhat are the things I should be cautious about when I visit here?"  \
    --image_path image.pt                                                                                                \
    --temperature 0
```


[ghstack-poisoned]
larryliu0820 added a commit that referenced this pull request Aug 15, 2024
@larryliu0820 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

larryliu0820 added a commit that referenced this pull request Aug 16, 2024
larryliu0820 changed the base branch from gh/larryliu0820/50/base to main August 16, 2024 09:01
facebook-github-bot pushed a commit that referenced this pull request Aug 16, 2024
… and build script"


Add a `main.cpp` and CMakeLists.txt for llava runner. This runner takes in an image in the format of `.pt` (a serialized pytorch model) along with text prompt. It will generate text tokens in a way similar to llama runner.

Run `build.sh` to build the runner.

To serialize the image into a `.pt` file, run the following script:

```python
import torch
from torch import nn

copy = torch.tensor(resized)
m = nn.Module()
par = nn.Parameter(copy, requires_grad=False)
m.register_parameter("0",par)
tensors = torch.jit.script(m)
tensors.save("image.pt")
```

To run the runner, use the following command:

```
cmake-out/examples/models/llava/llava_main                                                               \    
    --tokenizer_path tokenizer.bin                                                                                    \
    --model_path llava_kv_768.pte                                                                                  \
    --prompt "\nWhat are the things I should be cautious about when I visit here?"  \
    --image_path image.pt                                                                                                \
    --temperature 0
```

Differential Revision: [D61146432](https://www.internalfb.com/diff/D61146432)

[ghstack-poisoned]
Add a `main.cpp` and CMakeLists.txt for llava runner. This runner takes in an image in the format of `.pt` (a serialized pytorch model) along with text prompt. It will generate text tokens in a way similar to llama runner.

Run `build.sh` to build the runner.

To serialize the image into a `.pt` file, run the following script:

```python
import torch
from torch import nn

copy = torch.tensor(resized)
m = nn.Module()
par = nn.Parameter(copy, requires_grad=False)
m.register_parameter("0",par)
tensors = torch.jit.script(m)
tensors.save("image.pt")
```

To run the runner, use the following command:

```
cmake-out/examples/models/llava/llava_main                                                               \    
    --tokenizer_path tokenizer.bin                                                                                    \
    --model_path llava_kv_768.pte                                                                                  \
    --prompt "\nWhat are the things I should be cautious about when I visit here?"  \
    --image_path image.pt                                                                                                \
    --temperature 0
```

Differential Revision: [D61146432](https://www.internalfb.com/diff/D61146432)

[ghstack-poisoned]
larryliu0820 added a commit that referenced this pull request Aug 16, 2024
@larryliu0820
Copy link
Contributor Author

@larryliu0820 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

larryliu0820 added a commit that referenced this pull request Aug 16, 2024
Summary:
Pull Request resolved: #4750

Pull Request resolved: #4667

Add a `main.cpp` and CMakeLists.txt for llava runner. This runner takes in an image in the format of `.pt` (a serialized pytorch model) along with text prompt. It will generate text tokens in a way similar to llama runner.

Run `build.sh` to build the runner.

To serialize the image into a `.pt` file, run the following script:

```python
import torch
from torch import nn

copy = torch.tensor(resized)
m = nn.Module()
par = nn.Parameter(copy, requires_grad=False)
m.register_parameter("0",par)
tensors = torch.jit.script(m)
tensors.save("image.pt")
```

To run the runner, use the following command:

```
cmake-out/examples/models/llava/llava_main                                                               \
    --tokenizer_path tokenizer.bin                                                                                    \
    --model_path llava_kv_768.pte                                                                                  \
    --prompt "\nWhat are the things I should be cautious about when I visit here?"  \
    --image_path image.pt                                                                                                \
    --temperature 0
```

imported-using-ghimport

Test Plan: Imported from OSS

Reviewed By: kirklandsign

Differential Revision: D61146432

Pulled By: larryliu0820
… and build script"


Add a `main.cpp` and CMakeLists.txt for llava runner. This runner takes in an image in the format of `.pt` (a serialized pytorch model) along with text prompt. It will generate text tokens in a way similar to llama runner.

Run `build.sh` to build the runner.

To serialize the image into a `.pt` file, run the following script:

```python
import torch
from torch import nn

copy = torch.tensor(resized)
m = nn.Module()
par = nn.Parameter(copy, requires_grad=False)
m.register_parameter("0",par)
tensors = torch.jit.script(m)
tensors.save("image.pt")
```

To run the runner, use the following command:

```
cmake-out/examples/models/llava/llava_main                                                               \    
    --tokenizer_path tokenizer.bin                                                                                    \
    --model_path llava_kv_768.pte                                                                                  \
    --prompt "\nWhat are the things I should be cautious about when I visit here?"  \
    --image_path image.pt                                                                                                \
    --temperature 0
```

Differential Revision: [D61146432](https://www.internalfb.com/diff/D61146432)

[ghstack-poisoned]
Add a `main.cpp` and CMakeLists.txt for llava runner. This runner takes in an image in the format of `.pt` (a serialized pytorch model) along with text prompt. It will generate text tokens in a way similar to llama runner.

Run `build.sh` to build the runner.

To serialize the image into a `.pt` file, run the following script:

```python
import torch
from torch import nn

copy = torch.tensor(resized)
m = nn.Module()
par = nn.Parameter(copy, requires_grad=False)
m.register_parameter("0",par)
tensors = torch.jit.script(m)
tensors.save("image.pt")
```

To run the runner, use the following command:

```
cmake-out/examples/models/llava/llava_main                                                               \    
    --tokenizer_path tokenizer.bin                                                                                    \
    --model_path llava_kv_768.pte                                                                                  \
    --prompt "\nWhat are the things I should be cautious about when I visit here?"  \
    --image_path image.pt                                                                                                \
    --temperature 0
```

Differential Revision: [D61146432](https://www.internalfb.com/diff/D61146432)

[ghstack-poisoned]
larryliu0820 added a commit that referenced this pull request Aug 16, 2024
@larryliu0820
Copy link
Contributor Author

@larryliu0820 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

larryliu0820 added a commit that referenced this pull request Aug 16, 2024
Summary:
Pull Request resolved: #4750

Pull Request resolved: #4667

Add a `main.cpp` and CMakeLists.txt for llava runner. This runner takes in an image in the format of `.pt` (a serialized pytorch model) along with text prompt. It will generate text tokens in a way similar to llama runner.

Run `build.sh` to build the runner.

To serialize the image into a `.pt` file, run the following script:

```python
import torch
from torch import nn

copy = torch.tensor(resized)
m = nn.Module()
par = nn.Parameter(copy, requires_grad=False)
m.register_parameter("0",par)
tensors = torch.jit.script(m)
tensors.save("image.pt")
```

To run the runner, use the following command:

```
cmake-out/examples/models/llava/llava_main                                                               \
    --tokenizer_path tokenizer.bin                                                                                    \
    --model_path llava_kv_768.pte                                                                                  \
    --prompt "\nWhat are the things I should be cautious about when I visit here?"  \
    --image_path image.pt                                                                                                \
    --temperature 0
```

imported-using-ghimport

Test Plan: Imported from OSS

Reviewed By: kirklandsign

Differential Revision: D61146432

Pulled By: larryliu0820
… and build script"


Add a `main.cpp` and CMakeLists.txt for llava runner. This runner takes in an image in the format of `.pt` (a serialized pytorch model) along with text prompt. It will generate text tokens in a way similar to llama runner.

Run `build.sh` to build the runner.

To serialize the image into a `.pt` file, run the following script:

```python
import torch
from torch import nn

copy = torch.tensor(resized)
m = nn.Module()
par = nn.Parameter(copy, requires_grad=False)
m.register_parameter("0",par)
tensors = torch.jit.script(m)
tensors.save("image.pt")
```

To run the runner, use the following command:

```
cmake-out/examples/models/llava/llava_main                                                               \    
    --tokenizer_path tokenizer.bin                                                                                    \
    --model_path llava_kv_768.pte                                                                                  \
    --prompt "\nWhat are the things I should be cautious about when I visit here?"  \
    --image_path image.pt                                                                                                \
    --temperature 0
```

Differential Revision: [D61146432](https://www.internalfb.com/diff/D61146432)

[ghstack-poisoned]
Add a `main.cpp` and CMakeLists.txt for llava runner. This runner takes in an image in the format of `.pt` (a serialized pytorch model) along with text prompt. It will generate text tokens in a way similar to llama runner.

Run `build.sh` to build the runner.

To serialize the image into a `.pt` file, run the following script:

```python
import torch
from torch import nn

copy = torch.tensor(resized)
m = nn.Module()
par = nn.Parameter(copy, requires_grad=False)
m.register_parameter("0",par)
tensors = torch.jit.script(m)
tensors.save("image.pt")
```

To run the runner, use the following command:

```
cmake-out/examples/models/llava/llava_main                                                               \    
    --tokenizer_path tokenizer.bin                                                                                    \
    --model_path llava_kv_768.pte                                                                                  \
    --prompt "\nWhat are the things I should be cautious about when I visit here?"  \
    --image_path image.pt                                                                                                \
    --temperature 0
```

Differential Revision: [D61146432](https://www.internalfb.com/diff/D61146432)

[ghstack-poisoned]
larryliu0820 added a commit that referenced this pull request Aug 16, 2024
kirklandsign pushed a commit to kirklandsign/executorch that referenced this pull request Aug 16, 2024
Summary:
Pull Request resolved: pytorch#4750

Pull Request resolved: pytorch#4667

Test Plan: Imported from OSS

Reviewed By: kirklandsign

Differential Revision: D61146432

Pulled By: larryliu0820
larryliu0820 merged commit 45e9f6b into main Aug 16, 2024
56 checks passed