Update on "[llava][21/N] Add llava runner test binary and build script"

larryliu0820 · larryliu0820 · commit 5b687ce35b4e · 2024-08-16T13:43:14.000-07:00
Add a `main.cpp` and CMakeLists.txt for llava runner. This runner takes in an image in the format of `.pt` (a serialized pytorch model) along with text prompt. It will generate text tokens in a way similar to llama runner. Run `build.sh` to build the runner. To serialize the image into a `.pt` file, run the following script: ```python import torch from torch import nn copy = torch.tensor(resized) m = nn.Module() par = nn.Parameter(copy, requires_grad=False) m.register_parameter("0",par) tensors = torch.jit.script(m) tensors.save("image.pt") ``` To run the runner, use the following command: ``` cmake-out/examples/models/llava/llava_main \ --tokenizer_path tokenizer.bin \ --model_path llava_kv_768.pte \ --prompt "\nWhat are the things I should be cautious about when I visit here?" \ --image_path image.pt \ --temperature 0 ``` Differential Revision: [D61146432](https://www.internalfb.com/diff/D61146432) [ghstack-poisoned]
diff --git a/.ci/scripts/test_llava.sh b/.ci/scripts/test_llava.sh
@@ -79,7 +79,7 @@ run_and_verify() {
     RESULT=$(cat result.txt)
     # set the expected prefix to be the same as prompt because there's a bug in sdpa_with_kv_cache that causes <unk> tokens.
     EXPECTED_PREFIX="ASSISTANT:"
-    if [[ "${RESULT}" == "${EXPECTED_PREFIX}"* ]]; then
+    if [[ "${RESULT}" == *"${EXPECTED_PREFIX}"* ]]; then
         echo "Expected result prefix: ${EXPECTED_PREFIX}"
         echo "Actual result: ${RESULT}"
         echo "Success"