Skip to content

Commit 9cd402c

Browse files
committed
tests : add test-model-random
This generates random models and then tests different concurrencies of batches to check if the output is consistent. This can detect when e.g. the recurrent cache has been broken, or anything else which would affect the consistency of the output when inferencing multiple distinct sequences. More architectures will be added, but for now this starts with Mamba. Eventually, consistency of pooled embeddings will also be tested. The goal is to reduce accidental regressions by making it easy to quickly test a lot of edge cases on the supported architectures, without having to download any model.
1 parent 2e89f76 commit 9cd402c

File tree

2 files changed

+921
-0
lines changed

2 files changed

+921
-0
lines changed

tests/CMakeLists.txt

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -193,6 +193,7 @@ endif()
193193
# llama_build_and_test(test-opt.cpp) # SLOW
194194
llama_build_and_test(test-gguf.cpp)
195195
llama_build_and_test(test-backend-ops.cpp)
196+
llama_build_and_test(test-model-random.cpp)
196197

197198
llama_build_and_test(test-model-load-cancel.cpp LABEL "model")
198199
llama_build_and_test(test-autorelease.cpp LABEL "model")

0 commit comments

Comments
 (0)