ci : add LoRA test to CI #2650

Merged: 5 commits merged into master from lora-ci on Aug 27, 2023

Conversation

@slaren (Member) commented on Aug 18, 2023

Downloads a LoRA trained on shakespeare.txt and compares the perplexity on this dataset with and without applying the LoRA.

Only for the 3B f16 model currently; if it looks OK, I can try training another LoRA for 7B, and possibly add tests for quantized models.

Fixes #2634
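
For reference, a minimal sketch of the kind of comparison the test performs, assuming the usual llama.cpp `perplexity` tool; the model and adapter file names below are placeholders, not the exact ones used in ci/run.sh:

```bash
# Baseline perplexity of the f16 model on shakespeare.txt
./bin/perplexity -m models/open-llama-3b-f16.gguf -f shakespeare.txt -c 128 -b 128

# Perplexity with the LoRA adapter applied; since the adapter was trained on
# this exact text, the perplexity should be noticeably lower than the baseline
./bin/perplexity -m models/open-llama-3b-f16.gguf -f shakespeare.txt -c 128 -b 128 \
    --lora shakespeare-lora.bin
```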

@slaren (Member, Author) commented on Aug 18, 2023

It would be good to test LoRA with quantized models as well, both with and without an f16 --lora-base model, but it looks like we are already very close to the 20-minute time limit. Can we do anything about that?
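
To make the two cases concrete, here is roughly what the two quantized runs would look like (`--lora` and `--lora-base` are existing llama.cpp options; the file names are illustrative):

```bash
# LoRA applied directly on top of the quantized weights
./bin/perplexity -m models/open-llama-3b-q4_0.gguf -f shakespeare.txt \
    --lora shakespeare-lora.bin

# LoRA applied against the f16 base weights via --lora-base, which is slower
# but should give results closer to applying the LoRA to the f16 model
./bin/perplexity -m models/open-llama-3b-q4_0.gguf -f shakespeare.txt \
    --lora shakespeare-lora.bin --lora-base models/open-llama-3b-f16.gguf
```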

@ggerganov (Member) commented on Aug 18, 2023

I can easily increase it, but the runs will eventually start to take forever as we add more tests.

I can deploy more nodes, so another solution is to split the tests into groups and have different nodes run different groups. To do that, we have to make run.sh check an env variable to determine which group the node is serving.

Here are the current env variables on the CUDA node for example:

https://github.com/ggml-org/ci/tree/results/llama.cpp/f6/03b287bec853b69f6e963377626f26ec560d92/ggml-4-x86-cuda-v100#environment

We can add GG_BUILD_GROUP and use it in the script to run or skip tests.
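
A hypothetical sketch of how this could look in ci/run.sh; GG_BUILD_GROUP does not exist yet, and the group names and test names below are only illustrative:

```bash
# Run a test set only if this node serves that group (or no group is set)
run_group() {
    local group=$1
    [ -z "${GG_BUILD_GROUP}" ] || [ "${GG_BUILD_GROUP}" = "${group}" ]
}

if run_group main; then
    test $ret -eq 0 && gg_run ctest_debug
    test $ret -eq 0 && gg_run ctest_release
fi

if run_group models; then
    test $ret -eq 0 && gg_run open_llama_3b_v2
fi
```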

@slaren (Member, Author) commented on Aug 18, 2023

I have added a test with q8_0 only for now; hopefully it is not too slow. This is CPU-only, since the CUDA backend only supports LoRA with f16 models.

slaren marked this pull request as ready for review on August 18, 2023 17:04
@slaren (Member, Author) commented on Aug 18, 2023

Looks like it didn't time out. This should be good enough for now; we can add the rest of the quantized models once we figure out the build groups.

Some things to review:

  • I am not sure if I am following the naming convention of the files very well
  • I noticed that the 7B CUDA perplexity tests use -t 1 but the generation tests do not, so I added it to those as well (see the sketch below)
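
For illustration, assuming the generation tests call ./bin/main with full GPU offload like the perplexity tests do, the change is just appending -t 1, presumably so only one CPU thread is kept busy while the work runs on the GPU; the actual command lines in ci/run.sh may differ:

```bash
# before (hypothetical): generation test without an explicit thread count
./bin/main -m ${model_f16} -p "I believe the meaning of life is" -n 64 -ngl 999

# after: pin the CPU side to a single thread, matching the perplexity tests
./bin/main -m ${model_f16} -p "I believe the meaning of life is" -n 64 -ngl 999 -t 1
```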

@ggerganov (Member) commented
Let's update this PR after the #2398 merge and after updating the convert-lora-to-ggml.py script to export .gguf.

@ggerganov (Member) commented
I've bumped the CI timeout to 30 minutes.

For now, we can keep just the F16 and Q8_0 LoRAs, as I think this covers a large portion of the functionality and keeps the time slot small. Will merge this if the CI passes.

ggerganov merged commit 789c8c9 into master on Aug 27, 2023
ggerganov deleted the lora-ci branch on August 27, 2023 07:03
akawrykow pushed a commit to akawrykow/llama.cpp that referenced this pull request Aug 29, 2023
* ci : add lora test

ggml-ci

* move lora summary to the top, add lora logs

ggml-ci

* ci : decrease CPU ppl runs to 2 to avoid 20 min timeout

ggml-ci

* add 7b lora test

use 1 thread for CUDA generation tests

ggml-ci

* add test with q8_0 (cpu only)

ggml-ci

---------

Co-authored-by: Georgi Gerganov <[email protected]>