devops : only build some specific targets for full image #9218

ngxson · 2024-08-28T09:44:34Z

Normally, the docker full-* image tag build all the targets. However, the entrypoint tools.sh only handle 3 tools: cli, server and quantize

This makes the build very wasteful, since most of the build time and storage space ended up not being used. Some CI run also breaks due to out of space (ref: https://github.com/ggerganov/llama.cpp/actions/runs/10584764326/job/29329766919)

This PR fixes the issue by only build targets used by tools.sh

I have read the contributing guidelines
Self-reported review complexity:
- Low
- Medium
- High

ggerganov · 2024-08-28T10:20:07Z

AFAIU #9213 should reduce the build size dramatically because of linking ggml and llama dynamically.

ngxson · 2024-08-30T12:20:41Z

@ggerganov @slaren The current dockerfile introduced in #9213 only build llama-cli target for full-cuda image. I added llama-quantize and llama-server as required by tools.sh

Please let me know if this is OK. Thank you.

slaren · 2024-08-30T13:16:45Z

That was a mistake, I forgot to remove the --target.

ngxson · 2024-08-30T14:25:48Z

@slaren No problem! In fact, I just want to ask if it's ok to build only these 3 targets for full image (that's contrary to what you wanted to do, which is to build all binaries by removing --target)

For now, tools.sh does not provide entrypoint for other binaries like llama-bench or llama-perplexity. So, user can't access them without overriding --entrypoint in the docker run command. I'm not sure if anyone does that, though.

slaren · 2024-08-30T14:39:02Z

I have no idea who uses the full image, however with dynamic linking each executable should be <1MB, and building all of them will only increase the size of the image slightly (in my local build, the entire bin directory of a full build is only 32MB). Considering that the image is ~10GB, I don't see a good reason to not just build everything, even if only to avoid breaking it for people who depend on that.

ngxson · 2024-08-30T14:55:00Z

@slaren Hmm ok that makes sense. I think the better way would be to provide entrypoint to other tools via tools.sh. I'll do that in another PR then. Thanks for the response!

Maybe we should also bring dynamic link to full-rocm and full image.

devops : only build specific targets for full image

6aaa183

ngxson requested a review from ggerganov August 28, 2024 09:44

add comment

7a383db

github-actions bot added the devops improvements to build systems and github actions label Aug 28, 2024

mofosyne added the Review Complexity : Low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix label Aug 30, 2024

Merge branch 'master' into xsn/full_image_less

2e15e0f

ggerganov approved these changes Aug 30, 2024

View reviewed changes

ngxson closed this Aug 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

devops : only build some specific targets for full image #9218

devops : only build some specific targets for full image #9218

Uh oh!

ngxson commented Aug 28, 2024 •

edited

Loading

Uh oh!

ggerganov commented Aug 28, 2024

Uh oh!

ngxson commented Aug 30, 2024

Uh oh!

slaren commented Aug 30, 2024

Uh oh!

ngxson commented Aug 30, 2024

Uh oh!

slaren commented Aug 30, 2024

Uh oh!

ngxson commented Aug 30, 2024 •

edited

Loading

Uh oh!

Uh oh!

devops : only build some specific targets for full image #9218

devops : only build some specific targets for full image #9218

Uh oh!

Conversation

ngxson commented Aug 28, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ggerganov commented Aug 28, 2024

Uh oh!

ngxson commented Aug 30, 2024

Uh oh!

slaren commented Aug 30, 2024

Uh oh!

ngxson commented Aug 30, 2024

Uh oh!

slaren commented Aug 30, 2024

Uh oh!

ngxson commented Aug 30, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

ngxson commented Aug 28, 2024 •

edited

Loading

ngxson commented Aug 30, 2024 •

edited

Loading