llama server embedding

Download the model from Hugging Face: nomic-ai/nomic-embed-text-v1.5-GGUF

Config

models:
  "embedding":
    unlisted: true
    cmd: |
      /path/to/llama-server-latest --port ${PORT}
      -m /models/nomic-embed-text-v1.5.Q8_0.gguf
      --ctx-size 8192
      --batch-size 8192
      --rope-scaling yarn
      --rope-freq-scale 0.75
      -ngl 99
      --embeddings
      --no-mmap

Testing Model

$ curl -s 10.0.1.50:8080/v1/embeddings \
    -X POST \
    -H "Content-Type: application/json" \
    -d '{"model": "embedding", "input": "the text to embed"}' | jq .data
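
The same request can be made from a script. The following is a minimal Python sketch using only the standard library. It assumes the server address from the curl example (10.0.1.50:8080) and an OpenAI-style response in which each entry under "data" has "index" and "embedding" fields. The helper names (build_payload, extract_vectors, embed, cosine) are made up for illustration and are not part of llama-server.

```python
import json
import math
import urllib.request

# Assumption: same host and port as the curl example above.
BASE_URL = "http://10.0.1.50:8080"


def build_payload(texts, model="embedding"):
    """Build the JSON body; "embedding" is the model name from the config."""
    return {"model": model, "input": texts}


def extract_vectors(body):
    """Return the embedding vectors ordered by their "index" field.

    Assumes an OpenAI-style response: {"data": [{"index": 0, "embedding": [...]}]}.
    """
    items = sorted(body["data"], key=lambda item: item["index"])
    return [item["embedding"] for item in items]


def embed(texts, base_url=BASE_URL):
    """POST the texts to /v1/embeddings and return one vector per input."""
    request = urllib.request.Request(
        f"{base_url}/v1/embeddings",
        data=json.dumps(build_payload(texts)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return extract_vectors(json.load(response))


def cosine(a, b):
    """Cosine similarity between two vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)
```

For example, `q, d = embed(["search_query: what is a GGUF file?", "search_document: GGUF is a model file format."])` followed by `cosine(q, d)` gives a similarity score for the pair. The "search_query: " and "search_document: " prefixes follow the task-prefix convention described on the Nomic model card; check the card for the prefix that matches your use case.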
