```sh
llama-server -hf ggml-org/embeddinggemma-300M-GGUF --embeddings
```
Then the endpoint can be accessed at http://localhost:8080/embedding, for example using curl: