diff --git a/docs/server.md b/docs/server.md index 030c591bf..6a6988b69 100644 --- a/docs/server.md +++ b/docs/server.md @@ -61,7 +61,7 @@ You'll first need to download one of the available multi-modal models in GGUF fo Then when you run the server you'll need to also specify the path to the clip model used for image embedding and the `llava-1-5` chat_format ```bash -python3 -m llama_cpp.server --model --clip-model-path --chat-format llava-1-5 +python3 -m llama_cpp.server --model --clip_model_path --chat_format llava-1-5 ``` Then you can just use the OpenAI API as normal @@ -88,4 +88,4 @@ response = client.chat.completions.create( ], ) print(response) -``` \ No newline at end of file +```