# llama-server-binaries

**Repository Path**: cuigray/llama-server-binaries

## Basic Information

- **Project Name**: llama-server-binaries
- **Description**: llama.cpp server for Windows x64 + AVX-512 + Vulkan (r9), non-NVIDIA
- **Primary Language**: Unknown
- **License**: Not specified
- **Default Branch**: main
- **Homepage**: https://github.com/avdg/llama-server-binaries/releases/download/r9/llama-server.exe
- **GVP Project**: No

## Statistics

- **Stars**: 0
- **Forks**: 0
- **Created**: 2026-06-05
- **Last Updated**: 2026-06-06

## Categories & Tags

**Categories**: Uncategorized

**Tags**: None

## README

# llama-server-binaries

Compiled llama server binaries.

Please find the downloads at https://github.com/avdg/llama-server-binaries/releases. The release notes also include compilation information and references to the original code.

## Running the llama server from the command line

```
./llama-server.exe --model
```

Once the server is running, it should be available at http://localhost:8080/

Run `./llama-server.exe --help` to see the other available options.

The most important settings:

- To change the port the server listens on, add something like `--port 8081`
- To make the server available on the network, add `--host 0.0.0.0`
- To serve your own files from the built-in web server, add `--path `
- To manually select which GPU to run on, add `--main-gpu `
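
The settings above can be combined in a single command. The following is a minimal Python sketch, not part of this project, that assembles such a command from the flags named in this README (`--model`, `--port`, `--host`, `--path`, `--main-gpu`). The function names, the example model path `models/example.gguf`, and the assumption that `--main-gpu` takes an integer index are hypothetical and only serve as illustration.

```python
from typing import List, Optional


def build_server_command(
    model_path: str,
    port: Optional[int] = None,
    host: Optional[str] = None,
    static_dir: Optional[str] = None,
    main_gpu: Optional[int] = None,
    executable: str = "./llama-server.exe",
) -> List[str]:
    """Assemble an argument list using only the flags named in this README."""
    command = [executable, "--model", model_path]
    if port is not None:
        command += ["--port", str(port)]          # e.g. --port 8081
    if host is not None:
        command += ["--host", host]               # e.g. --host 0.0.0.0
    if static_dir is not None:
        command += ["--path", static_dir]         # files for the built-in web server
    if main_gpu is not None:
        command += ["--main-gpu", str(main_gpu)]  # assumed to be an integer index
    return command


def server_url(port: int = 8080, host: str = "localhost") -> str:
    """Return the address where the server should be reachable."""
    return f"http://{host}:{port}/"


if __name__ == "__main__":
    # Hypothetical model path; replace it with a real model file.
    cmd = build_server_command("models/example.gguf", port=8081, host="0.0.0.0")
    print(" ".join(cmd))
    print("Expected address:", server_url(port=8081))
```

The resulting list can be passed to `subprocess.run(cmd)` to start the server. Building the arguments as a list rather than one string avoids quoting problems when a path contains spaces.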