This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| tips:rocm [2025/12/29 07:36] โ [3\. ROCm (Optional but Recommended)] sscipioni | tips:rocm [2025/12/29 23:04] (current) โ [3\. ROCm (Optional but Recommended)] sscipioni | ||
|---|---|---|---|
| Line 43: | Line 43: | ||
| ```bash | ```bash | ||
| # Install essential ROCm packages | # Install essential ROCm packages | ||
| - | yay -S rocm-core amdgpu_top rocminfo | + | yay -S rocm-core amdgpu_top rocminfo |
| yay -S rocm-hip-sdk rocm-opencl-runtime | yay -S rocm-hip-sdk rocm-opencl-runtime | ||
| sudo usermod -a -G render, | sudo usermod -a -G render, | ||
| Line 139: | Line 139: | ||
| + | |||
| + | ====== lemonade-server ====== | ||
| + | |||
| + | <code | download> | ||
| + | yay -S lemonade-server | ||
| + | </ | ||
| + | |||
| + | oga-hybrid mode: this splits the work, the NPU handles the prefill (prompt processing), | ||
| + | <code | download> | ||
| + | lemonade-server run Qwen3-Coder-30B-A3B-Instruct-GGUF --recipe oga-hybrid --llamacpp rocm | ||
| + | </ | ||
| + | |||
| + | <code | download> | ||
| + | curl http:// | ||
| + | -H " | ||
| + | -d '{ | ||
| + | " | ||
| + | " | ||
| + | " | ||
| + | }' | ||
| + | curl http:// | ||
| + | </ | ||
| ====== Benchmark ====== | ====== Benchmark ====== | ||