This shows you the differences between two versions of the page.
| tips:llm [2025/12/20 17:49] – sscipioni | tips:llm [2025/12/26 15:35] (current) – sscipioni | ||
|---|---|---|---|
| Line 23: | Line 23: | ||
| | A100 40GB | 156.7 | 45.2 | | | | A100 40GB | 156.7 | 45.2 | | | ||
| | M3 Max 128GB | 34.8 | 4.2 | | | | M3 Max 128GB | 34.8 | 4.2 | | | ||
| - | | Strix Halo 128GB | | 5.1 | 85.02 | | + | | Strix Halo 128GB ollama |
| + | | Strix Halo 128GB llama.cpp | | | 90 | | ||
| | RTX 3060 | | | 131.76 | | | RTX 3060 | | | 131.76 | | ||
| Line 41: | Line 42: | ||
| | qwen2.5: | | qwen2.5: | ||
| | llama3.3: | | llama3.3: | ||
| + | | functiongemma | ||
| + | | danielsheep/ | ||
| + | | gpt-oss: | ||