User Tools

Site Tools


tips:llm

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
tips:llm [2026/02/15 08:29] sscipionitips:llm [2026/04/15 11:25] (current) sscipioni
Line 39: Line 39:
 | qwen3-coder-next  | completion tools | "79.7B"   | 262144    | "Q4_K_M" | 33.06 | 380.21 | | qwen3-coder-next  | completion tools | "79.7B"   | 262144    | "Q4_K_M" | 33.06 | 380.21 |
 | qwen2.5-coder:14b-instruct-q4_K_M  | completion tools insert | "14.8B"   | 32768    | "Q4_K_M" | 17.25 | 527.74 | | qwen2.5-coder:14b-instruct-q4_K_M  | completion tools insert | "14.8B"   | 32768    | "Q4_K_M" | 17.25 | 527.74 |
 +| gemma4:latest  | completion vision audio tools thinking | "8.0B"   | 131072    | "Q4_K_M" | 50.15 | 1704.89 |
 +| gemma4:e2b  | completion vision audio tools thinking | "5.1B"   | 131072    | "Q4_K_M" | 83.07 | 2799.72 |
 +
 +NVIDIA GeForce RTX 3060
 +^ model                  ^ capabilities             ^ size     ^ context  ^ quantization                                                                      ^ eval rate [token/s]  ^ prompt eval rate [token/s]  ^
 +| gemma4:e2b  | completion vision audio tools thinking | "5.1B"   | 131072    | "Q4_K_M" | 102.44 | 4202.89 |
 +
  
  
tips/llm.1771140583.txt.gz · Last modified: by sscipioni