Can your GPU run this model?

Use this index to open compatibility checks for a GPU and model pair. Each check reports quantization fit, VRAM requirements, and estimated speed.

Q4 fit focus
Estimated tokens/sec
Practical alternatives

Popular compatibility checks

Compatibility details available on page

More checks

Popular GPU compatibility pages

Popular model requirement pages

Compatibility FAQ

How do I know if my GPU can run an AI model?
Start with the compatibility page for your GPU and model pair. Focus on the Q4 verdict first, then review VRAM headroom and estimated tokens per second.

What does a tight fit mean in compatibility checks?
A tight fit means the model can run, but VRAM headroom is limited. You may need lower batch sizes, shorter context windows, or a lower-bit quantization tier.

If my GPU cannot run the model, what should I do?
Use the alternatives section on each page to find higher-VRAM GPUs, or open the model requirements page and choose a smaller model or lower-bit quantization.
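
The fit verdicts described in the answers above can be approximated with a rough back-of-the-envelope calculation. The sketch below is illustrative only and is not the method used by the compatibility pages: the function names, the 4.5 bits-per-weight figure for Q4, the 1.5 GB overhead allowance, and the 10% headroom threshold for a "tight fit" are all assumptions chosen for the example.

```python
def estimate_vram_gb(params_billion, bits_per_weight=4.5, overhead_gb=1.5):
    """Rough VRAM estimate in GB for a quantized model.

    Assumptions (hypothetical, for illustration):
    - bits_per_weight=4.5 approximates a Q4 quantization tier
    - overhead_gb is a flat allowance for KV cache and runtime buffers
    """
    # 1e9 parameters * (bits / 8) bytes each = that many decimal GB
    weights_gb = params_billion * bits_per_weight / 8
    return weights_gb + overhead_gb


def classify_fit(required_gb, gpu_vram_gb, tight_margin=0.10):
    """Classify a GPU/model pair by remaining VRAM headroom.

    tight_margin is an assumed threshold: headroom below this fraction
    of total VRAM is reported as a tight fit.
    """
    headroom_gb = gpu_vram_gb - required_gb
    if headroom_gb < 0:
        return "does not fit"
    if headroom_gb < tight_margin * gpu_vram_gb:
        return "tight fit"
    return "fits"


if __name__ == "__main__":
    required = estimate_vram_gb(7)  # a 7B-parameter model at the assumed Q4 size
    for vram in (4, 6, 8):
        print(f"{vram} GB GPU: {classify_fit(required, vram)}")
```

A real check would also account for context length and batch size, since both grow the KV cache, which is why a tight fit may call for shorter context windows or smaller batches.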