Q415GB VRAM minimum

lmstudio-community/Qwen3-Coder-30B-A3B-Instruct-MLX-4bit Q4 VRAM Requirements

This page answers lmstudio-community/Qwen3-Coder-30B-A3B-Instruct-MLX-4bit q4 quantization queries with explicit calculations from our model requirement dataset and compatibility speed table.

Requirement Snapshot

Current quantization-specific requirement breakdown

Selected quantizationQ4

Minimum VRAM15GB

Q4 baseline15GB

Q8 baseline31GB

FP16 baseline61GB

Methodology

No hand-wavy numbers

Exact Q4 requirement from model requirement data.

Throughput data below uses available compatibility measurements/estimates and is sorted by tokens per second for this model.

Need general guidance? Review full methodology.

Best GPUs for lmstudio-community/Qwen3-Coder-30B-A3B-Instruct-MLX-4bit (Q4)

GPU	VRAM	Quantization	Speed	Compatibility
AMD Instinct MI300X	192GB	Q4	426 tok/s	View full compatibility
NVIDIA H200 SXM 141GB	141GB	Q4	342 tok/s	View full compatibility
NVIDIA H100 SXM5 80GB	80GB	Q4	282 tok/s	View full compatibility
AMD Instinct MI250X	128GB	Q4	237 tok/s	View full compatibility
RTX 5090	32GB	Q4	179 tok/s	View full compatibility
NVIDIA H100 PCIe 80GB	80GB	Q4	176 tok/s	View full compatibility
NVIDIA A100 80GB SXM4	80GB	Q4	162 tok/s	View full compatibility
AMD Instinct MI210	64GB	Q4	138 tok/s	View full compatibility
NVIDIA A100 40GB PCIe	40GB	Q4	136 tok/s	View full compatibility
NVIDIA RTX 6000 Ada	48GB	Q4	101 tok/s	View full compatibility
NVIDIA L40	48GB	Q4	95 tok/s	View full compatibility
NVIDIA L40S	48GB	Q4	93 tok/s	View full compatibility

Back to lmstudio-community/Qwen3-Coder-30B-A3B-Instruct-MLX-4bit model page Full hardware requirements