Apple M4 Max: ~8 tok/s (Q4)

MiniMaxAI/MiniMax-M2.1 speed on Apple M4 Max

Quantization-specific throughput and VRAM requirements for MiniMaxAI/MiniMax-M2.1 running on Apple M4 Max.

Speed Snapshot

Topline estimate from compatibility data

Model: MiniMaxAI/MiniMax-M2.1

GPU: Apple M4 Max

Q4 speed: 8 tok/s

Q4 VRAM required: 128GB

Data Source

Calculation and benchmark status

Speed values come from the compatibility dataset (`estimatedTokensPerSec`) and are listed by quantization level (Q4, Q8, FP16).
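As a rough sketch of that ordering, assume each dataset record is a dictionary holding a quantization label and the `estimatedTokensPerSec` field. Only the field name `estimatedTokensPerSec` comes from this page; the record shape, the `quantization` key, and the `QUANT_ORDER` list are illustrative assumptions, not the dataset's documented schema.

```python
# Hypothetical record shape: only the field name estimatedTokensPerSec
# is taken from the page. Values are the ones listed for Apple M4 Max.
QUANT_ORDER = ["Q4", "Q8", "FP16"]  # assumed display order, smallest first

records = [
    {"quantization": "FP16", "estimatedTokensPerSec": 4},
    {"quantization": "Q4", "estimatedTokensPerSec": 8},
    {"quantization": "Q8", "estimatedTokensPerSec": 6},
]


def sort_by_quantization(rows):
    """Return rows ordered by their position in QUANT_ORDER."""
    return sorted(rows, key=lambda r: QUANT_ORDER.index(r["quantization"]))


for row in sort_by_quantization(records):
    print(f'{row["quantization"]}: {row["estimatedTokensPerSec"]} tok/s')
```

Sorting on an explicit order list rather than on the label string keeps Q4 ahead of Q8 and FP16, which a plain alphabetical sort would not.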

For full verdict logic and alternate GPUs, see the canonical compatibility page.

Quantization Speed Table

Quantization	VRAM needed	VRAM available	Speed	Verdict
Q4	128GB	128GB	8 tok/s	⚠️ Tight fit
Q8	256GB	128GB	6 tok/s	❌ Not recommended
FP16	512GB	128GB	4 tok/s	❌ Not recommended
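
The memory figures in the table double at each step, which is consistent with weights stored at 4, 8, and 16 bits. The sketch below scales the Q4 figure by that ratio and compares the result with available memory. The function names and the verdict thresholds are assumptions inferred from the three rows above; the actual verdict logic is described on the canonical compatibility page and may use different margins.

```python
# Bits per weight for each quantization level listed in the table.
BITS_PER_WEIGHT = {"Q4": 4, "Q8": 8, "FP16": 16}


def scaled_vram_gb(q4_vram_gb, quantization):
    """Scale the Q4 memory figure by the ratio of bits per weight."""
    return q4_vram_gb * BITS_PER_WEIGHT[quantization] // BITS_PER_WEIGHT["Q4"]


def verdict(needed_gb, available_gb):
    """Assumed rule: over budget is rejected, an exact match is a tight fit."""
    if needed_gb > available_gb:
        return "Not recommended"
    if needed_gb == available_gb:
        return "Tight fit"
    return "Fits"  # hypothetical label; no row in the table shows this case


AVAILABLE_GB = 128  # Apple M4 Max figure from the table
for quant in ("Q4", "Q8", "FP16"):
    needed = scaled_vram_gb(128, quant)
    print(f"{quant}\t{needed}GB\t{AVAILABLE_GB}GB\t{verdict(needed, AVAILABLE_GB)}")
```

This reproduces the VRAM and verdict columns only. The speed column comes from the dataset and is not derived here.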