localai.computer

Can AMD Instinct MI300X run MiniMaxAI/MiniMax-M2.1?

Runs Q4 | 192GB VRAM available | Requires 128GB+

AMD Instinct MI300X meets the minimum VRAM requirement for Q4 inference of MiniMaxAI/MiniMax-M2.1. Review the quantization breakdown below to see how higher precision settings impact VRAM and throughput.

What this means for you

AMD Instinct MI300X can run MiniMaxAI/MiniMax-M2.1 with Q4 quantization. At an estimated 98 tokens/second (97.67 tok/s in the quantization breakdown), generation is fast enough for interactive use.

At Q4 you have 64GB of headroom (192GB available minus 128GB required), which is sufficient for system overhead and smooth operation.
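The headroom figure is simple subtraction, and a minimal sketch makes the arithmetic explicit. The function name `vram_headroom_gb` is hypothetical and chosen only for illustration; the 192GB and 128GB inputs are the figures quoted on this page.

```python
def vram_headroom_gb(available_gb: float, required_gb: float) -> float:
    """Return spare VRAM in GB; a negative value means the model does not fit."""
    return available_gb - required_gb


# Figures quoted on this page for the MI300X at Q4.
available_gb = 192
required_gb = 128

headroom_gb = vram_headroom_gb(available_gb, required_gb)
headroom_share = headroom_gb / available_gb  # fraction of total VRAM left free

print(f"Headroom: {headroom_gb} GB ({headroom_share:.0%} of total VRAM)")
```

A negative result from the same function would indicate that the chosen quantization needs more memory than the card provides.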

Quantization breakdown

Quantization	VRAM needed	VRAM available	Estimated speed	Verdict
Q4	128GB	192GB	97.67 tok/s	✅ Fits comfortably
Q8	256GB	192GB	63.39 tok/s	❌ Not recommended
FP16	512GB	192GB	36.72 tok/s	❌ Not recommended
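The VRAM column doubles with each step up in precision, which is consistent with memory scaling linearly in bits per weight. The sketch below reproduces that pattern from the Q4 figure. It assumes nominal widths of 4, 8, and 16 bits per weight and ignores KV cache, activations, and per-block quantization overhead, so treat it as a rough estimate rather than the method this site uses. The names `BITS_PER_WEIGHT`, `scaled_vram_gb`, and `fits` are hypothetical.

```python
# Nominal bits per weight for each setting (an assumption; real quantization
# formats often store slightly more than the nominal width).
BITS_PER_WEIGHT = {"Q4": 4, "Q8": 8, "FP16": 16}


def scaled_vram_gb(base_gb: float, base_quant: str, target_quant: str) -> float:
    """Scale a known VRAM requirement linearly by bits per weight."""
    return base_gb * BITS_PER_WEIGHT[target_quant] / BITS_PER_WEIGHT[base_quant]


def fits(required_gb: float, available_gb: float) -> bool:
    """True when the requirement does not exceed the available VRAM."""
    return required_gb <= available_gb


AVAILABLE_GB = 192  # MI300X VRAM as listed in the table
Q4_REQUIRED_GB = 128  # Q4 requirement as listed in the table

estimates = {q: scaled_vram_gb(Q4_REQUIRED_GB, "Q4", q) for q in BITS_PER_WEIGHT}

for quant, need_gb in estimates.items():
    verdict = "fits" if fits(need_gb, AVAILABLE_GB) else "does not fit"
    print(f"{quant}: {need_gb:.0f} GB needed, {verdict} in {AVAILABLE_GB} GB")
```

Under these assumptions only Q4 fits on a single 192GB card, which matches the verdict column.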

Suitable alternatives

NVIDIA H200 SXM 141GB

141GB

78.93 tok/s

Price: —

AMD Instinct MI250X

128GB

60.89 tok/s

Price: —

192GB

13.96 tok/s

Price: $5,999.00

128GB

7.93 tok/s

Price: —

128GB

6.95 tok/s

Price: $3,999.00
