⚡ Model Quantization Size Estimator

Calculate compressed model size · FP32 · FP16 · BF16 · INT8 · INT4 · GPTQ · AWQ · GGUF

Original size
FP32
Quantized size
GGUF Q4_K_M
Compression ratio
× smaller
Memory savings
% saved
Original
100%
Quantized
40%
FormatBitsSize (GB)Ratio vs FP32Savings
* Sizes are estimates: 1B parameters = 10⁹. Actual overhead may vary.