⚡ Model Quantization Size Estimator
Calculate compressed model size · FP32 · FP16 · BF16 · INT8 · INT4 · GPTQ · AWQ · GGUF
Parameter count
Unit
Billions (B)
Millions (M)
Original precision
FP32 (32-bit)
FP16 (16-bit)
BF16 (16-bit)
Target quantization
INT8 (8-bit)
INT4 (4-bit)
GPTQ-4bit
AWQ-4bit
GGUF Q4_K_M
GGUF Q5_K_M
GGUF Q8_0
📊 Calculate
Original size
—
FP32
Quantized size
—
GGUF Q4_K_M
Compression ratio
—
× smaller
Memory savings
—
% saved
Original
100%
—
Quantized
40%
—
Format
Bits
Size (GB)
Ratio vs FP32
Savings
* Sizes are estimates: 1B parameters = 10⁹. Actual overhead may vary.