EcoCompute.

"Quantization doesn't always save energy. See for yourself."

Every quantization tool tells you how to quantize. We tell you whether you should.

From the working paper “Weight-Only Quantization Does Not Always Save Energy” · under review at Sustainable Computing: Informatics and Systems (2 reviews received).

Inference energy per 1M tokens

Crossover curve · the bigger the model, the more quantization saves

NF4 INT8 Your model Above zero = penalty (more energy) · below = savings