Half-Quadratic Quantization of large machine learning models | Flume