Search code examples
Why does TFLite INT8 quantization decompose BatchMatMul (from Einsum) into many FullyConnected layer...


tensorflow, onnx, quantization, tflite, einsum

Read More
Straight-Through estimation for vector quantization inside a recurrent neural network...


tensorflow, quantization

Read More
Quantize Image using PIL and numpy...


image-processing, python-imaging-library, quantization

Read More
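The PIL/NumPy question above boils down to nearest-palette mapping: pick a small set of colors and send every pixel to its closest one. A minimal NumPy-only sketch of that idea (the palette and the 2x2 test image are made up for illustration; PIL's own `Image.quantize(colors=...)` builds the palette for you via median cut):

```python
import numpy as np

def quantize_to_palette(img, palette):
    """Map each RGB pixel to the nearest palette color (Euclidean distance).

    img:     (H, W, 3) uint8 array
    palette: (K, 3) uint8 array of palette colors
    Returns (H, W) palette indices and the quantized image.
    """
    pixels = img.reshape(-1, 3).astype(np.int32)            # (H*W, 3)
    pal = palette.astype(np.int32)                          # (K, 3)
    # Squared distance from every pixel to every palette entry.
    d2 = ((pixels[:, None, :] - pal[None, :, :]) ** 2).sum(axis=2)
    idx = d2.argmin(axis=1)                                 # nearest color index
    quantized = palette[idx].reshape(img.shape)
    return idx.reshape(img.shape[:2]), quantized

# Tiny made-up image and a black/white/red palette.
img = np.array([[[250, 250, 250], [5, 5, 5]],
                [[200, 10, 10], [120, 120, 120]]], dtype=np.uint8)
palette = np.array([[0, 0, 0], [255, 255, 255], [255, 0, 0]], dtype=np.uint8)
idx, q = quantize_to_palette(img, palette)
```

The brute-force distance matrix is O(pixels × palette), which is fine for small palettes; PIL uses faster internal structures.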
RuntimeError: CUDA error: named symbol not found when using TorchAoConfig with Qwen2.5-VL-7B-Instruc...


python, pytorch, huggingface-transformers, huggingface, quantization

Read More
What is the difference, if any, between model.half() and model.to(dtype=torch.float16) in huggingfac...


python, huggingface-transformers, huggingface, quantization, half-precision-float

Read More
How to Load a 4-bit Quantized VLM Model from Hugging Face with Transformers?...


python, nlp, huggingface-transformers, huggingface, quantization

Read More
How to quantize a HF safetensors model and save it to llama.cpp GGUF format with less than q8_0 quan...


large-language-model, huggingface, quantization, llamacpp

Read More
Why are model_q4.onnx and model_q4f16.onnx not 4 times smaller than model.onnx?...


deep-learning, large-language-model, huggingface, onnx, quantization

Read More
HuggingFace - 'optimum' ModuleNotFoundError...


python, huggingface-transformers, quantization, modulenotfounderror, pruning

Read More
Does static quantization enable the model to feed a layer with the output of the previous one, witho...


neural-network, artificial-intelligence, onnx, quantization, static-quantization

Read More
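On the static-quantization question above: in many int8 runtimes the answer is yes — one layer's int8 output feeds the next directly, because the int32 accumulator is rescaled to the next layer's int8 grid with a single combined multiplier rather than a float dequantize/requantize round trip. A hedged NumPy sketch of that requantization step (the scales here are made up; real runtimes derive them from calibration and typically use fixed-point multipliers, and zero-points are fixed at 0 to keep the sketch short):

```python
import numpy as np

def quantize(x, scale, zp):
    """Affine-quantize float x to int8 with a given scale/zero-point."""
    return np.clip(np.round(x / scale) + zp, -128, 127).astype(np.int8)

def int8_linear(x_q, w_q, x_scale, w_scale, out_scale):
    """One fully connected layer done entirely in integers.

    The int32 accumulator is rescaled straight to the output's int8
    grid -- no intermediate float tensor is materialized.
    """
    acc = x_q.astype(np.int32) @ w_q.astype(np.int32)   # int32 accumulator
    requant = acc * (x_scale * w_scale / out_scale)     # one combined multiplier
    return np.clip(np.round(requant), -128, 127).astype(np.int8)

x = np.array([0.5, -0.25, 1.0], dtype=np.float32)
w = np.array([[1.0], [2.0], [-1.0]], dtype=np.float32)

x_q = quantize(x, scale=0.01, zp=0)   # illustrative, not calibrated, scales
w_q = quantize(w, scale=0.02, zp=0)
y_q = int8_linear(x_q, w_q, x_scale=0.01, w_scale=0.02, out_scale=0.02)
y = y_q.astype(np.float32) * 0.02     # dequantize only at the very end
```

The float result of this layer is 0.5·1 − 0.25·2 + 1·(−1) = −1.0, and the integer pipeline recovers it exactly here because the toy scales divide evenly; in general there is a small requantization error.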
Speeding up load time of LLMs...


huggingface-transformers, large-language-model, quantization

Read More
Llama QLora error: Target modules ['query_key_value', 'dense', 'dense_h_to_4h&#3...


python, quantization, large-language-model, peft

Read More
How to set training=False for keras-model/layer outside of the __call__ method?...


tensorflow, keras, transfer-learning, quantization, tfmot

Read More
Difference between gguf and lora...


large-language-model, quantization, peft

Read More
Quantization 4 bit and 8 bit - error in 'quantization_config'...


gpu, local, large-language-model, quantization, 8-bit

Read More
Quantization and torch_dtype in huggingface transformer...


huggingface-transformers, huggingface, quantization

Read More
Image quantization with Numpy...


python, numpy, quantization

Read More
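For the NumPy image-quantization question, the usual starting point is uniform level reduction: bin each pixel value and map it to the center of its bin. A small sketch (the 4-level example values are illustrative):

```python
import numpy as np

def uniform_quantize(img, levels):
    """Reduce a uint8 image to `levels` evenly spaced gray values."""
    step = 256 / levels
    bins = np.floor(img / step)          # which interval each pixel falls in
    # Map each bin back to the center of its interval, stay in uint8 range.
    return np.clip(bins * step + step / 2, 0, 255).astype(np.uint8)

img = np.array([[0, 60, 130, 255]], dtype=np.uint8)
q = uniform_quantize(img, levels=4)      # 4 gray levels: 32, 96, 160, 224
```

This is purely vectorized, so it scales to full-size images; non-uniform schemes (histogram- or k-means-based) only change how the bin edges are chosen.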
jpeg python 8x8 window DCT and quantisation process...


python, huffman-code, quantization, dct

Read More
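For the JPEG DCT/quantisation question, the lossy step per 8x8 block is: level-shift by 128, apply a 2-D DCT, divide element-wise by a quantization table, and round. A NumPy sketch using the widely quoted Annex-K luminance table (Huffman coding, which the question also tags, is a separate lossless step not shown here):

```python
import numpy as np

def dct_matrix(n=8):
    """Orthonormal DCT-II basis matrix, as applied to 8x8 JPEG blocks."""
    k = np.arange(n)
    C = np.cos(np.pi * (2 * k[None, :] + 1) * k[:, None] / (2 * n))
    C[0] *= 1 / np.sqrt(2)
    return C * np.sqrt(2 / n)

# Standard JPEG luminance quantization table (quality ~50).
Q50 = np.array([
    [16, 11, 10, 16,  24,  40,  51,  61],
    [12, 12, 14, 19,  26,  58,  60,  55],
    [14, 13, 16, 24,  40,  57,  69,  56],
    [14, 17, 22, 29,  51,  87,  80,  62],
    [18, 22, 37, 56,  68, 109, 103,  77],
    [24, 35, 55, 64,  81, 104, 113,  92],
    [49, 64, 78, 87, 103, 121, 120, 101],
    [72, 92, 95, 98, 112, 100, 103,  99],
])

def jpeg_block(block, q=Q50):
    """Level-shift, 2-D DCT, divide by the table, round -- the lossy step."""
    C = dct_matrix(8)
    coeffs = C @ (block.astype(np.float64) - 128) @ C.T
    return np.round(coeffs / q).astype(np.int64)

block = np.full((8, 8), 128, dtype=np.uint8)
qcoeffs = jpeg_block(block)   # flat block -> every coefficient quantizes to 0
```

A flat block produces only a DC coefficient, and high-frequency entries are divided by large table values, which is exactly where JPEG discards detail.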
What's an elegant way to avoid "hopping" quantization errors when graphing a divergent...


c++, qt, sampling, graphing, quantization

Read More
There exists ONNX or Tensorflow CNN 4-bit quantized models available?...


tensorflow, keras, onnx, quantization

Read More
What is the mathematical definition of the quantile transformation in xgboost.QuantileDMatrix?...


python, machine-learning, xgboost, quantile, quantization

Read More
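xgboost's QuantileDMatrix actually builds histograms with a weighted quantile sketch, but the underlying idea — cut points at evenly spaced quantiles, each feature value mapped to a bin index — can be shown in plain NumPy (this illustrates the binning concept only, not xgboost's exact algorithm):

```python
import numpy as np

def quantile_bins(x, n_bins):
    """Cut points at evenly spaced quantiles; each value maps to a bin index."""
    # Interior quantiles only: n_bins bins need n_bins - 1 cut points.
    qs = np.quantile(x, np.linspace(0, 1, n_bins + 1)[1:-1])
    return np.searchsorted(qs, x, side="left"), qs

x = np.array([1.0, 2.0, 3.0, 4.0, 100.0])
bins, cuts = quantile_bins(x, n_bins=4)
```

Note how the outlier 100.0 lands in the last bin without stretching the others — the point of quantile-based (rather than uniform) binning.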
Quantizing normally distributed floats in Python and NumPy...


python, numpy, floating-point, k-means, quantization

Read More
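For quantizing normally distributed floats, 1-D k-means (Lloyd's algorithm) converges toward the Lloyd-Max quantizer, which places levels densely where the density is high. A sketch (quantile initialization is one common choice, not the only one, and the code assumes no cluster empties out — which holds for well-spread initial centers on continuous data):

```python
import numpy as np

def lloyd_max_1d(x, k, iters=50):
    """Lloyd's algorithm in 1-D: alternate assignment and centroid updates."""
    centers = np.quantile(x, (np.arange(k) + 0.5) / k)  # spread initial codebook
    for _ in range(iters):
        edges = (centers[:-1] + centers[1:]) / 2        # decision boundaries
        assign = np.searchsorted(edges, x)              # nearest-center labels
        centers = np.array([x[assign == j].mean() for j in range(k)])
    return centers, assign

rng = np.random.default_rng(0)
x = rng.normal(size=10_000)
centers, assign = lloyd_max_1d(x, k=4)
# For N(0, 1) the 4-level optimum is near ±0.45 and ±1.51.
```

Unlike uniform quantization, the reconstruction levels crowd around 0 where most of the probability mass sits, which is what minimizes mean squared error.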
Tensorflow quantization process in detail - nobody talks about this in detail...


python, tensorflow, tensorflow2.0, tensorflow-lite, quantization

Read More
ValueError: Unsupported ONNX opset version: 13...


python, pytorch, onnx, quantization, onnxruntime

Read More
NeuQuant.js (JavaScript color quantization) hidden bug in JS conversion...


javascript, neural-network, quantization

Read More
How to quantize inputs and outputs of optimized tflite model...


python, tensorflow-lite, quantization, google-coral

Read More
torch Parameter grad return none...


python, deep-learning, pytorch, quantization

Read More
How do you find the quantization parameter inside of the ONNX model resulted in converting already q...


onnx, yolov5, quantization, tf2onnx

Read More
Why are some nn.Linear layers not quantized by Pytorch?...


pytorch, quantization, static-quantization

Read More
Method to quantize a range of values to keep precision when signficant outliers are present in the d...


python, precision, outliers, quantization, data-transform

Read More
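For the outlier question, a common trick is to clip at inner percentiles before uniform quantization, so a few extreme values don't inflate the step size and wash out the inliers. A sketch (the 1st/99th percentile cutoffs are an arbitrary illustrative choice; pick them from your data):

```python
import numpy as np

def quantize_clipped(x, bits=8, lo_pct=1, hi_pct=99):
    """Clip to inner percentiles so outliers don't blow up the step size,
    then quantize uniformly to 2**bits levels."""
    lo, hi = np.percentile(x, [lo_pct, hi_pct])
    levels = 2 ** bits - 1
    scale = (hi - lo) / levels
    q = np.round((np.clip(x, lo, hi) - lo) / scale).astype(np.uint8)
    return q, lo, scale

def dequantize(q, lo, scale):
    return q * scale + lo

# 98 inliers in [0, 1] plus two extreme outliers.
x = np.concatenate([np.linspace(0, 1, 98), [1000.0, -1000.0]])
q, lo, scale = quantize_clipped(x)
x_hat = dequantize(q, lo, scale)
```

The inliers are reconstructed to within half a quantization step, while the outliers saturate at the clip boundaries — the trade-off this approach makes deliberately.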